You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
🍱 We are excited to share with you that we have released BentoML v1.2, the biggest release since the launch of v1.0. This release includes improvements from all the learning and feedback from our community over the past year. We invite you to read our release blog post for a comprehensive overview of the new features and the motivations behind their development.
Here are a few key points to note before we delve into the new features:
v1.2 ensures complete backward compatibility, meaning that Bentos built with v1.1 will continue to function seamlessly with this release.
We remain committed to supporting v1.1. Critical bug fixes and security updates will be backported to the v1.1 branch.
BentoML documentation has been updated with examples and guides for v1.2. More guides are being added every week.
BentoCloud is fully equipped to handle deployments from both v1.1 and v1.2 releases of BentoML.
⛏️ Introduced a simplified service SDK to empower developers with greater control and flexibility.
Simplified the service and API interfaces as Python classes, allowing developers to add custom logic and use third party libraries flexibly with ease.
Introduced @bentoml.service and @bentoml.api decorators to customize the behaviors of services and APIs.
Moved configuration from YAML files to the service decorator @bentoml.service next to the class definition.
See the vLLM example demonstrating the flexibility of the service API by initializing a vLLM AsyncEngine in the service constructor and run inference with continuous batching in the service API.
Allow flexible saving and loading models using the bentoml.models.create API instead of framework specific APIs, e.g. bentoml.pytorch.save_model, bentoml.tensorflow.save_model.
🚚 Streamlined the deployment workflow to allow more rapid development iterations and a faster time to production.
Enabled direct deployment to production through CLI and Python API from Git projects.
🎨 Improved API development experience with generated web UI and rich Python client.
All bentos are now accompanied by a custom-generated UI in the BentoCloud Playground, tailored to their API definitions.
BentoClient offers a Pythonic way to invoke the service endpoint, allowing parameters to be supplied in native Python format, letting the client efficiently handles the necessary serialization while ensuring compatibility and performance.
🎭 We’ve learned that the best way to showcase what BentoML can do is not through dry, conceptual documentation but through real-world examples. Check out our current list of examples, and we’ll continue to publish new ones to the gallery as exciting new models are released.
One of the remaining items for BentoML 1.2 release, is to support BentoML 1.2 in the Yatai BentoDeployment operator. We plan to have full support for 1.2 in Yatai 2.0. This issue is for tracking BentoML 1.2 support in current version of Yatai.
The text was updated successfully, but these errors were encountered:
You may have seen our latest announcement on BentoML 1.2 release here: https://bentoml.slack.com/archives/CK8PQU2JY/p1708444803664399
One of the remaining items for BentoML 1.2 release, is to support BentoML 1.2 in the Yatai BentoDeployment operator. We plan to have full support for 1.2 in Yatai 2.0. This issue is for tracking BentoML 1.2 support in current version of Yatai.
The text was updated successfully, but these errors were encountered: