
Upstream changes for v0.7.0 release #134

Merged

shubhadeepd merged 1 commit into main from v0.7.0-draft on Jun 18, 2024

Conversation

@shubhadeepd
Collaborator

This release switches all examples to use cloud-hosted, GPU-accelerated LLM and embedding models from the Nvidia API Catalog by default. It also deprecates support for deploying on-prem models using the NeMo Inference Framework Container, and adds support for deploying accelerated generative AI models across cloud, data center, and workstation using the latest Nvidia NIM-LLM.

For detailed changes, please refer to the CHANGELOG.md file.
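The default-endpoint switch described above can be sketched as a small resolver: the examples now target the cloud-hosted API Catalog unless the user points them at a self-hosted NIM. All names below are hypothetical for illustration (the environment variable `NIM_BASE_URL` is made up); the real configuration lives in the per-example configs and CHANGELOG.md.

```python
import os

# Cloud-hosted API Catalog endpoint used as the illustrative default.
CLOUD_BASE_URL = "https://integrate.api.nvidia.com/v1"


def resolve_llm_endpoint() -> str:
    """Return the base URL an example should talk to.

    By default this points at the cloud-hosted API Catalog; setting
    NIM_BASE_URL (a made-up variable name for this sketch) redirects
    the example to a self-hosted NIM deployment instead.
    """
    return os.environ.get("NIM_BASE_URL", CLOUD_BASE_URL)


if __name__ == "__main__":
    print(resolve_llm_endpoint())
```

Keeping cloud-by-default with a single override variable is one way to express the release's direction: the same example code can run against the API Catalog or a local NIM without edits.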

@shubhadeepd shubhadeepd self-assigned this Jun 14, 2024
@shubhadeepd shubhadeepd added labels: bug (Something isn't working), documentation (Improvements or additions to documentation), enhancement (New feature or request), dependencies (Pull requests that update a dependency file) on Jun 14, 2024
nv-pranjald
nv-pranjald previously approved these changes Jun 14, 2024
Collaborator, Author

@shubhadeepd shubhadeepd left a comment


Waiting for approval from code owners.

Signed-off-by: Shubhadeep Das <shubhadeepd@nvidia.com>
Contributor

@jliberma jliberma left a comment


Looks good, thanks Shubhadeep

@shubhadeepd shubhadeepd merged commit b43e8b0 into main Jun 18, 2024
@shubhadeepd shubhadeepd deleted the v0.7.0-draft branch June 18, 2024 15:48
anniesurla pushed a commit to anniesurla/GenerativeAIExamples that referenced this pull request Jun 5, 2025
Signed-off-by: Shubhadeep Das <shubhadeepd@nvidia.com>
buildvoc-agent pushed a commit to buildvoc/GenerativeAIExamples that referenced this pull request Mar 21, 2026
Previously, we tokenized the output and counted tokens ourselves to stop generation when max tokens was reached. Now we let the mistral.rs engine handle it, which saves the extra tokenization step.

Also, dynamo-run now prints which engines are compiled in as part of its help message, along with some minor lint fixes.
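The before/after of the token-counting change in that commit can be sketched with a toy example. This is purely illustrative: a whitespace split stands in for a real tokenizer, a plain list of text chunks stands in for the mistral.rs stream, and all function names are made up.

```python
def toy_tokenize(text: str) -> list[str]:
    """Stand-in tokenizer: one token per whitespace-separated word."""
    return text.split()


def old_style_stop(chunks, max_tokens: int) -> str:
    """Old approach: re-tokenize the accumulated output after every
    chunk and stop once the token count reaches max_tokens. Note the
    extra tokenization pass on each iteration."""
    out = ""
    for chunk in chunks:
        out += chunk
        if len(toy_tokenize(out)) >= max_tokens:
            break
    return out


def engine_side_stop(chunks, max_tokens: int) -> str:
    """New approach: the engine is told max_tokens up front and the
    caller simply consumes what it emits. Here the 'engine' is
    simulated by slicing the chunk list (one token per chunk)."""
    return "".join(list(chunks)[:max_tokens])
```

Both functions produce the same truncated output; the difference is that the second never re-tokenizes on the consumer side, which is the saving the commit message describes.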

Labels

bug (Something isn't working), dependencies (Pull requests that update a dependency file), documentation (Improvements or additions to documentation), enhancement (New feature or request)

Projects

None yet


3 participants