AI Toolkit docs migration #7827
base: main
@sffamily First iteration of reviews. Once you've dealt with this, I'll do another more in-depth review.
…e-docs into zhshang/aitoolkit
@ntrogh I just went through and addressed all your first-pass comments. In addition, I added two new pages for features we are releasing this week and made some minor content updates. Please help review.
@sffamily more feedback
Co-authored-by: Nick Trogh <[email protected]>
@ntrogh I've updated the PR and addressed all round 2 comments. Please help review.
@sffamily Thanks for the updates. Just a couple small updates remaining.
Bulk run feature in AI Toolkit allows you to run many prompts in batch mode. Compared to playground you can only run each prompt manually in order.
The bulk run feature in AI Toolkit allows you to run multiple prompts in batch. When you use the playground, you can only run one prompt manually at a time, in the order they're listed. Bulk run takes a dataset as input, where each row in the dataset has a prompt as the minimal requirement. Typically teh dataset has multiple rows. Once imported, you can select any prompt to run or run all prompts on the selected model. The responses will be displayed in the same dataset view. Executed dataset can be exported.
The bulk run feature in AI Toolkit allows you to run multiple prompts in batch. When you use the playground, you can only run one prompt manually at a time, in the order they're listed. Bulk run takes a dataset as input, where each row in the dataset has a prompt as the minimal requirement. Typically teh dataset has multiple rows. Once imported, you can select any prompt to run or run all prompts on the selected model. The responses will be displayed in the same dataset view. Executed dataset can be exported.
The bulk run feature in AI Toolkit allows you to run multiple prompts in batch. When you use the playground, you can only run one prompt manually at a time, in the order they're listed. Bulk run takes a dataset as input, where each row in the dataset has a prompt as the minimal requirement. Typically, the dataset has multiple rows. Once imported, you can select any prompt to run or run all prompts on the selected model. The responses will be displayed in the same dataset view. The results from running the dataset can be exported.
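For readers of this thread, the batch behavior described in the paragraph above can be pictured with a rough sketch. This is not how AI Toolkit implements bulk run; `bulk_run` and `run_model` are hypothetical names, and the `response` field name is an assumption based on the response column mentioned later in this review.

```python
def bulk_run(rows, run_model):
    """Run every row's `query` through `run_model` and attach the result.

    `rows` is a list of dicts that each contain a `query` string.
    `run_model` is a hypothetical callable standing in for the selected model.
    Returns new dicts so the input dataset is left unchanged.
    """
    results = []
    for row in rows:
        result = dict(row)  # copy the row so the original is not mutated
        result["response"] = run_model(row["query"])
        results.append(result)
    return results


def echo_model(prompt):
    """Placeholder model that upper-cases the prompt (illustration only)."""
    return prompt.upper()


dataset = [{"query": "hello"}, {"query": "summarize this text"}]
executed = bulk_run(dataset, echo_model)
```

Each entry in `executed` keeps its original `query` and gains a `response`, which mirrors the idea of responses being shown next to the prompts in the same dataset view.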
Navigate to `Bulk Run` in the `TOOLS` section in the tree view of AI Toolkit. You can select either a sample dataset to get familiar about the feature, or import a local JSONL file that has at least a field `query` to use as prompts.
1. Open Bulk Run view
1. Open Bulk Run view
1. In the AI Toolkit view, select **TOOLS** > **Bulk Run** to open the Bulk Run view.
In the AI Toolkit view, select **TOOLS** > **Bulk Run** to open the Bulk Run view.
In the AI Toolkit view, select **TOOLS** > **Bulk Run** to open the Bulk Run view.
## Run any prompt
1. Select dataset
1. Select dataset
1. Select either a sample dataset or import a local JSONL file that has a `query` field to use as prompts.
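It might help the docs to show what such a file looks like. Below is a minimal sketch, assuming only what the text states: one JSON object per line, each with a `query` field. The helper names `write_jsonl` and `rows_missing_query` are hypothetical and not part of AI Toolkit.

```python
import json


def write_jsonl(path, prompts):
    """Write one JSON object per line, each with a `query` field."""
    with open(path, "w", encoding="utf-8") as f:
        for prompt in prompts:
            f.write(json.dumps({"query": prompt}, ensure_ascii=False) + "\n")


def rows_missing_query(path):
    """Return 1-based line numbers of rows without a non-empty string `query`."""
    bad = []
    with open(path, encoding="utf-8") as f:
        for lineno, line in enumerate(f, start=1):
            if not line.strip():
                continue  # ignore blank lines
            try:
                row = json.loads(line)
            except json.JSONDecodeError:
                bad.append(lineno)  # line is not valid JSON
                continue
            query = row.get("query") if isinstance(row, dict) else None
            if not isinstance(query, str) or not query.strip():
                bad.append(lineno)
    return bad
```

Calling `write_jsonl("prompts.jsonl", ["What is JSONL?"])` produces a file whose single line is `{"query": "What is JSONL?"}`. Running `rows_missing_query` on a file before importing it is one way to catch rows that would have no prompt.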
Once dataset is loaded in the bulk run view, you can find the table format of the imported dataset, with column of query and column of response.
You can select either a sample dataset to get familiar about the feature, or import a local JSONL file that has at least a field `query` to use as prompts.
You can select either a sample dataset to get familiar about the feature, or import a local JSONL file that has at least a field `query` to use as prompts.
## Monitor the evaluation job
Once an evaluation job is started, you can find its status from the evaluation job overview. Each evaluation job has link to the dataset used, logs from evaluation process, timestamp and details of the evaluation.
Once an evaluation job is started, you can find its status from the evaluation job view.
![Running evaluation](./images/evaluation/running.png)
![Running evaluation](./images/evaluation/running.png)
![Running evaluation](./images/evaluation/running.png)
Each evaluation job has link to the dataset used, logs from evaluation process, timestamp and link to the details of the evaluation.
Each evaluation job has link to the dataset used, logs from evaluation process, timestamp and link to the details of the evaluation.
Each evaluation job has a link to the dataset that was used, logs from the evaluation process, timestamp, and a link to the details of the evaluation.
You can also select **Open In Data Wrangler** to open the data in `Data Wrangler` extension.
You can also select **Open In Data Wrangler** to open the data in `Data Wrangler` extension.
You can also select **Open In Data Wrangler** to open the data with the Data Wrangler extension.
You can filter the models in the model catalog by using the following criteria:
- Hosted by:
1. use the filters to reuce the list of available models
1. use the filters to reuce the list of available models
1. Use the filters to reduce the list of available models
![Select model in model catalog](./images/models/model_catalog.png)
Put the screenshot after the list of filters.