-
Notifications
You must be signed in to change notification settings - Fork 35
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #141 from stanford-crfm/jonathan/1027-weekly-assets
update weekly assets
- Loading branch information
Showing
11 changed files
with
282 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
--- | ||
- type: dataset | ||
name: EXMODD | ||
organization: Beijing Institute of Technology | ||
description: EXMODD (Explanatory Multimodal Open-Domain Dialogue dataset) is a dataset built off the proposed MDCF (Multimodal Data Construction Framework). | ||
created_date: 2023-10-17 | ||
url: https://arxiv.org/pdf/2310.10967.pdf | ||
datasheet: none | ||
modality: image, text | ||
size: unknown | ||
sample: [] | ||
analysis: Models fine-tuned on EXMODD and earlier dataset Image-Chat and then evaluated on Image-Chat validation set. | ||
dependencies: [YFCC100M, Image-Chat] | ||
included: '' | ||
excluded: '' | ||
quality_control: '' | ||
access: open | ||
license: MIT | ||
intended_uses: '' | ||
prohibited_uses: '' | ||
monitoring: none | ||
feedback: Feedback can be sent to authors via [email protected] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
--- | ||
- type: model | ||
name: SALMONN | ||
organization: ByteDance, Tsinghua University | ||
description: SALMONN is a large language model (LLM) enabling speech, audio event, and music inputs. | ||
created_date: 2023-10-20 | ||
url: https://github.com/bytedance/SALMONN | ||
model_card: https://huggingface.co/MSIIP/SALMONN | ||
modality: audio, text; text | ||
analysis: Evaluated on benchmarks pertaining to speech, music, and other audio recognition. | ||
size: unknown | ||
dependencies: [Whisper, BEATs, Vicuna] | ||
training_emissions: unknown | ||
training_time: unknown | ||
training_hardware: unknown | ||
quality_control: '' | ||
access: open | ||
license: Apache 2.0 | ||
intended_uses: '' | ||
prohibited_uses: '' | ||
monitoring: none | ||
feedback: https://huggingface.co/MSIIP/SALMONN/discussions |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
--- | ||
- type: model | ||
name: Pegasus-1 | ||
organization: Twelve Labs | ||
description: Pegasus-1 is a video-language foundation model. | ||
created_date: 2023-10-23 | ||
url: https://app.twelvelabs.io/blog/introducing-pegasus-1 | ||
model_card: none | ||
modality: video; text | ||
analysis: Evaluated in comparison to SOTA video-to-language models. | ||
size: 80B parameters (dense) | ||
dependencies: [MSR-VTT, Video-ChatGPT Video Descriptions Dataset] | ||
training_emissions: unknown | ||
training_time: unknown | ||
training_hardware: unknown | ||
quality_control: Data selected and cleaned to eliminate toxic and biased content. | ||
access: open | ||
license: unknown | ||
intended_uses: '' | ||
prohibited_uses: '' | ||
monitoring: none | ||
feedback: none |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters