Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

weekly update #147

Merged
merged 6 commits into from
Jan 9, 2024
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
20 changes: 20 additions & 0 deletions assets/anthropic.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -545,3 +545,23 @@
prohibited_uses: ''
monitoring: ''
feedback: none
- type: application
name: Claude for Sheets
organization: Anthropic
description: Claude for Sheets is a Google Sheets add-on that allows the usage of Claude directly in Google Sheets.
created_date: 2023-12-21
url: https://workspace.google.com/marketplace/app/claude_for_sheets/909417792257
dependencies: [Claude]
rishibommasani marked this conversation as resolved.
Show resolved Hide resolved
adaptation: ''
output_space: AI-generated text from prompt
quality_control: ''
access: open
license: unknown
terms_of_service: https://claude.ai/legal
intended_uses: as an integrated AI assistant in Google Sheets
prohibited_uses: ''
monitoring: unknown
feedback: Reviews on https://workspace.google.com/marketplace/app/claude_for_sheets/909417792257
monthly_active_users: unknown
user_distribution: unknown
failures: unknown
22 changes: 22 additions & 0 deletions assets/cresta.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
---
- type: model
name: Ocean-1
organization: Cresta
description: Ocean-1 is the culmination of Cresta's experience in deploying generative AI systems for large enterprises and signifies their latest milestone in advancing the cutting edge AI technology for customer facing conversations.
created_date: 2023-06-20
url: https://cresta.com/blog/introducing-ocean-1-worlds-first-contact-center-foundation-model/
model_card: none
modality: text; text
analysis: Outperforms GPT-4 in common sense and reasoning tasks on the basis of both efficiency and accuracy.
size: 7B parameters (dense)
dependencies: [GPT-4, Claude, Falcon-40B]
rishibommasani marked this conversation as resolved.
Show resolved Hide resolved
training_emissions: unknown
training_time: unknown
training_hardware: unknown
quality_control: ''
access: closed
license: unknown
intended_uses: Acting as a contact center chatbot agent.
prohibited_uses: none
monitoring: unknown
feedback: none
22 changes: 22 additions & 0 deletions assets/deci.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
---
- type: model
name: DeciLM
organization: Deci
description: DeciLM is a LLM that on release ranks as the fastest and most accurate model of its size.
created_date: 2023-12-12
url: https://deci.ai/blog/introducing-decilm-7b-the-fastest-and-most-accurate-7b-large-language-model-to-date
model_card: https://deci.ai/model-zoo/decilm-7b/
modality: text; text
analysis: Evaluated on the OpenLLM benchmarks and, on release, outperforms all other 7B models on the OpenLLM Leaderboard.
size: 7B parameters (dense)
dependencies: []
training_emissions: unknown
training_time: unknown
training_hardware: NVIDIA A10 GPUs
quality_control: ''
access: open
license: Apache 2.0
intended_uses: This model is intended for commercial and research use in English and can be fine-tuned for use in other languages.
prohibited_uses: ''
monitoring: unknown
feedback: none
21 changes: 21 additions & 0 deletions assets/google.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -1678,3 +1678,24 @@
within specific downstream applications without prior assessment
monitoring: Google internal monitoring
feedback: Specific queries provided by annotators
- type: model
name: MedLM
organization: Google
description: MedLM is a collection of foundation models tuned to follow natural language instructions for tasks in medicine, such as question answering and creating draft summaries.
created_date: 2023-12-13
url: https://cloud.google.com/vertex-ai/docs/generative-ai/medlm/overview
model_card: https://cloud.google.com/static/vertex-ai/docs/generative-ai/medlm/MedLM-model-card.pdf
modality: text; text
analysis: Assessed on medical benchmarks of professional medical exams, medical research, and consumer queries.
size: unknown
dependencies: []
training_emissions: unknown
training_time: unknown
training_hardware: unknown
quality_control: ''
access: limited
license: unknown
intended_uses: to be used for question answering and creating draft summaries from existing documentation, to be reviewed, edited, and approved by the user before use.
prohibited_uses: ''
monitoring: Google internal monitoring
feedback: none
44 changes: 44 additions & 0 deletions assets/llm360.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,44 @@
---
- type: model
name: Amber
organization: LLM360
description: Amber is the first model in the LLM360 family, an initiative for comprehensive and fully open-sourced LLMs, where all training details, model checkpoints, intermediate results, and additional analyses are made available to the community.
created_date: 2023-12-12
url: https://www.llm360.ai/
model_card: https://huggingface.co/LLM360/Amber
modality: text; text
analysis: Evaluated on several benchmark LLM tasks
size: 7B parameters (dense)
dependencies: [Arxiv, Books, C4, RefinedWeb, StarCoder, StackExchange, Wikipedia]
training_emissions: unknown
training_time: unknown
training_hardware: 56 DGX A100 nodes, each equipped with 4 80GB A100 GPUs
quality_control: ''
access: open
license: Apache 2.0
intended_uses: to support open and collaborative AI research by making the full LLM training process transparent.
prohibited_uses: ''
monitoring: unknown
feedback: https://huggingface.co/LLM360/Amber/discussions

- type: model
name: CrystalCoder
organization: LLM360
description: CrystalCoder is a language model with a balance of code and text data that follows the initiative under LLM360 of its training process being fully transparent.
created_date: 2023-12-12
url: https://www.llm360.ai/
model_card: https://huggingface.co/LLM360/CrystalCoder
modality: text; code, text
analysis: Evaluated on English and coding tasks and benchmarks, and outperforms LLaMA 2 in some.
size: 7B parameters (dense)
dependencies: [SlimPajama dataset, StarCoder]
training_emissions: unknown
training_time: unknown
training_hardware: Trained on the Cerebras Condor Galaxy 1 (CG-1), a 4 exaFLOPS, 54 million core, 64-node cloud AI supercomputer.
quality_control: ''
access: open
license: Apache 2.0
intended_uses: to support open and collaborative AI research by making the full LLM training process transparent.
prohibited_uses: ''
monitoring: unknown
feedback: https://huggingface.co/LLM360/CrystalCoder/discussions
3 changes: 3 additions & 0 deletions js/main.js
Original file line number Diff line number Diff line change
Expand Up @@ -629,6 +629,9 @@ function loadAssetsAndRenderPageContent() {

const paths = [
'assets/adept.yaml',
'assets/cresta.yaml',
'assets/llm360.yaml',
'assets/deci.yaml',
'assets/mila.yaml',
'assets/soochow.yaml',
'assets/baichuan.yaml',
Expand Down
Loading