forked from mlc-ai/mlc-llm
Refactor to prepare for parallel sampling #100
Merged
32 commits
712ccec wip (masahi)
2d05640 wip (masahi)
f3742b9 wip (masahi)
a38c955 fix (masahi)
b960d4d fix (masahi)
a5e6e37 fix (masahi)
3b2df21 refactor (masahi)
77e0e5f more refactor (masahi)
5808958 wip (masahi)
e4f21b4 wip (masahi)
4080146 more refactor (masahi)
9d42deb more refactor (masahi)
9eb92f8 fixed (masahi)
18b8e41 fixed mypy (masahi)
bdb0be3 minor (masahi)
27da1c2 msg clean (masahi)
f9747ac fix missing finish_reason (masahi)
522edd7 remove unnecessary type annot on defaultdict (masahi)
5585197 Return requests state from get_requests_to_process (masahi)
421d2ea simplify typing (masahi)
a83c494 reduced list concat (masahi)
5382004 remove dict add and lookup (masahi)
55045d0 wrong comment (masahi)
e7b6a3c Revert "remove dict add and lookup" (masahi)
d962435 fix sampler test (masahi)
78ab330 make it possible to disable prometheus metrics (masahi)
70369fc collect metrics only in staging engine (masahi)
b16f787 return False in stop_by_length if request is already finished (masahi)
fd39416 move check_stopping_sequences to engine_common.py (masahi)
c8b7f55 add missing free_request method to Dummy cache manager (masahi)
1853a54 update Dummy cache manager to operate on sequence (masahi)
242e3de fix request finish condition (masahi)
Now this class basically acts like `SequenceGroup` in vllm.