Releases · KerfuffleV2/llm-samplers

09 Nov 08:28

v0.0.7

788e181

v0.0.7

Fix a bug where Mirostat2 sampled twice.
Fix a bug where the flat bias sampler assumed it had the full logits to work with and directly indexed based on token id.
Simplify sampler types by removing type variables. Samplers always use u32 as the token id and f32 as the logit type now.
Add min-p sampler. See: ggerganov/llama.cpp#3483 (comment)
Add top-a sampler. See: https://github.com/BlinkDL/RWKV-LM#the-top-a-sampling-method
Try to avoid unnecessarily running softmax calculation.
Add a try_from_iter_top_k which pre-prunes the logits while building (and also results in the list starting out sorted).

0.0.6 to 0.0.7 Migration

Unfortunately, this involved some breaking changes. Basically, the samplers and chains no
longer take token id and logits type variables anymore. You can have your token ids in any
color you like, as long as it's u32. Same for logits: they're always f32 now.

For example, where previously you would have done SampleRandDistrib::<u32>::new or SampleMirostat2::<u32, f32>::new,
you only need SampleRandDistrib::new, SampleMirostat2::new. Same for creating chains: SamplerChain::<u32, f32>::new will
only need SamplerChain::new.