This library attempts to simplify the process of trying LLMs on Apple silicon:
- Simplifies local model cache management.
- Provides local and online model searches that automatically filter out models that have no chance of fitting in your Mac's memory.
Please see the examples notebook.