What's Changed
- Adding metal moe kernels for prefill by @ibahmed-oai in #191
- Speeding up prefill with new moe kernels by @ibahmed-oai in #192
- Metal: fix resource leak in prefill by @Maratyszcza in #194
- Metal: support sharded checkpoints in the converter by @Maratyszcza in #195
- Polish the python code and browser experience by @dkundel-openai in #198
Full Changelog: v0.0.7...v0.0.8