Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

Blaizzy / mlx-vlm Public

Sponsor
Notifications
Fork 86
Star 1k

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Releases: Blaizzy/mlx-vlm

Releases · Blaizzy/mlx-vlm

v0.1.7

30 Dec 01:41

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.1.7

What's Changed

Fix multi-image and 2x speed improvements (DS-VL2) by @Blaizzy in #157
Refactor utils (model loading, inference and output processing) by @Blaizzy in #161
Fix Llama-3.2-Vision (18x faster generation and 75% less memory usage) by @Blaizzy in #163

⚠️ Breaking Changes

This release introduces some breaking changes. If you encounter any issues, please open an issue or submit a PR.

Full Changelog: v0.1.6...v0.1.7

Contributors

Blaizzy

Assets 2

Loading

All reactions

v0.1.6

22 Dec 20:00

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.1.6

What's Changed

Fixes DeepSeek quant loading by @Blaizzy in #156
Fixes Florence-2 LM only by @Blaizzy in #156

Full Changelog: v0.1.5...v0.1.6

Contributors

Blaizzy

Assets 2

Loading

All reactions

v0.1.5

22 Dec 17:19

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.1.5

What's Changed

Add support for Deepseek-vl2 by @Blaizzy in #153
Add support for Language only inputs by @Blaizzy in #153

Full Changelog: v0.1.4...v0.1.5

Contributors

Blaizzy

Assets 2

Loading

All reactions

v0.1.4

05 Dec 22:32

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.1.4

What's Changed

Add support for Paligemma-2 by @Blaizzy in #142
Update prompt utils + bump version (Paligemma-2) by @Blaizzy in #143

Full Changelog: v0.1.3...v0.1.4

Contributors

Blaizzy

Assets 2

Loading

All reactions

v0.1.3

28 Nov 15:57

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.1.3

What's Changed

Add lazy eval during conversion by @Blaizzy in #127
Open tokenizer.json within context manager by @neilmehta24 in #129
Fix Bugs in chat UI by @terhechte in #96
Fix broken stream generate for SmolVLM and others by @andimarafioti in #132
Fix idefics3 by @Blaizzy in #133

New Contributors

@neilmehta24 made their first contribution in #129
@terhechte made their first contribution in #96
@andimarafioti made their first contribution in #132

Full Changelog: v0.1.2...v0.1.3

Contributors

terhechte, andimarafioti, and 2 other contributors

Assets 2

Loading

All reactions

v0.1.2

26 Nov 21:46

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.1.2

What's Changed

Fix padding type casting by @Blaizzy in #125
Idefics 3 support by @pcuenca in #124

New Contributors

@pcuenca made their first contribution in #124

Full Changelog: v0.1.1...v0.1.2

Contributors

pcuenca and Blaizzy

Assets 2

Loading

All reactions

v0.1.1

23 Nov 15:15

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.1.1

What's Changed

Add example notebooks and support for system role by @Blaizzy in #95
fix pixtral image prompt order for doc VQA by @ndurner in #99
Fix Qwen2-VL OCR and repetition penalty by @Blaizzy in #109
Qwen2-VL performance improvements by @Blaizzy in #113
Faster / more memory efficient Qwen VL by @awni in #114
Add support for Molmo by @Blaizzy in #112
Add support for Florence-2 by @Blaizzy in #105
Fix image masks and update pointing example by @Blaizzy in #117

New Contributors

@ndurner made their first contribution in #99
@awni made their first contribution in #114

Full Changelog: v0.1.0...v0.1.1

Contributors

ndurner, awni, and Blaizzy

Assets 2

Loading

All reactions

v0.1.0

18 Oct 00:15

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.1.0

What's Changed

Add support for Pixtral-12B by @Blaizzy in #67
Fix pixtral multi-image by @hiima234 in #41
Added: Qwen2-VL Unit Tests, Refactored Weight Sanitization by @benzimring in #63
Trainer + Multi image v0.1.0 by @Blaizzy in #41
Fix example scripts in the readme.md to import and use load_config by @mark-lord in #82
Qwen2-VL Improvements (1-2x speedup) by @Blaizzy in #89
Fix Paligemma object detection and segmentation by @Blaizzy in #90
Add support for Llama-3.2-vision & Resize image by @Blaizzy in #83
Fix idefics-2 mask by @Blaizzy in #91

New Contributors

@benzimring made their first contribution in #63
@mark-lord made their first contribution in #82

Full Changelog: v0.0.15...v0.1.0

Contributors

Blaizzy, benzimring, and 2 other contributors

Assets 2

Loading

lin72h, 6, and HongyuS reacted with hooray emoji

All reactions

🎉 3 reactions

3 people reacted

v0.0.15

29 Sep 00:24

Blaizzy

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

v0.0.15

What's Changed

Qwen2-VL fix vision tower bug for HD imagges by @Blaizzy in #62

Full Changelog: v0.0.14...v0.0.15

Contributors

Blaizzy

Assets 2

Loading

Goekdeniz-Guelmez reacted with thumbs up emoji

amirhossein-razlighi reacted with rocket emoji

All reactions

👍 1 reaction
🚀 1 reaction

2 people reacted

v0.0.14

28 Sep 16:14

Blaizzy

Compare

Choose a tag to compare

Loading

v0.0.14

What's Changed

Add support for Qwen2-VL by @Blaizzy in #59

Full Changelog: v0.0.13...v0.0.14

Contributors

Blaizzy

Assets 2

Loading

Goekdeniz-Guelmez reacted with hooray emoji

All reactions

🎉 1 reaction

1 person reacted

Previous 1 2 3 4 Next

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.