Apple's machine learning research framework, designed for Apple Silicon's unified memory architecture. NumPy-style Python API; full C++, C, and Swift bindings. Lazy evaluation, dynamic graphs, composable function transformations. Not a serving runtime on its own; you reach for `mlx-lm` (LLMs) or `mlx-vlm` (vision-language) or build on top with oMLX. Apache 2.0, actively developed by Apple ML Research.
Supported platforms: macOS (Apple Silicon), Linux (CUDA), Linux (CPU)
Arrays live in shared memory; ops run on CPU or GPU without copying data. Native fit for Apple Silicon's memory architecture.
NumPy-style Python. PyTorch-style `mlx.nn` and `mlx.optimizers` for building models. Cross-language: Swift, C++, C.
Computations materialize only when needed. Dynamic graph construction; no slow recompiles on shape changes.
Official LLM and vision-language model packages. Where you actually serve text and multimodal models from Python.
Automatic differentiation, vectorization, computation graph optimization. Composable, JAX-style.
// sign in with github to leave a comment. threads live in the repo's discussions tab.