The first Malaysian Large Language Chat Model !


Multi-lingual Malaysian Chat Language Model, 32k context length, Malaysian centric and private.



Able to understand Standard Malay, local Malay, Jawi, Standard English, Manglish, Mandarin and Indonesian as input.

USA map
chart image

Suitable for RAG

Integrate MaLLaM πŸŒ™ with existing knowledge base for better User-Chat experiences, we use GARIS-PANDUAN-SRF-RFP-2024-v1.pdf for an example.

Function Call

Convert Natural text to JSON format for faster system integration, MaLLaM πŸŒ™ achieved 100% JSON-able on NousResearch/json-mode-eval, higher is better.

Better Accuracy

We benchmarked on Malay test set, mesolitica/malay-llm-leaderboard, higher is better.


You can play around with MaLLaM πŸŒ™, try it at Nous App

USA map
USA map

Try the API

Or if you want to integrate MaLLaM πŸŒ™ with existing system and compatible with OpenAI library, Nous LLM Router Documentation


Prepaid based, natively Multi-lingual, general knowledge, RAG, Function Call, Coding and Multi-turn.

Model name Input / 1M tokens Output / 1M tokens
MaLLaM πŸŒ™ Tiny MYR 2.50 MYR 10.00
MaLLaM πŸŒ™ Small MYR 5.00 MYR 15.00


Self-host MaLLaM πŸŒ™ in your private network for 100% privacy, either on-premise or private cloud, read more at MaLLaM πŸŒ™ Self-hosted Enterprise

Frequently asked questions

What is the different this MaLLaM πŸŒ™ and MaLLaM πŸŒ™ open sourced?

This MaLLaM πŸŒ™ continue pretraining on more dataset, finetuned with bigger instruction dataset and aligned with human policy.

What is the rate limit?

Currently we hard limit 100k Tokens per Minute.

How to topup?

Just go to billing page and topup! Minimum RM3 and Maximum RM99.