Ollama logo
AI provider on ChatMitra

Ollama for WhatsApp business

Open-source models — local for free, cloud for $20/mo.

  • Free tier ✓
  • Cost: Free (local) / $20/mo (cloud)
  • Speed: Fast
  • Best for: Self-host or fixed-price cloud
What is a Ollama chatbot?

It's a WhatsApp bot powered by Ollama-hosted open-source models. With ChatMitra you bring your Ollama Cloud API key — setup takes about 10 minutes, and you get fixed-price subscription pricing instead of per-token billing.

Founded
2023
HQ
Palo Alto, USA

What it looks like in real businesses

Four chats from real industries, all using Ollama (gpt-oss:120b-cloud) inside ChatMitra. Mix of English, Hindi, and Hinglish — the way customers actually type.

Aarogya Skin Clinic
Clinic • Online
english
Do you offer acne consultation? What does it cost?
11:05 AM ✓✓
Yes — Dr Mehta's acne consultation is ₹800 for the first visit (30 min) and ₹500 for follow-ups. We also have a derma-grade skincare assessment for ₹1,500 that's worth it if it's your first time. Want our front desk to find you a slot?
11:05 AM
First visit. Saturday morning if possible.
11:05 AM ✓✓
Sure. Share your name and contact — I'll forward your request to the clinic. They'll check Saturday morning availability and message you the slot, location pin, and pre-visit form.
11:05 AM
QuickLoan India
Finance / lending • Online
english
I want a personal loan of ₹5 lakh. What are the rates?
4:22 PM ✓✓
For ₹5L unsecured personal loan, our rates typically start at 11.5% p.a. for 3-year tenure if your credit score is 750+. EMI works out to roughly ₹16,500 / month. Want our loan officer to call you with a personalised quote? Just share your name and a good time to call.
4:22 PM
Rajesh, today after 4 PM works
4:22 PM ✓✓
Thanks Rajesh. I've forwarded your request to our loans team — they'll call you between 4–6 PM today, walk you through the documents needed, and confirm the final rate after KYC.
4:22 PM
Pixel Grove Agency
Agency / B2B services • Online
english
We need a brand identity + website for a new D2C skincare brand. What's a typical timeline?
1:14 PM ✓✓
For D2C skincare we usually run 8 weeks end-to-end — 3 weeks brand (logo, palette, pack design), 4 weeks website, 1 week launch prep. Indicative budget: ₹4–6 lakhs depending on photography and copy scope. Want me to forward your details so our founder can set up a discovery call?
1:14 PM
Yes please, sometime next week
1:14 PM ✓✓
Got it. Share your email and 2-3 preferred slots — I'll forward to our founder and she'll confirm one back to you in a few minutes.
1:14 PM
InsightCRM
B2B / SaaS • Online
english
Looking for a CRM for our 50-person sales team. What's your pricing?
3:08 PM ✓✓
For 50 seats, our Growth plan is ₹1,800 per user / month — ₹90,000 total. That includes pipeline, automations, and our WhatsApp integration. Want me to share your details so our team can set up a 30-min demo?
3:08 PM
Yes, Thursday 4 PM works
3:08 PM ✓✓
Got it. I've shared your interest and preferred slot with our sales team — they'll confirm Thursday 4 PM availability and send the calendar invite shortly.
3:08 PM

Picked by ChatMitra customers in regulated industries (healthcare, finance) who want self-hosted privacy or fixed-price cloud predictability.

Best for
Self-host or fixed-price cloud
Cost
Free (local) / $20/mo (cloud)
Speed
Fast
Setup time
~10 minutes (cloud)

In simple terms: Ollama doesn't try to be everything — it's strong at one specific thing: subscription-priced cloud inference instead of per-token billing. $20/month Pro covers genuine production volume; for highest privacy needs, you can also self-host the same models locally for free.

Not 100% sure Ollama is right for you?

That's okay — you can switch any time inside ChatMitra. Quick orientation:

  • • If cost is your main concern, Groq or DeepSeek are worth a look
  • • If top-shelf quality matters more, OpenAI or Anthropic are the usual picks
  • • If you want balance, Gemini sits well in the middle

Still unsure? Start with Ollama — switching later is one click inside ChatMitra without rebuilding your bot.

What is Ollama, really?

Ollama doesn't try to be everything — it's strong at one specific thing: predictable, subscription-style pricing for hosted open-source models. Pro is $20/month; Max is $100/month.

Ollama is a US open-source project / company founded in 2023. They're best known for making local LLM hosting trivially easy. Ollama Cloud (launched September 2025) extends this to hosted inference with a fixed-price subscription instead of per-token billing.

Ollama models in 2026

The model with the green badge is what we usually recommend for everyday WhatsApp business chats. Step up only if you actually need it.

gpt-oss:120b-cloud Pick
Smart
Context:128K tokens
Input: $0(Free)
Output: $0(Free)
Best for:OpenAI's open-source 120B model on Ollama Cloud — fixed subscription pricing
gpt-oss:20b-cloud
Fast
Context:128K tokens
Input: $0(Free)
Output: $0(Free)
Best for:smaller OpenAI open model on Ollama Cloud
qwen3-coder:480b-cloud
Smart
Context:128K tokens
Input: $0(Free)
Output: $0(Free)
Best for:Qwen's biggest coder model on Ollama Cloud

Pricing in USD as published on Ollama's pricing page. INR figures are approximate. Verified on 2026-04-30.

Source: official model docs · official pricing page

What will Ollama cost my business?

Three rough volumes, costs in approximate INR. Your actual number will vary — see assumptions below.

Ollama has a free tier: Local Ollama is free forever (you run on your own hardware). Cloud has a free tier with 1 concurrent cloud model. Pro ($20/mo) gets 50× more usage and 3 concurrent models. Max ($100/mo) gets 10 concurrent models.
Small store
100 chats / month
Free
Busy store
1,000 chats / month
Free
Diwali / festive scale
10,000 chats / month
Free
Assumptions used:
  • • Avg 15–20 messages per conversation
  • • Avg 50–100 tokens per message
  • • Pricing as published by Ollama on 2026-04-30 — may change. Source: ollama.com/pricing
Common mistake on Ollama: Picking Ollama Cloud Pro when your traffic is bursty. The subscription model has session limits that reset every 5 hours — fine for steady traffic, painful during festival sale spikes. For burst-prone businesses, pay-per-token providers (Groq, Cerebras free tier) handle the spikes better.

Best for these businesses

Pick Ollama Cloud when you want predictable monthly pricing instead of per-token billing surprises. Pick local Ollama when data residency / privacy matters most.

  • Teams that hate per-token billing variability and prefer a fixed monthly cost
  • Privacy-sensitive businesses that want to self-host (legal, healthcare, finance)
  • Developers comfortable with local infrastructure and DevOps
Where it doesn't fit
  • Beginners — Cloud is friendly, local self-hosting requires DevOps
  • Use cases that need closed-flagship quality (GPT-5, Claude, Gemini)

Ollama Cloud's subscription model is great for predictable spend, but if your traffic is bursty (festival sales) you'll hit session limits. For burst-prone businesses, pay-per-token providers handle the spikes better.

How to get your Ollama API key

Whole flow takes about 5 minutes. Mockups below are deliberately generic — Ollama's dashboard may look slightly different by the time you read this, but the steps stay roughly the same.

1. Create your Ollama Cloud account

Sign up using your business email. If you already have a Ollama Cloud login (or a parent-product login), the same one usually works.

ollama.com/cloud
Sign-up form
your-email@business.com
••••••••
Continue

2. Verify your email & claim free credits

Ollama Cloud drops you a free-tier credit (or rate-limit allowance) the moment you verify. That's enough for the first several thousand chats.

Billing • Payment method added

3. Open the API keys section

In the dashboard sidebar (or your profile menu), find "API keys". Click into it.

ollama.com/account/api-keys
API keys
prod-key-1
+ Create new key

4. Create a new secret key

Click "Create new key", give it a name like "ChatMitra production". Copy the key the moment it appears — most providers won't show it again.

Your new API key
sk-•••••••••••••••••
⚠ Copy now — won't be shown again

5. Paste it into ChatMitra

Inside the AI Chatbot wizard (Step 2 — Pick your AI), select Ollama Cloud, paste the key. ChatMitra validates it on the spot. Green tick = ready.

Sign-up form
your-email@business.com
••••••••
Continue

UI may change slightly as Ollama updates their dashboard. The flow stays roughly the same — sign up, add a payment method (if needed), find the API keys section, create a key, copy it.

Set up Ollama in ChatMitra (about 5 minutes)

Once you've copied your API key from Ollama, the rest happens inside ChatMitra. Same wizard you'd use for any other provider — just pre-filled for Ollama.

Step 2 of 5

Pick Ollama as your AI

In the ChatMitra AI Chatbot wizard, Step 2 shows the provider grid. Click Ollama's tile, paste the API key you just copied. ChatMitra checks the key right then — green tick = ready.

Ollama Ollama Selected
Valid
Step 2b

Pick a model

ChatMitra auto-selects gpt-oss:120b-cloud as the default — that's our pick for everyday WhatsApp business chat. You can override if you want a different one.

gpt-oss:120b-cloud
OpenAI's open-source 120B model on Ollama Cloud — fixed subscription pricing
Step 3 of 5

Write the bot's personality

Tell ChatMitra about your business — name, industry, a one-line description. Then write the personality prompt yourself, or click Generate with AI and tweak the draft. Ollama reads this prompt before every reply.

Step 4 of 5

Add a fallback (optional, recommended)

Add up to 3 backup providers. If Ollama rate-limits or has an outage, ChatMitra automatically switches to the backup — your customers don't see an error. Most teams add one of Huggingface or Openrouter here.

Step 5 of 5

Test, then activate

Built-in chat simulator lets you message your bot before any real customer does. Try a tricky question. When it looks right, hit Activate — and the bot starts answering on the WhatsApp number you've already connected to ChatMitra.

In simple words: Ollama is the brain. ChatMitra is the body — the WhatsApp connection, the inbox, the dashboard, the analytics. You don't have to choose between them.

Worried about getting locked in? You can change AI providers any time inside ChatMitra. The bot's personality, conversation history, and customer data stay exactly where they are — only the underlying AI changes. Try Ollama first, switch later if something better fits.

Ollama for Indian SMBs — pros & cons

What's good

  • Predictable subscription pricing — $20/mo or $100/mo, no surprise bills
  • Local self-hosting is genuinely free if you have a decent GPU
  • Strong open-source ethos — model and infra both open
  • Cloud models include OpenAI's GPT-OSS family

What to watch out for

  • Per-token billing isn't available yet (coming soon) — subscription model can feel restrictive
  • Session limits reset every 5 hours — tricky for sustained traffic
  • Local hosting requires real hardware and DevOps know-how
  • Cloud catalog is smaller than Together / Fireworks

Compare with other providers

Which is better — Ollama or Hugging Face? It depends what you need.

Each label is what the provider does best. Match it to your priority — that's usually the right pick.

Top picks by category:

  1. OpenAI — best for accuracy and complex replies
  2. Groq — best for cost and high-volume chat
  3. Google Gemini — best balanced choice for most Indian SMBs
  4. Ollama — open-source models — local for free, cloud for $20/mo

Sources & references

Every model name, context window, and price on this page is copied from Ollama's own public pages on 2026-04-30. Providers update pricing fairly often — please double-check before you commit to a plan.

Try Ollama on your WhatsApp in 5 minutes

15 days free. No card. Switch providers any time without rebuilding your bot.

Frequently Asked Questions

Is Ollama really free?

Local Ollama is free forever — you run the models on your own GPU. Cloud has a free tier with 1 concurrent model, Pro is $20/month, Max is $100/month. Local needs decent hardware (16-24GB VRAM minimum for useful models).

Can I self-host Ollama for my WhatsApp chatbot?

Yes — but you'd need to expose your local Ollama instance to ChatMitra (typically through ngrok or a public IP). Most ChatMitra customers find Cloud easier; self-hosted is for teams with strong privacy or compliance needs.

How does Ollama Cloud's subscription pricing work?

Pro ($20/month) gives you 50× more cloud usage than the free tier and 3 concurrent cloud models. Max ($100/month) gets 10 concurrent models. Both have session limits that reset every 5 hours and weekly limits that reset every 7 days.

Which Ollama Cloud model should I pick?

GPT-OSS 120B is the right default — OpenAI's open-source model, hosted on Ollama Cloud. For lighter tasks, GPT-OSS 20B is faster and uses less of your subscription quota.

Does Ollama handle Hindi?

Through GPT-OSS and Qwen models, yes — Hindi and basic Hinglish work. Regional dialects are weaker. Test before committing.

Try Ollama now →