apitman · 6 months ago
I really like Jan, especially the organization's principles: https://jan.ai/

Main deal breaker for me when I tried it was I couldn't talk to multiple models at once, even if they were remote models on OpenRouter. If I ask a question in one chat, then switch to another chat and ask a question, it will block until the first one is done.

Also Tauri apps feel pretty clunky on Linux for me.

diggan · 6 months ago
> Also Tauri apps feel pretty clunky on Linux for me.

All of them, or this one specifically? I've developed a bunch of tiny apps for my own use (on Linux) with Tauri (the largest is maybe 5-6K LoC) and they've always felt snappy to me, mostly doing all the data processing in Rust and the UI part in ClojureScript+Reagent.

signbcc · 6 months ago
> especially the organization's principles

I met the team late last year. They’re based out of Singapore and Vietnam. They ghosted me after promising to have two follow-up meetings, and were unresponsive to any emails, like they just dropped dead.

Principles and manifestos are a dime a dozen. It matters if you live by them or just have them as PR pieces. These folks are the latter.

dcreater · 6 months ago
With a name like Menlo Research, I assumed they were based in Menlo Park. They probably intended that.
_the_inflator · 6 months ago
Yep. I really see them as an architecture blueprint with a reference implementation and not so much as a one size fits all app.

I stumbled upon Jan.ai a couple of months ago when I was considering a similar app approach. I was curious because Jan.ai went way beyond what I considered to be limitations.

I haven’t tried Jan.ai yet; I see it as an implementation, not a solution.

inkyoto · 6 months ago
> Main deal breaker for me when I tried it was I couldn't talk to multiple models at once […]

… which seems particularly strange considering that the cloned GitHub repository is 1.8 GiB and swells to 4.8 GiB after running «make build». I tried to build it locally (which failed anyway).

It is startling that a relatively simple UI frontend can add 3 GiB+ of build artefacts alone; that is the scale of a Linux kernel build.

c-hendricks · 6 months ago
Yeah, webkit2gtk is a bit of a drag
roscas · 6 months ago
Tried to run Jan but it does not start the llama server. It also tries to allocate 30 GB, which is the size of the model, but my VRAM is only 10 GB and the machine has 32 GB of RAM, so that does not make sense. Ollama works perfectly with 30B models. Another thing that is not good is that it makes constant connections to GitHub and other sites.
hoppp · 6 months ago
It probably loads the entire model into RAM at once, while Ollama does not; it has a better loading strategy.
blooalien · 6 months ago
Yeah, if I remember correctly, Ollama loads models in "layers" and is capable of putting some layers in GPU RAM and the rest in regular system RAM.
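
A rough sketch of that idea in Python: assuming every layer is the same size (real models are not that uniform) and that some VRAM is held back for context and scratch buffers, you can estimate how many layers fit on the GPU. The function name, the 48-layer count, and the 1 GiB reserve are made up for illustration; this is not Ollama's actual placement logic.

```python
def split_layers(model_mib, n_layers, vram_mib, reserve_mib=1024):
    """Estimate how many layers fit in VRAM; the rest stay in system RAM.

    Simplifying assumptions (not Ollama's real algorithm):
    - all layers are the same size
    - reserve_mib of VRAM is kept free for context and scratch buffers
    """
    if n_layers <= 0:
        raise ValueError("n_layers must be positive")
    per_layer_mib = model_mib / n_layers
    usable_mib = max(0, vram_mib - reserve_mib)
    gpu_layers = min(n_layers, int(usable_mib // per_layer_mib))
    return gpu_layers, n_layers - gpu_layers


if __name__ == "__main__":
    # 30 GiB model, hypothetical 48 layers, 10 GiB of VRAM
    gpu, cpu = split_layers(30 * 1024, 48, 10 * 1024)
    print(f"{gpu} layers on GPU, {cpu} layers in system RAM")
```

Under those assumptions, a 30 GiB model on a 10 GiB card ends up with 14 layers on the GPU and 34 in system RAM, which is why a partial offload can work where loading everything into VRAM cannot.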

SilverRubicon · 6 months ago
Did you see the feature list? It does not deny that it makes connections to other sites.

- Cloud Integration: Connect to OpenAI, Anthropic, Mistral, Groq, and others

- Privacy First: Everything runs locally when you want it to

mathfailure · 6 months ago
Is this an alternative to OpenWebUI?
apitman · 6 months ago
Not exactly. OWUI is a server with a web app frontend. Jan is a desktop app you install. But it does have the ability to run a server for other apps like OWUI to talk to.
ekianjo · 6 months ago
Openweb-ui does not include a server.
PeterStuer · 6 months ago
More an alternative to LM Studio I think from the description.
apitman · 6 months ago
Jan also supports connecting to remote APIs (like OpenRouter), which I don't think LM Studio does
jwildeboer · 6 months ago
My name is Jan and I am not an AI thingy. Just FTR. :)
underlines · 6 months ago
Jan here too, and I work with LLMs full time and I'm a speaker about these topics. Annoying how many times people ask me if Jan.ai is me lol
dsp_person · 6 months ago
We need a steve.ai
klausa · 6 months ago
So this is how women named Siri felt in 2011.
lagniappe · 6 months ago
Hello Jan ;)
biinjo · 6 months ago
I'm confused. Isn’t the whole premise of Ollama that it's run locally? What’s the difference or USP when comparing the two?
moron4hire · 6 months ago
That's not the actual tagline being used in the repo. The repo calls itself an alternative to ChatGPT. Whoever submitted the link changed it.
hoppp · 6 months ago
I think it's an alternative because Ollama has no UI and is hard to use for non-developers who will never touch the CLI.
simonw · 6 months ago
Ollama added a chat UI to their desktop apps a week ago: https://ollama.com/blog/new-app
reader9274 · 6 months ago
Tried to run gpt-oss:20b in Ollama (runs perfectly) and tried to connect Ollama to Jan, but it didn't work.
accrual · 6 months ago
I got Jan working with Ollama today. Jan reported it couldn't connect to my Ollama instance on the same host despite it working fine for other apps.

I captured loopback and noticed Ollama returning an HTTP 403 forbidden message to Jan.

The solution was to set these environment variables:

    OLLAMA_HOST=0.0.0.0
    OLLAMA_ORIGINS=*

Here are the remaining steps:

- Jan > Settings > Model Providers

- Add new provider called "Ollama"

- Set API key to "ollama" and point to http://localhost:11434/v1

- Ensure variables above are set

- Click "Refresh" and the models should load

Note: Even though an API key is not required for local Ollama, Jan apparently doesn't consider it a valid endpoint unless a key is provided. I set mine to "ollama" and then it allowed me to start a chat.
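
For anyone debugging the same thing, here is a minimal Python sketch of the check, using only the standard library. It requests the OpenAI-compatible `/models` route under the base URL from the steps above and can optionally send an `Origin` header, so you can see whether the origin check is what produces the 403. The helper names and the `tauri://localhost` origin value are assumptions for illustration; I don't know which origin Jan actually sends, so take the real value from your own capture.

```python
import json
import urllib.error
import urllib.request

BASE_URL = "http://localhost:11434/v1"  # endpoint from the steps above
API_KEY = "ollama"  # placeholder key, same value as in the Jan settings


def build_models_request(base_url=BASE_URL, api_key=API_KEY, origin=None):
    """Build a GET request for the OpenAI-compatible model list."""
    req = urllib.request.Request(base_url.rstrip("/") + "/models")
    req.add_header("Authorization", f"Bearer {api_key}")
    if origin is not None:
        # Browser-like clients send an Origin header; the value is up to you.
        req.add_header("Origin", origin)
    return req


def probe(req, timeout=5):
    """Return (status, model_ids); status is None if the server is unreachable."""
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            body = json.load(resp)
            return resp.status, [m["id"] for m in body.get("data", [])]
    except urllib.error.HTTPError as err:
        # e.g. 403 when the origin is rejected
        return err.code, []
    except (urllib.error.URLError, OSError, ValueError):
        return None, []


if __name__ == "__main__":
    # Hypothetical origin value; replace it with the one from your capture.
    for origin in (None, "tauri://localhost"):
        status, models = probe(build_models_request(origin=origin))
        print(f"origin={origin!r}: status={status}, models={models}")
```

If the request without an `Origin` header succeeds and the one with it comes back 403, the origin allow-list is the likely culprit rather than the host binding.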

thehamkercat · 6 months ago
Exactly: https://github.com/menloresearch/jan/issues/5474

Can't make it work with the Ollama endpoint.

This seems to be the problem, but they're not focusing on it: https://github.com/menloresearch/jan/issues/5474#issuecommen...

semessier · 6 months ago
still looking for vLLM to support Mac ARM Metal GPUs
baggiponte · 6 months ago
Yeah. The docs tell you that you should build it yourself, but…
tough · 6 months ago
but unlike for CUDA, there are no custom kernels for inference in the vLLM repo...

I think