This is Jared and Art. We met on HN and started building together.
Over the last few months we've been thinking a lot about how AI agents are going to impact the future. We want agents to be something that's actually useful for normal people as well as the 10x'ers. This lead us to building Meha over the last few months, our first swing at our vision! We saw OpenAI release Operators then we said f*k it let's post.
Meha is a desktop app that uses your Chrome browser to execute tasks in the background. It controls your installed Chrome browser and uses LLMs with playwright to plan and execute actions to accomplish your task. You get to see each planning step the bot is doing and have access to its long term memory.
Meha also uses its own file system and can export files for download. Another thing we've been focused on in multi-agent workflows and Meha can run many bots at the same time. One of the reasons why we can ship this for free in the mean time is because of how cheap the agents are. But we are planning to have a Pro version for power users. We prefer not to raise since we're against VC funding.
We have been influenced by a lot of concepts in probabilistic robotics and RL to develop a fairly robust 'agentic' framework. As well as an algorithm for efficiently converting/compressing large html pages into a semantic format. If you're interested we will open source this asap in an SDK (will work with all OpenAI API spec LLMs and with llama.cpp) let us know.
We're currently in beta and working on figuring out what this product will become and super stoked! Let us know what you think. To get access to Meha we have links on our discord to download (Both MacOS and Windows is available). Please give us all the feedback/criticism (even if you hate AI).
Link to Meha: https://meha.ai
For the love of humanity please open source this. This seems tremendously useful by itself.
Perhaps you could create both a Public and Corporate version of the extension, like Copilot does. The Corporate version could have access to all browser data but not share it beyond the bounds of the company.
I then asked to create a CSV of the first 100 flats corresponding to my criteria, it created only 3 entries, purely hallucinated.
But let me know your thougths on rtrvr.ai, looks like we are targeting the same use cases of automation, scraping, research?
Deleted Comment
Happy to hear all the thoughts for those who try the app out! Even if you just have ideas about how agents might look in their final form, there's so many avenue's this tech can take and we have a ton of wild ideas we'll be building so stay tuned. :D
Some demos we will have are:
- Logging into twitter and tweeting
- Finding information from google maps of any nearby business whether that's for leads or finding local restaurant options.
- Scraping anything from wikipedia like current events etc.
- And more!
If it's the latter, how do you go about making sure you're not about to download malicious data to my machine?
For the url generation we do we have safety checks for the urls however it's simply in the prompting. I would love to hear what sort of safety suggestions you have and/or concerns about this sort of experience. Right now we're still figuring out how best to enable people to utilize agents safely.