Once you find the file you're looking for you can resize it, export it to Adobe Premiere Pro, or drag and drop it into another app.
I built Desktop Docs because I keep tons of media files on my computer and I can never remember where I save stuff (lots of screenshots, memes, and downloads). The Apple Photos app also only supports photos in your iCloud.
Desktop Docs supports adding folders or individual files to an AI Library where you can search by the contents of your files, not just file titles.
You can search by objects ("cardboard box"), actions ("man smiling", "car driving"), by emotion ("surprised woman", "sad cowboy"), or the text in the frame (great for screenshots or memes).
It's also 100% private. Make any media searchable without it ever leaving your computer.
How I built it: - 100% Javascript (I'm using Electron JS and React JS). - Embedding generation (CLIP from OpenAI is used to compute the image embeddings and text embeddings for user queries). - Redis (storing and doing KNN search on the embeddings with this DB). - Image/video editing (the app ships with FFmpeg binaries to explode videos into individual frames and scale images).
Demo: https://www.youtube.com/watch?v=EIUgPNHOKKc
If there are any features you'd like to see in Desktop Docs or want to learn more about how I built it, drop me a comment below. Happy to share more.
My first impression was that you'd "just" upload my pictures to OpenAI with a prompt and call it a day.
Maybe highlight that it uses ML running locally? (I see that it's in the FAQ, but in the title)
I'd love to know the specifics. if there's an installable, reproducible local build w/ regular model/updates/maintenance that I could subscribe to, I'd be an interested party to a tool like that.
Apple is a good example: https://www.apple.com/macbook-pro/
I'm so glad someone remembers how software is supposed to work.
Thanks.
https://www.ghacks.net/2022/06/11/new-spotlight-features-in-...
I haven't been able to get this to work in Finder; can you please share which search attribute you're using?
It should do a visual search for some common things, as well as OCR in image text.
https://github.com/jdberry/tag/
Demo looks neat though. I wonder if it can tell me which of my video files are SD, HD, and 4K. I've ripped so much media that I've lost track and did not name my files in such a way that it's obvious what resolution each one is. Something like that probably doesn't even need AI, just a peek at the existing metadata.
I'd personally love it if this also indexed text files and pdfs.
I'm also considering making the models more a la carte so you can manage which ones your want to use depending on your use case. That way you could use for text, image/video, or both.
Of course I want to search for data in my photo downloads / screenshots folders and be able to pull / easily grab / copy the pic(s) to another folder or upload to a site and then paste the text..
I guess chaining this into notion / obsidian or similar would be beneficial as well.
A time-based trial could be good in that respect, and as a benefit for the developer gets a satisfied user hooked-in and not wanting to lose access.
A reliably good app that helps me delete duplicates and shrink the photo library size would be amazing. Could this do something along those lines?
Similarly, I want to delete low quality images: blurred, etc. I don't know of an app that does that.
Lots of articles online that look like semi-blogspam, and I'll go through each product in them... but if someone has a specific human-powered recommendation I'll happily listen :)
Dead Comment
I think practically it doesn’t matter. This is two people selling something for $20 (or maybe $50, I can’t tell). They tell you beforehand, so you know. It’s unlikely they operate in jurisdictions that force software to be refundable. So I guess you can sue them to get your money back. Or chargeback through your credit card.
This is why we can’t have nice things. I miss the old internet where it was just people sending small amounts of money to other people for cool things (people mailed me checks in 1996). And users didn’t have the expectation of legal expenses to account for unlikely edge cases.
IMHO, it would be better to eat a few refunds than risk changing my payment provider.
Just trying to build cool stuff and put it out there into the world.