The motivation is that large language models have a very straightforward task, predicting the next token, and the dataset is easy to get. With this app I aim to do two things: 1. Gather a fairly large dataset that captures the brush strokes for various art prompts. 2. Bootstrap an algorithm/model that can decompose any image, artwork, or illustration into brush strokes.
A longer-term goal for this app is to build an autocomplete (a Copilot or Grammarly equivalent) for art.
PS: This app has some bugs, so keep your expectations low.
1. Get CLIP embeddings for text & images 2. Put them in a vector database (Pinecone.io or something similar)
It's unreasonably effective. Check out this search engine: https://same.energy/
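For anyone who wants to see the shape of those two steps, here is a rough sketch. It makes no real CLIP or Pinecone calls: the 3-d vectors are made-up stand-ins for CLIP embeddings, and `InMemoryIndex` (with its `upsert`/`query` methods) is a hypothetical toy in place of a vector database. The only real idea shown is that text and images live in one embedding space, so a text query can be ranked against image vectors by cosine similarity.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


class InMemoryIndex:
    """Hypothetical toy stand-in for a vector database (not a real client API)."""

    def __init__(self):
        self._items = []  # list of (item_id, unit_vector)

    def upsert(self, item_id, vector):
        # Replace any existing entry with the same id, then store the new one.
        self._items = [(i, v) for i, v in self._items if i != item_id]
        self._items.append((item_id, normalize(vector)))

    def query(self, vector, top_k=3):
        # Brute-force cosine similarity against every stored vector.
        q = normalize(vector)
        scored = [
            (item_id, sum(a * b for a, b in zip(q, v)))
            for item_id, v in self._items
        ]
        scored.sort(key=lambda pair: pair[1], reverse=True)
        return scored[:top_k]


# Step 1 (assumed): embeddings would come from a CLIP model.
# These toy 3-d vectors are invented purely for illustration.
index = InMemoryIndex()
index.upsert("cat.jpg", [0.9, 0.1, 0.0])
index.upsert("dog.jpg", [0.1, 0.9, 0.0])
index.upsert("car.jpg", [0.0, 0.1, 0.9])

# Step 2: embed the text query into the same space and look up neighbors.
text_query = [0.8, 0.2, 0.0]  # pretend embedding of "a photo of a cat"
results = index.query(text_query, top_k=2)
for item_id, score in results:
    print(item_id, round(score, 3))
```

A real setup would swap the toy vectors for actual model outputs and the brute-force scan for an approximate nearest-neighbor index, but the flow (embed, store, query by similarity) stays the same.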
Can't wait to get home and actually look carefully. I suspect I'll appreciate it a lot more knowing what it actually is.