Bottom line, every single post requires approximately 10 different prompts to refine the extraction.
ChatGPT does an incredible job parsing, but then lots of effort goes into normalizing and deduping each field. Long story short, your results look quite good to me!
Remote: yes
Willing to relocate: maybe
Technologies: Go, JavaScript, TypeScript, Python, Postgres, SQLite, SQL, PyTorch, FastAI, LLM, NextJS, React, GCP
Résumé/CV: https://u-turn.dev
Email: dudley@u-turn.dev
Product engineering leader helping build great teams. Twenty-plus years of software development. Over the past decade have helped turn around multiple products and teams in crisis.
Currently focusing on helping organizations apply deep learning and LLMs for information extraction from unstructured sources. A recent project in that vain: https://hnjobs.u-turn.dev
Deleted Comment
From a dev perspective this area has a ton of super interesting algorithmic / math / data structure applications, and computational geometry has always been special to me. It's a lot of fun to work on.
If anyone here is interested in this as a user, I'd love for any feedback or comments, here or you can email me directly: tyler@vexlio.com.
Some pages the HN crowd might be interested in:
* https://vexlio.com/blog/making-diagrams-with-syntax-highligh... * https://vexlio.com/solutions/state-diagram-maker/ * https://vexlio.com/blog/speed-up-your-overleaf-workflow-fast...