Readit News logoReadit News
blackcat201 commented on Gödel Agent: A self-referential agent framework for recursive self-improvement   arxiv.org/abs/2410.04444... · Posted by u/tkgally
blackcat201 · a year ago
Shameless plug, for anyone who's interested in "self-improvement" agent check out StreamBench[1] where we benchmark and try out what's essential for improvements in online settings. Basically we find feedback signal is vital and the stronger the signal the more improvement you can get if you were able to feed it back to the agent in terms of weights (LoRA) or in-context examples.

[1] https://arxiv.org/abs/2406.08747

blackcat201 commented on Structured Outputs in the API   openai.com/index/introduc... · Posted by u/davidbarker
blackcat201 · a year ago
Do beware on some reasoning task, our recent work[0] actually found it may cause some performance degradation as well as possible reasoning weakening in JSON. I really hope they fix this in the latest GPT-4o version.

[0] https://arxiv.org/abs/2408.02442

blackcat201 commented on 7.4 earthquake in Taiwan, 34km depth   earthquake.usgs.gov/earth... · Posted by u/throwaway598
stewpy · a year ago
Are fabs knocked offline?
blackcat201 · a year ago
The standard operation is to stop and check if any machine was out of calibration. So yes
blackcat201 commented on Why host your own LLM?   marble.onl/posts/why_host... · Posted by u/andy99
blackcat201 · 2 years ago
I own my LLM not because I need it now but having the luxury to fall back if openai ran out of money
blackcat201 commented on Do we really need a specialized vector database?   modelz.ai/blog/pgvector... · Posted by u/gaocegege
blackcat201 · 2 years ago
I have been following the vector database trend back in 2020 and I ended up with the conclusion: vector search features are a nice to have features which adds more value on existing database (postgres) or text search services (elasticsearch) than using an entirely new framework full of hidden bugs. You could get way higher speedup when you are using the right embedding models and encoding way than just using the vector database with the best underlying optimization. And the bonus side is that you are using a stack which was battle tested (postgres, elasticsearch) vs new kids (pinecone, milvus ... )
blackcat201 commented on Ask HN: Could you share your personal blog here?    · Posted by u/revskill
blackcat201 · 2 years ago
https://theblackcat102.github.io/

Recently I am ranting the AI trends and some short writeup of things I read

u/blackcat201

KarmaCake day266July 14, 2020View Original