Readit News
cwinter commented on The simple beauty of XOR floating point compression   clemenswinter.com/2024/04... · Posted by u/cwinter
sameoldtune · 2 years ago
This algorithm is pretty neat. I wonder if, for the use case outlined by the post, they could get even better compression by using fewer significant digits. For the purposes of recording times, 0.00768 is likely just as useful as 0.0076792240142822266.
cwinter · 2 years ago
yes! the very last example in the post shows what happens if you truncate the mantissa to just the 4 most significant bits
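To make the truncation idea concrete, here is a minimal Python sketch. It is not the code from the post: it assumes IEEE 754 64-bit floats, and the helper names (`truncate_mantissa`, `xor_trailing_zeros`) are made up for illustration. It zeroes all but the top few mantissa bits and then counts the trailing zero bits in the XOR of two values, which is the part an XOR-based scheme does not need to store.

```python
import struct

MANTISSA_BITS = 52  # explicit mantissa bits in an IEEE 754 double


def float_to_bits(x: float) -> int:
    """Reinterpret a 64-bit float as an unsigned 64-bit integer."""
    return struct.unpack(">Q", struct.pack(">d", x))[0]


def bits_to_float(b: int) -> float:
    """Reinterpret an unsigned 64-bit integer as a 64-bit float."""
    return struct.unpack(">d", struct.pack(">Q", b))[0]


def truncate_mantissa(x: float, keep_bits: int) -> float:
    """Zero all but the `keep_bits` most significant mantissa bits (hypothetical helper)."""
    if not 0 <= keep_bits <= MANTISSA_BITS:
        raise ValueError("keep_bits must be between 0 and 52")
    drop = MANTISSA_BITS - keep_bits
    mask = 0xFFFFFFFFFFFFFFFF & ~((1 << drop) - 1)
    return bits_to_float(float_to_bits(x) & mask)


def xor_trailing_zeros(a: float, b: float) -> int:
    """Count trailing zero bits in the XOR of two floats' bit patterns."""
    x = float_to_bits(a) ^ float_to_bits(b)
    if x == 0:
        return 64
    return (x & -x).bit_length() - 1


if __name__ == "__main__":
    a, b = 0.0076792240142822266, 0.0123  # second value is arbitrary
    ta, tb = truncate_mantissa(a, 4), truncate_mantissa(b, 4)
    print(ta, tb)
    print(xor_trailing_zeros(a, b), xor_trailing_zeros(ta, tb))
```

With 4 mantissa bits kept, the low 48 bits of every value are zero, so the XOR of any two truncated values has at least 48 trailing zeros. The cost is a relative error of up to 2^-4 (about 6%) for normal values, so whether this is "just as useful" depends on how much precision the timings need.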
cwinter commented on On getting poetry   newcriterion.com/issues/2... · Posted by u/apollinaire
solresol · 5 years ago
This being HN... where are the poets writing about Silicon Valley, about rationalism meetups, about starting one day too late for the IPO, about the feeling of finally debugging something that's been a recurring problem since before you started with the company... ?
cwinter · 5 years ago
Well if there was ever an appropriate time to plug my dark fantasy machine learning poem this is it: https://clemenswinter.com/2021/03/24/conjuring-a-codecraft-m...
cwinter commented on Mastering Real-Time Strategy Games with Deep RL: Mere Mortal Edition   clemenswinter.com/2021/03... · Posted by u/cwinter
CyberRage · 5 years ago
Interesting blog-post.

I found some similarities with what occurred with DeepMind's AlphaStar AI.

One of the weaknesses that seems to manifest in this piece too is the handling of unfamiliar scenarios.

The AI is very confused once it experiences something that was rarely seen in its learning data. Destroyer's big drones confused the bot quite a bit.

DeepMind solved it by intentionally creating agents that introduce different/bizarre strategies (which they called exploiters) in order to develop robustness against such strategies.

cwinter · 5 years ago
The bot has actually never seen Destroyer's big drones during training even once, so I found it somewhat surprising that it even works as well as it does!

Completely agree that adding something like the "League" used by AlphaStar would be one of the top priorities if you wanted to push this project further. I don't think CodeCraft is sufficiently complex to really allow for several very distinct strategies in the same way as StarCraft II, but I would still expect training against a larger pool of more diverse agents to increase robustness quite a bit.

u/cwinter

Karma: 409 · Cake day: April 19, 2015