cwinter (u/cwinter) - Readit News

cwinter commented on The simple beauty of XOR floating point compression clemenswinter.com/2024/04... · Posted by u/cwinter

sameoldtune · 2 years ago

This algorithm is pretty neat. I wonder if for the use case outlined by the post they could get even better compression through the use of fewer significant digits. For the purposes of recording times, 0. 00768 is likely just as useful as 0.0076792240142822266

cwinter · 2 years ago

yes! the very last example in the post shows what happens if you truncate the mantissa to just the 4 most significant bits

cwinter commented on On getting poetry newcriterion.com/issues/2... · Posted by u/apollinaire

solresol · 5 years ago

This being HN... where are the poets writing about Silicon Valley, about rationalism meetups, about starting one day too late for the IPO, about the feeling of finally debugging something that's been a recurring problem since before you started with the company... ?

cwinter · 5 years ago

Well if there was ever an appropriate time to plug my dark fantasy machine learning poem this is it: https://clemenswinter.com/2021/03/24/conjuring-a-codecraft-m...

cwinter commented on Mastering Real-Time Strategy Games with Deep RL: Mere Mortal Edition clemenswinter.com/2021/03... · Posted by u/cwinter

CyberRage · 5 years ago

Interesting blog-post.

I found some similarities with what occurred with Deepmind's Alphastar AI.

One of the weaknesses that seem to manifest in this piece too is the handling of unfamiliar scenarios.

The AI is very confused once it experiences something that was rarely seen in its learning data. Destroyer's big drones confused the bot quite a bit.

Deepmind solved it by intentionally creating agents that introduce different\bizzare strategies(which they called exploiters) in order to develop robustness against such strategies.

cwinter · 5 years ago

The bot has actually never seen Destroyer's big drones during training even once, so I found it somewhat surprising that it even works as well as it does!

Completely agree that adding something like the "League" used by AlphaStar would be one of the top priorities if you wanted to push this project further. I don't think CodeCraft is sufficiently complex to really allow for several very distinct strategies in the same way as StarCraft II, but I would still expect training against a larger pool of more diverse agents to increase robustness quite a bit.