cfusting (u/cfusting)

cfusting commented on Show HN: Iteratively Building Virtual Creatures in Minecraft github.com/cfusting/condi... · Posted by u/cfusting

ivanjermakov · a year ago

I would put "Minecraft" somewhere in the title.

cfusting · a year ago

Thanks!

cfusting commented on Show HN: Iteratively Building Virtual Creatures in Minecraft github.com/cfusting/condi... · Posted by u/cfusting

crooked-v · a year ago

> cool creatures that walk around and look like cheetahs or whatever

Once you have the actual behavior, everything else is just animation. Maybe you could evolve some keyframes fitting particular criteria?

cfusting · a year ago

Creative idea - I'll think about this.

cfusting commented on Show HN: Iteratively Building Virtual Creatures in Minecraft github.com/cfusting/condi... · Posted by u/cfusting

quantumwoke · a year ago

Cool project. You could definitely do more with this with more modern Minecraft tech like custom models and physics engines.

cfusting · a year ago

Awesome idea!

cfusting commented on Simulated data: the great equalizer in the AI race? medium.com/@charlesbrun/i... · Posted by u/cbrun

cfusting · 6 years ago

The author miss-understands how simulated data is created by GANS, VAEs, and other non-physics based simulations. Let's say you have a dataset and would like to create synthetic data using it and a GAN. Then you wish to estimate the distribution D of the data with a GAN. To do so the GAN learns the joint distribution P(X1, X2, ..., Xn) (where in the image case each X is usually a pixel) such that one may sample from D and obtain a new, synthetic image. Indeed, one will generate novel data but the distribution D that was estimated is merely a description of the original data at best and in practice a little bit (or a lot) off.

Now turn to the machine learning problem we sought to solve with the new synthetic data: what is the P(y|X1, X2, ..., Xn) where y is usually a class like "bird". In other words given an image predict its label. Since the data was generated knowing only the statistics of the original data, it can add no value beyond plausible examples developed using the original data itself.

Will this improve the accuracy of a model by providing additional edge case examples and filling in gaps? Somewhat. Will it understand data not represented by the original data and substitute for more thorough, diverse datasets? Absolutely not.

In terms of model improvement, yes synthetic data can help. In terms of the arms race? No. True examples provide knowledge that is unique. If one used a physics engine (GTA is popular for self-drivings cars) one can gather truly novel data; this is not the case for GANS.

It's concerning how willing people are to write articles on this subject without understanding the mathematics underlying the technology.

Do your homework and RTFM.

cfusting commented on PyTorch: An Imperative Style, High-Performance Deep Learning Library [pdf] arxiv.org/abs/1912.01703... · Posted by u/stablemap

scythe · 6 years ago

It’s true - there have been attempts to fix it, but nobody has created something other people want to use. The Torch project largely replaced all of the then-popular Lua packages — wxLua was dropped for Torch’s internal qtLua, the Lua concurrency libraries (Lanes and luaproc) were ignored in favor of zeroMQ, LPeg and Lua patterns were generally less popular than PCRE and Re2 bindings, et cetera. Maybe Torch is to blame (NIH syndrome), maybe the Lua packages weren’t up to the task, maybe communication within the community is too hard (Lua lacks centralized discussion channels where experienced users are regularly active), but in the end, Lua didn’t come away looking good here.

Learning a new language wasn’t too hard when that language was Python, after all.

cfusting · 6 years ago

We'd all be happier writing math; writing code is just a nuisance.

cfusting commented on PyTorch: An Imperative Style, High-Performance Deep Learning Library [pdf] arxiv.org/abs/1912.01703... · Posted by u/stablemap

scythe · 6 years ago

Man it’s kinda sad as a Lua fan to see so much interest in a project where the main goal is just to not use Lua.

I guess academics like familiarity and Lua insistently refuses to be like other languages (arrays and maps in one type, 1-based arrays, nonstandard builtin patterns, etc).

cfusting · 6 years ago

It's just about package support and the community. If researchers and practitioners were choosing a language based on merit alone it would probably be Julia for native speed and support for scientific computing. It's nice to have a toy language you appreciate but recall the goal is to write math into algorithms; the language is just tool.

cfusting commented on Kali Linux Adds 'Undercover' Mode to Impersonate Windows 10 bleepingcomputer.com/news... · Posted by u/praveenscience

vidarh · 6 years ago

In ~15 years of running Linux on laptops I've yet to have anyone care what I was running looked "weird", including when running e.g. bspwm or other tiling WMs that look totally alien to them.

cfusting · 6 years ago

Ditto. I didn't realize people were getting picked up enough by the authorities to merit building a Windows clone.

cfusting commented on Matlab vs. Julia vs. Python tobydriscoll.net/blog/mat... · Posted by u/mbauman

cfusting · 6 years ago

The choice of language usually comes down to the packages. In any of the three aforementioned languages one can easily and quickly manipulate matrices short of an unwillingness to learn. Julia is nice because it's fast with native code. Python is nice because of Scipy. Matlab is nice because it decides how to spend your money without cause.

I'm an AI researcher / practitioner. For me code accompanying papers is very useful and usually this code is in Python. Occasionally it's Matlab but let's be honest, who cares about those papers :). I'd love to use Julia but the package support just isn't there. Ironically people like me are supposed to be writing this code but with a demanding job and a family it's not likely I will be improving their DataFrame effort anytime soon.

Anyway the MAIN reason I use open source software is because if it isn't working correctly I simply fix the code myself. This isn't possible in the proprietary world. Why would you trust your research or production work with code you can't see and edit?

There's been a lot of talk about documentation. Docs are secondary sources, like WIRED, read the code if you're serious about being correct. Even (especially) hired hands make mistakes and fail to write good tests.

This article reminded me of the fictional Simpson's news article "Old Man Yells at Cloud". It's funny, and he may have a point, but it has no relevance.

cfusting commented on Matlab vs. Julia vs. Python tobydriscoll.net/blog/mat... · Posted by u/mbauman

enriquto · 6 years ago

> (...) ANYTHING other that flipping big matricies around.

but... what else is there, in life? Flipping big matrices around is nearly everything I do, and the python stuff seems too cumbersome for me to bother.

cfusting · 6 years ago

What kind of flipping are you talking about? I can transpose a matrix in numpy with X.T. What is cumbersome about this?