Readit News logoReadit News
aquafox commented on Left to Right Programming   graic.net/p/left-to-right... · Posted by u/graic
aquafox · 7 days ago
The consensus here seems to be that Python is missing a pipe operator. That was one of the things I quickly learned to appreciate when transitioning from Mathematica to R. It makes writing data science code, where the data are transformed by a series of different steps, so much more readable and intuitive.

I know that Python is used for many more things than just data science, so I'd love to hear if in these other contexts, a pipe would also make sense. Just trying to understand why the pipe hasn't made it into Python already.

aquafox commented on Cancer DNA is detectable in blood years before diagnosis   sciencenews.org/article/c... · Posted by u/bookofjoe
mapt · a month ago
This sort of thing is exactly like preventative whole body MRI scans. It's very noisy, very overwhelming data that is only statistically useful in cases we're not even sure about yet. To use it in a treatment program is witchcraft at this moment, probably doing more harm than good.

It COULD be used to craft a pipeline that dramatically improved everyone's health. It would take probably a decade or two of testing (an annual MRI, an annual sequencing effort, an annual very wide blood panel) in a longitudinal study with >10^6 people to start to show significant reductions in overall cancer mortality and improvements in diagnostics of serious illnesses. The diagnostic merit is almost certainly hiding in the data at high N.

The odds are that most of the useful things we would find from this are serendipitous - we wouldn't even know what we were looking at right now, first we need tons of training data thrown into a machine learning algorithm. We need to watch somebody who's going to be diagnosed with cancer 14 years from now, and see what their markers and imaging are like right now, and form a predictive model that differentiates between them and other people who don't end up with cancer 14 years from now. We [now] have the technology for picking through complex multidimensional data looking for signals exactly like this.

In the meantime, though, you have to deal with the fact that the system is set up exclusively for profitable care of well-progressed illnesses. It would be very expensive to run such a trial, over a long period of time, and the administrators would feel ethically bound to unblind and then report on every tiny incidentaloma, which completely fucks the training process.

This US is institutionally unable to run this study. The UK or China might, though.

aquafox · a month ago
> This sort of thing is exactly like preventative whole body MRI scans. It's very noisy, very overwhelming data that is only statistically useful in cases we're not even sure about yet. To use it in a treatment program is witchcraft at this moment, probably doing more harm than good.

The child of a friend of mine has PTEN-Hamartom-Tumor-Syndrom, a tendency to develop tumors throughout life due to a mutation in the PTEN gene. The poor child gets whole body MRIs and other check-ups every half year. As someone in biological data science, I always tell the parents how difficult it will be to prevent false positives, because we don't have a lot of data on routine full body check-ups on healty people. We just know the huge spectrum on how healthy/ok tissue looks like.

aquafox commented on Show HN: I'm an airline pilot – I built interactive graphs/globes of my flights   jameshard.ing/pilot... · Posted by u/jamesharding
aquafox · 2 months ago
I would plot the destination matrix as a jeatap where each row is a departure and each column an arrival and color is the number of trips. Additionally, you could cluster the rows and columns of this heatmap.
aquafox commented on Airpass – Easily overcome WiFi time limits   airpass.tiagoalves.me/... · Posted by u/herbertl
WD-42 · 2 months ago
This is a cool visualization, thanks.
aquafox · 2 months ago
On a related note: Transporting a human in a car is (in relation to weight and size) like using a standard shopping cart to transport two 1L bottles of water. So the next time you walk through a pedestrian area, imagine everyone carrying a bag would use a shopping cart instead. That would be a huge traffic jam -- exactly like what you see on the road!
aquafox commented on Last fifty years of integer linear programming: Recent practical advances (2024)   inria.hal.science/hal-047... · Posted by u/teleforce
aquafox · 2 months ago
Could someone maybe give a high-level explanation into why commercial ILP solvers (e.g. Gurobi) are that much better than free/open-source ones? Is it because ILP is inherently that difficult to solve (I know it's NP-hard), that the best solvers are just a large ensemble of heuristics for very specific sub-problems and thus no general "good" strategy has made it's way into the public domain?

Dead Comment

aquafox commented on Show HN: Rv, a Package Manager for R   github.com/A2-ai/rv... · Posted by u/Keats
arbutus8 · 3 months ago
Ultimately, you're right that `rv` and `renv` get you to the same spot, both create reproducible, isolated projects. `renv` has a few issues that we often hit that lead to `rv`.

`renv` is an iterative process of installing some packages, then snapshotting your project state, and then trying to reproduce. The time between the installation and snapshot can often lose information (think `install.packages("my_pkg", repos = "https://my-repo.com")`, your repo source is lost by time the snapshot occurs). You can also install incompatible versions over-time.

rv solves both of these problems because it will lock the source at the time of installation. Additionally, because it is declarative, we are able to resolve the full dependency tree before installing packages to ensure everything will be compatible.

While I am a big proponent of using rv, if renv fits your needs, then switching to rv may not be worth it. For our organization, we did have multiple issues with renv, so created a replacement for it that we hope others in the community will find useful to address their needs.

aquafox · 3 months ago
I had similar issues in the past. Setting up renv, everything seems good, but after working in a project for a few weeks and installing packages, renv constantly complains about the library being out-of-sync and resolving these complaints took way longer than new ones came around. I think renv has good intentions, but there are just too many edge cases (Bioconductor, installing an experimental package from Github, weird package dependencies etc.), that it always failed me in a real-world scenario.
aquafox commented on Stack Overflow is almost dead   blog.pragmaticengineer.co... · Posted by u/Jerry2
louison11 · 3 months ago
My heart goes to the stack overflow community which has always been very kind and helpful, essentially working for free. As a self-taught developer since the age of 8, I literally grew up learning how to code through SO, asking hundreds of questions and answering many more. So many bugs that would take 2-3 days to fix would eventually find their answer through it. But now ChatGPT does that in minutes… so it’s for the best!
aquafox · 3 months ago
Until there is a radically new version of {popular programming language} with breaking changes and no new and correct answers to train on.
aquafox commented on Modern LaTeX   github.com/mrkline/modern... · Posted by u/signa11
aquafox · 4 months ago
Don't get me wrong, I love LaTeX, having written my PhD thesis in it. But with the current tools, I would use Quarto instead. It's much easier, you can still "inject" LaTeX and it's quicker for less technical collaborators to adapt.
aquafox commented on The End of Programming   cacm.acm.org/opinion/the-... · Posted by u/cumo
aquafox · 4 months ago
> most software, as we know it, will be replaced by AI systems that are trained rather than programmed

The problem with this are all the edge cases. There are more ways unforseen circumstances can arise as you can train for. That's why you should do a lot of input checks in production.

u/aquafox

KarmaCake day603December 21, 2021View Original