Posted by u/syx 5 years ago
Ask HN: What if Stack Overflow suddenly decides to shut down?
As the title says, what would happen if Stack Overflow decides to interrupt the service? Realistically speaking how much damage could that do to everyday work?
nonameiguess · 5 years ago
The US intelligence community actually keeps a clone of Stack Overflow that is updated every 12 hours and copied to servers on the classified version of the Internet so that people developing classified software can use it without needing to turn around to their unclassified workstations. I'm sure they're not alone in this, so the entirety of the answers there could be restored from mirrors or just used from the mirrors if the main site disappeared.

Nearly all of the information there is also available in the public documentation of whatever the question asker is asking about anyway. Forcing developers to use real documentation instead of Stack Overflow would not likely hurt everyday work. It might become harder to find literal worked examples, but even those are mostly duplicated elsewhere and SO is just making it easier on search engines to find it.

svara · 5 years ago
> Nearly all of the information there is also available in the public documentation of whatever the question asker is asking about anyway.

That's really not true in any meaningful way. Maybe the information is somewhere, but some Stackoverflow answers are so good that they are the best resource on a specific narrow topic that you will find online.

See for example something like [0]. That's the type of answer you would get from a good book or teacher, good luck finding something comparable in the often terse official python docs.

Sure, there are a lot of lazy SO Q/As that sound like someone wasn't doing their homework, but that's far from the only type of content there.

[0] https://stackoverflow.com/questions/100003/what-are-metaclas...

(Edit: another great example is basically anything that concerns advanced usage of matplotlib. That library is so weird and the docs so terrible that working off of SO examples really is the most efficient way to use it.)

marcosdumay · 5 years ago
Yeah, that's the kind of answer that you don't get anymore on SO, mostly because that kind of question isn't allowed anymore.

Nowadays questions must be much more specific, with the consequence that the answers are much less generally useful.

PaulHoule · 5 years ago
Some StackOverflow answers are so bad they are the worst resource on a topic.
dataflow · 5 years ago
> Forcing developers to use real documentation instead of Stack Overflow would not likely hurt everyday work.

For beginner-level stuff maybe, but there's so much on StackOverflow about bugs, pitfalls, caveats, breaking changes, rationales, use cases, background, etc. that you can't find easily (if at all) in software documentation, if the documentation even pops up in your searches.

qntmfred · 5 years ago
> there's so much on StackOverflow about bugs, pitfalls, caveats, breaking changes, rationales, use cases, background, etc

this question is closed as off-topic.

netsharc · 5 years ago
How do you know that info in the first paragraph?

On top of that security, it'd also be interesting to monitor what someone (based on their IP and browser fingerprint) was looking up on SO, you could probably figure out what they were working on that way.

Although I guess when you work for the spooks, they could figure out how to mask your IP so that each query would come from a different IP somewhere in the world (basically I'm describing Tor, I wonder if they have their own Tor network).

langitbiru · 5 years ago
> On top of that security, it'd also be interesting to monitor what someone (based on their IP and browser fingerprint) was looking up on SO, you could probably figure out what they were working on that way.

When the security incident happened last year on SO, the SO team monitored what information the attacker searched for on SO. So I guess some people want some privacy when searching for information on SO.

https://stackoverflow.blog/2021/01/25/a-deeper-dive-into-our...

rovr138 · 5 years ago
The second part, regarding mirrors, https://archive.org/details/stackexchange
ArtWomb · 5 years ago
>>> The US intelligence community actually keeps a clone of Stack Overflow

I feel this is a prudent idea. Not necessarily for a full-blown start-up, but as a community-funded resource. If you stripped out all the extras and just cloned the questions and top 2 answers, the text-based data set itself would be very manageable: several gigabytes in scale. Not to mention its applicability for ML/NLP/codegen research ;)
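A rough sketch of the "questions plus top 2 answers" filter, in case anyone wants to try it. It assumes the dump's Posts file is XML made of `row` elements with `Id`, `PostTypeId` (2 meaning answer), `ParentId` and `Score` attributes; the sample rows below are made up, so check the real schema before relying on this.

```python
import heapq
import io
import xml.etree.ElementTree as ET
from collections import defaultdict


def top_answers(xml_source, k=2):
    """Return {question_id: [(score, answer_id), ...]} with at most k best answers each."""
    best = defaultdict(list)  # one small min-heap of (score, answer_id) per question
    for _event, elem in ET.iterparse(xml_source, events=("end",)):
        # Assumption: PostTypeId "2" marks an answer and ParentId points at its question.
        if elem.tag == "row" and elem.get("PostTypeId") == "2":
            entry = (int(elem.get("Score", "0")), int(elem.get("Id")))
            heap = best[int(elem.get("ParentId"))]
            if len(heap) < k:
                heapq.heappush(heap, entry)
            else:
                heapq.heappushpop(heap, entry)  # keep only the k highest scores
        elem.clear()  # drop the row's attributes once it has been processed
    return {q: sorted(heap, reverse=True) for q, heap in best.items()}


# Made-up sample rows standing in for the real Posts file.
SAMPLE = b"""<posts>
  <row Id="1" PostTypeId="1" Score="10" Title="Hypothetical question" />
  <row Id="2" PostTypeId="2" ParentId="1" Score="3" />
  <row Id="3" PostTypeId="2" ParentId="1" Score="7" />
  <row Id="4" PostTypeId="2" ParentId="1" Score="5" />
</posts>"""

result = top_answers(io.BytesIO(SAMPLE))
print(result)
```

Streaming with `iterparse` instead of loading the whole file matters here, since the real file is far too large to parse in one go.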

rovr138 · 5 years ago
norswap · 5 years ago
> Nearly all of the information there is also available in the public documentation of whatever the question asker is asking about anyway.

In the absolute that may be true, but even if a low percentage of those are not available elsewhere, they are absolutely crucial. At least, they have been crucial to me.

I'm too young to know if it's a recent development, but documentation these days ain't worth much.

emayljames · 5 years ago
Oh, no; it has always been that way. Either too much information (needle in a haystack), or undocumented (think Apple's latest Swift). The best documentation I've seen is the PHP site, which not only has all functions listed, but gives official examples, with user comments that can be voted on.
mraza007 · 5 years ago
Hey, when you mentioned the US intelligence community, do you mean government agencies such as the Army, CIA, or DoD?

I’m just curious to know

QuadmasterXLII · 5 years ago
The app Kiwix lets you keep a static copy of Stack Overflow on your phone; it’s only a few gigabytes. It also has Wikipedia!
kevsim · 5 years ago
Dash on macOS also lets you download StackOverflow for offline use. It was my goto for working on airplanes, way back when traveling on airplanes was a thing.
lmm · 5 years ago
Does it actually work? I tried a couple of times and the downloads never finished.
livre · 5 years ago
It worked for me when I tried it but it was so slow that unless you are running from an SSD it's not worth it (I had it on an external disk). Don't download everything, only get what you really need, the phone app was (maybe still is) a bit buggy.
layer8 · 5 years ago
Is there something like Dash/Kiwix for Windows?
livre · 5 years ago
Yes, Kiwix is available for Windows: https://www.kiwix.org/en/downloads/

There's also Zeal: https://zealdocs.org/

And DevDocs: https://devdocs.io/about

gus_massa · 5 years ago
The Q&A have a very permissive license, so anyone can try to host a static version. (This was on purpose, the idea is that people would have been less prone to answer, if the answers were trapped in SO.)

Moreover, there are a few spammer sites that already do this, and one of SO's problems is competing with them to rank higher in the Google results.

So ... you will probably be able to read the Q&A on other sites, and after a while SO-like sites for niches will appear.

dd_roger · 5 years ago
In my opinion the usefulness of Stack Overflow is vastly overstated. It's nice to find a few examples to get started in a new framework that lacks documentation (when there's documentation at all...) but generally speaking non-trivial questions never get answered anyway. I haven't used it in at least 6 months, maybe even a year if my memory serves me well.
Netcob · 5 years ago
My experiences with Stack Overflow until recently:

- Using it as a useful resource for small issues

- Seeing memes about how programmers essentially just copy/paste code from Stack Overflow all day long

- Reading comments about how Stack Overflow is severely outdated and any question will be shut down immediately

And then I had an issue with library A which was using library B but had a small inconsistency in how it used it that made my use case very difficult. So I posted a question. The developer of library B appeared, agreeing that it's a problem. Followed by the developer of library A, who agreed as well and said the fix would be in the next release. 2-3 days later, my bug was fixed by updating library A.

With a sample size of one, that doesn't really say anything, but my view of that website is a bit biased now.

marcosdumay · 5 years ago
That depends on your environment.

When programming in Haskell or Rust, going into SO is basically a waste of time, and the documentation or some specific search engine (yeah, hoogle) are the place to go.

When programming in C#, SO is basically irreplaceable, the knowledge there is extremely important and you just won't find it anywhere else.

When programming in Python or Javascript, it completely depends on what libraries and frameworks you are using.

sdevonoes · 5 years ago
> When programming in C#, SO is basically irreplaceable, the knowledge there is extremely important and you just won't find it anywhere else.

Not even in the official documentation? Wow.

EamonnMR · 5 years ago
Stack overflow is very useful if someone already asked your question back when it was still accepting new questions.
herodoturtle · 5 years ago
I say this somewhat tongue-in-cheek, but I suspect it would probably result in more netsplits on freenode, owing to the inevitable influx of programmers who would now seek assistance on IRC like the good old days.

I've definitely noticed a downward trend of activity in the assorted programming chat channels I've frequented over the last 25 years or so, which I guess could be a result of the growing popularity of Q&A sites like stackoverflow.

But I still think it's one of the best places to seek assistance - plus, helping out programmers on IRC in real time is a good way to stay sharp and on top of the continuously evolving software landscape.

gpanders · 5 years ago
IRC is the first place I go if I can’t find an answer in the documentation or a search engine. If that doesn’t work, I’ll turn to a mailing list.

Personally, I find both of these options have less friction than starting a new question on SO.

herodoturtle · 5 years ago
100% agreed.

I also just simply enjoy lurking on IRC to assist other programmers in need. I learnt so much in those channels when first starting out, and these days I have a bit more spare time, so I enjoy "paying it forward" by helping others much like I was helped back in the day.

Plus there are a bunch of really nice programmers hanging out there, so there's a social aspect to it as well.

dan-robertson · 5 years ago
The key offering of stack overflow these days is less the ability to ask a stupid question and get told that you’re an idiot, and more the existing wealth of answers to questions that may be found by a search engine. I don’t think irc really produces answers which are easy to find.
pan69 · 5 years ago
I remember something Jeff talked about during the original Stack Overflow podcasts, that Google was the main interface for Stack Overflow. As in, your entry into Stack Overflow would for the majority of people be through search. And to speak for myself, this is how I use Stack Overflow 99% of the time.

This makes me wonder. With just a copy of the SO data and without the Google algorithms to bring you to the right question, the experience would probably be different when working off a copy.

fiftyacorn · 5 years ago
I think it would have had more of an impact five years ago. Most of the best content is on blogs now, not SO. Most of the best SO answers were really blog posts anyway.
ksm1717 · 5 years ago
It would be nice if those blog posts were as easily aggregated as stack. I know many have tried to solve the problem, but it always feels like great luck when I find a good blog post
Macha · 5 years ago
Finding more stuff on GitHub issues myself, but agree with the general sentiment that more content is elsewhere these days
umvi · 5 years ago
Depends. General language questions are amazing on Stack Overflow. Once you start getting into the quirks and bugs of individual frameworks, GitHub issues can start to be more helpful.
rohan1024 · 5 years ago
I wonder if we could implement a decentralized version of SO. It could be a CLI app, since it is used by developers only. The app could keep all the questions and answers on topics relevant to the developer locally. This would allow developers to find answers offline as well. Since it is only text data, it should not take more than a GB.
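The local lookup part could be as small as an inverted index over the stored questions. A minimal sketch, where the `LocalIndex` class and the sample questions are made up for illustration and there is no ranking, stemming or syncing between peers:

```python
import re
from collections import defaultdict


class LocalIndex:
    """Hypothetical in-memory keyword index over locally stored Q&A text."""

    def __init__(self):
        self.docs = {}                    # doc_id -> original text
        self.postings = defaultdict(set)  # token -> ids of docs containing it

    @staticmethod
    def tokenize(text):
        # Lowercase words, keeping '#' and '+' so "c#" and "c++" stay searchable.
        return re.findall(r"[a-z0-9#+]+", text.lower())

    def add(self, doc_id, text):
        self.docs[doc_id] = text
        for token in self.tokenize(text):
            self.postings[token].add(doc_id)

    def search(self, query):
        """Return sorted ids of docs that contain every token of the query."""
        tokens = self.tokenize(query)
        if not tokens:
            return []
        hits = set.intersection(*(self.postings.get(t, set()) for t in tokens))
        return sorted(hits)


index = LocalIndex()
index.add(1, "How do I reverse a list in Python?")
index.add(2, "What are metaclasses in Python?")
index.add(3, "How do I reverse a string in C#?")

print(index.search("reverse python"))
print(index.search("python"))
```

A real tool would want the index on disk and some relevance ordering, but the offline search itself needs nothing beyond the text.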
rovr138 · 5 years ago
SO is geared toward developers, but if someone takes this on, ideally it would work like the StackExchange sites, where one can exist for each topic.

- https://math.stackexchange.com/

- https://stats.stackexchange.com/

- https://3dprinting.stackexchange.com/

- https://ai.stackexchange.com/

- https://apple.stackexchange.com/

- https://serverfault.com/

- https://superuser.com/

- https://drupal.stackexchange.com

There’s a ton more of these.

rovr138 · 5 years ago
The "not more than a GB" part is very wrong:

    stackoverflow.com-Badges.7z       07-Dec-2020 17:09  273.8M
    stackoverflow.com-Comments.7z     08-Dec-2020 03:10    4.5G
    stackoverflow.com-PostHistory.7z  08-Dec-2020 22:50   28.0G
    stackoverflow.com-PostLinks.7z    07-Dec-2020 17:11   95.6M
    stackoverflow.com-Posts.7z        08-Dec-2020 18:48   15.9G
    stackoverflow.com-Tags.7z         07-Dec-2020 17:11  843.9K
    stackoverflow.com-Users.7z        07-Dec-2020 17:25  645.2M
    stackoverflow.com-Votes.7z        07-Dec-2020 22:14    1.2G

swalsh · 5 years ago
There's a lot of metadata not being captured by MetaData which would be insanely useful for a lot of purposes. It would be kind of neat if a lot more of that was captured... and if SO answers had more programmatic access.
rovr138 · 5 years ago
The data is on archive.org and also on BigQuery if you need it.

https://archive.org/details/stackexchange

MitPitt · 5 years ago
There's this open-source SO-esque Q&A forum: https://github.com/ASKBOT/askbot-devel
ruslan · 5 years ago
Preferably a knowledge DB accessible through the whois(1) protocol.
mcint · 5 years ago
Whois does seem a little more flexible than dict. Able to delegate and recurse in answering a query. Let's add some flags
hprotagonist · 5 years ago
with a human in the loop, this is IRC.