It's probably temporary? Move to R53, then figure out what's needed to manage records on two providers (if only for internal processes). Top engineering teams aren't going to knee-jerk this, right? Or did Dyn show some unfixable incompetence?
To be fair, Route 53 will split over other providers, so you should be good in theory. But yeah, if something more specifically targets Route 53 then that could be the same problem.
We use CloudFlare and after this my boss said "set us up with secondary DNS somewhere." Unfortunately, CloudFlare doesn't support being a primary DNS provider with NOTIFY messages. They are designed to handle the DDoS for us by proxying content. It's an interesting problem and I don't know whether to push back to CloudFlare or my boss. Anybody else running secondary DNS after this with CloudFlare?
What you should do depends on your setup and threat model. Do you fear DNS auth going down? Do you think your DNS will be a target? Do you use Cloudflare to hide your HTTP origin IP addresses?
For example, if you fear DNS auth going down, but you must use Cloudflare for HTTPS (say: for caching and SSL certs), then moving DNS off CF makes little sense. You already assume its stability by expecting it to work at the HTTP layer.
If you think you can be a target of a DNS attack, I'd say having multiple auth providers is unlikely to give you much more mileage.
If you can afford disabling CF on HTTP layer, exposing your HTTP origin IP and want to have two different DNS auth providers, fine, you can do CNAME. But then you have three vendors to worry about, and problems with each can lead to trouble.
I don't know much about running nameservers, but moving to all internally hosted seems like an odd choice to me. Can anyone explain why that's a good move?
With only a modest simplification you can view security as ultimately just being a figure measured in dollars: "it costs an adversary $X to beat these countermeasures." Your goal in securing a system is not to push X to infinity, though that might be a reasonable goal (e.g. if you're a security researcher designing new crypto primitives). Instead your goal in engineering your company's security consists in evaluating the value $V of what you're securing, and then raising X until X > V. There are uncertainties in measuring X and V and in how attackers will view these tradeoffs and so forth, but it's nothing you can't account for by building in an engineering tolerance like X > 2V. The basic story remains.
Spotify simultaneously has large resources and offers a non-essential infrastructure service (music to listen to while you're doing something else). The V gained in DoSing them is very small. They got attacked anyway because they shared infrastructure with other companies, which pools the V together to create something much larger. Some attacker saw a case where V >> X and attacked it to great success until Dyn was able to bring up X again. During the interim, Spotify was down despite having V << X.
In short: Spotify probably can't do DNS better than Dyn, but they can do DNS better than the sort of people who have reason to attack them (presumably trolls, maybe some future hacktivist who doesn't like some business decisions they make, unscrupulous competitors). This attack was a wake-up call for them, "oh, if we're pooling with these other folks then we'll become targets of larger hacktivist attacks and state actors, who are not directly targeting us per se." Those attackers could presumably still take out Spotify's home-rolled DNS, but they have no real motivation to target Spotify in particular any more.
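The X-versus-V pooling argument above can be sketched numerically. A minimal toy model; all dollar figures are invented for illustration:

```python
# Toy model of the "security as dollars" framing above.
# All numbers are invented for illustration.

def countermeasures_sufficient(attack_cost_x, asset_value_v, tolerance=2.0):
    """True if beating the defenses costs more than the asset is
    worth, with an engineering margin built in (X > tolerance * V)."""
    return attack_cost_x > tolerance * asset_value_v

# A $50k defense protecting a $10k asset clears the 2x margin...
print(countermeasures_sufficient(50_000, 10_000))       # True
# ...but pool ten such assets behind shared infrastructure and V
# rises tenfold while X stays fixed, so the margin fails.
print(countermeasures_sufficient(50_000, 10 * 10_000))  # False
```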
It lowers the attack surface. With companies like Dyn, they are affected even when someone is targeting other sites, while internal DNS servers used only by themselves will be down only if someone is attacking them directly.
If someone is targeting them directly it doesn't matter much that DNS is up and running, their site is still down.
"This outage exposed a critical weakness in our DNS hosting configuration. We are taking immediate steps to add additional DNS providers. This should allow us to avoid impact in the future, provided that at least one of our DNS providers is operational."
What happens if you have a DDOS on Route53? I'm sure they can handle the attack, but do you have to pay for the requests? Or are there clauses that they drop the fees if the requests were malicious? If not, the financial risk could easily outweigh the benefits of availability for smaller companies.
I'm in the process of looking for a secondary DNS provider for a client, but because they rely heavily on geolocation load balancing it's not simple... I wonder if anyone has recommendations besides UltraDNS for a good slave?
Virtually everyone is behind NAT these days, often multiple layers of NAT. So how does the botnet manage to telnet or ssh into these set top boxes or lightbulbs or whatever? When I want to SSH into my home computer I have to go through elaborate maneuvers to get it to work.
As I understand it, UPnP allows a device to negotiate a port mapping with a NAT box. But the developer of the IoT device would have to specifically want to map port 22 or 23. It's not something that would happen automatically.
So are you saying that these IoT device makers not only hard-coded root usernames and passwords into their devices, but deliberately set up UPnP mappings to those ports?
That looks malicious rather than negligent. Am I missing something?
And so? There is more than one way to spread a botnet.
One particular method is DNS rebinding. They attack your web browser, then your browser passes the attack to the device inside the network.
Another means is 'the weakest link'. An insecure and compromised device (even if it's just a user account on that device) scans your local network behind the firewall and passes information to a command and control server, which downloads and passes exploits to the devices it discovers on your network.
So I may be flat out wrong about this, but I believe the clients, once infected, phone home, thus (mostly) removing the need to worry about NAT, as the router will forward the server's traffic to the correct machine once it sees that a prior outgoing connection has been made.
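That phone-home pattern can be sketched in a few lines. Everything below runs on localhost and the roles are purely illustrative, not taken from any real botnet: the point is just that the "bot" makes the outbound connection, so replies ride back over an established flow that NAT permits.

```python
# Minimal sketch of the "phone home" pattern described above: the
# infected device makes the *outbound* connection, so a NAT router
# passes the controller's replies back over that same flow.
import socket
import threading

def controller(server_sock):
    """Stand-in for a command-and-control server: waits for a bot
    to connect, then pushes a command down the established socket."""
    conn, _addr = server_sock.accept()
    conn.sendall(b"REPORT")          # command travels server -> bot
    reply = conn.recv(1024)
    conn.close()
    return reply

server = socket.socket()
server.bind(("127.0.0.1", 0))        # ephemeral port for the demo
server.listen(1)
port = server.getsockname()[1]

result = {}
t = threading.Thread(target=lambda: result.update(r=controller(server)))
t.start()

# The "bot" initiates the connection (this is what traverses NAT)...
bot = socket.socket()
bot.connect(("127.0.0.1", port))
cmd = bot.recv(1024)                 # ...then receives commands inbound
bot.sendall(b"status:ok")
bot.close()
t.join()
server.close()

print(cmd)            # b'REPORT'
print(result["r"])    # b'status:ok'
```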
I commented this in another post, but at least for the affected surveillance cameras and recorders, the firmware for these devices is built on BusyBox. The manufacturers either opened up telnet or left telnet open in these firmwares until about 3 years ago, when it started getting exploited.
The characteristics for how these surveillance devices were hacked are that the devices are using older firmware with an exploitable telnet feature and that the default credentials are still intact. There are a few vectors (using UPnP, HTTP API, directory traversal) that can be applied to bypass authorization. Throw in a global directory of these devices (shodan.io) and you have the ability to search for these devices with specific firmwares, run every attack vector to compromise these systems, and, once connected, have them do _whatever_you_want_.
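A hedged sketch of the default-credential vector described above: the "device" here is a mock, and the credential pairs are examples of the kind of factory defaults reported for such gear, not an exact list from any specific firmware.

```python
# Sketch of the default-credential weakness described above: a
# Mirai-style scanner simply walks a short list of factory logins.
# The device is a mock; the credentials are illustrative examples.
FACTORY_DEFAULTS = [("root", "xc3511"), ("admin", "admin"), ("root", "12345")]

def mock_device_login(user, password):
    """Stand-in for a camera's telnet login with defaults intact."""
    return (user, password) == ("root", "xc3511")

def try_defaults(login_fn):
    """Return the first working credential pair, or None."""
    for user, password in FACTORY_DEFAULTS:
        if login_fn(user, password):
            return (user, password)   # device compromised
    return None

print(try_defaults(mock_device_login))  # ('root', 'xc3511')
```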
> Virtually everyone is behind NAT these days, often multiple layers of NAT. So how does the botnet manage to telnet or ssh into these set top boxes or lightbulbs or whatever?
An infected Windows desktop can scan the local network and infect IoT devices.
Expect more of this in the near future, single-source infrastructure is becoming a huge liability (not that it wasn't before). I wonder what impact on SLAs it will have when cloud services providers are taken down - will they honor their SLAs or inject DDOS clauses into them to shield themselves. You won't see many standing up to multi-Tbps attacks, at least for the moment.
Doesn't that also provide a huge incentive for cloud offerings to sell DDOS-resistance products and services? Isn't that a huge market already with gigantic margins?
Can anyone explain why they keep referring to this as a complex attack? From the article it seems to be a simple volumetric attack. They mention that it used UDP and TCP on port 53, nothing complex about that...
Am I missing something here? It wasn't an L7 attack (or was it?) Why keep referring to it as complex?
I think the market's expectation is that a DNS provider is prepared for a DDoS of any size, but not necessarily any level of complexity, so that's a lot of incentive to talk up the complexity of the attack.
What's described in this incident report is totally within the capabilities of a single individual with public knowledge, though. If they could have proven otherwise, they probably would have (unless that somehow conflicted with their criminal investigation).
Much of the complexity is likely in building and managing such a large botnet.
There also aren't a lot of details here on the exact nature of the traffic. They say it was hard to distinguish this malicious traffic from legitimate traffic, so the botnet is at least rotating its requests through lists of customers hosted with them. (That isn't complex, but it is forward-thinking: if the botnet were making non-stop requests for just a few domains, that would be a strong signal to start filtering traffic, first internally, then by pushing ISPs to block it upstream.)
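The filtering signal in question can be illustrated with a quick sketch (domain names invented): a flood concentrated on a few domains stands out in per-domain query counts, while the same volume spread across a provider's whole customer list does not.

```python
# Sketch of why rotating queries across many hosted domains blunts
# the obvious filtering signal: a handful of hot domains stand out,
# a spread across the whole customer list looks like background load.
from collections import Counter

def top_domain_share(queries):
    """Fraction of all queries aimed at the single busiest domain."""
    counts = Counter(queries)
    return max(counts.values()) / len(queries)

naive_flood = ["victim.example"] * 98 + ["a.example", "b.example"]
rotated = [f"cust{i % 50}.example" for i in range(100)]

print(top_domain_share(naive_flood))  # 0.98 -> easy to filter on
print(top_domain_share(rotated))      # 0.02 -> blends into the noise
```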
I wonder what 1.2 Tbps would do if pointed at an ELB or an AZ in AWS. Someone must have tried at some point, but I've never heard of any widespread outages caused by a DDoS on AWS. Is it just that the bandwidth available to an AWS datacenter is that much bigger than what Dyn has?
It's safe to assume that 1.2Tbps isn't a big deal for Google/MSFT/AWS/Cloudflare/Akamai/Yahoo/Verizon/etc.
DNS is normally a low-bandwidth protocol so if you only provide DNS services, needing to purchase 1000x your normal bandwidth to handle these bursts would be miserable. If a DNS provider were also providing video services (Vimeo/Twitch/etc), then a 1.2Tbps increase in traffic could be easily absorbed.
"State actor" today is an abused term, because it helps the victim look less bad, the government will back it up, because it paves way for new regulations and there's no way to prove one way or the other. Especially when it comes to DDoS.
This is insane how big these things are now: "There have been some reports of a magnitude in the 1.2 Tbps range; at this time we are unable to verify that claim."
They mention 100k participating devices/IPs - that would mean an average of 12 Mbps upload. Sounds high, but plausible? [ed: I actually think it's kind of sad that most users aren't on symmetric gigabit links yet... But in this context I guess it's a blessing of sorts... ]
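A back-of-envelope check of that figure, using only the numbers in the comment:

```python
# Back-of-envelope check: 1.2 Tbps spread over 100k devices.
total_bps = 1.2e12          # reported 1.2 Tbps aggregate
devices = 100_000           # reported participating devices/IPs

per_device_mbps = total_bps / devices / 1e6
print(per_device_mbps)      # 12.0 Mbps average upload per device
```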
"They mention 100k participating devices/IPs - that would mean an average of 12 Mbps upload"
It doesn't have to be. An attack can be highly asymmetric, such as an amplification/reflection attack. I'm not saying this is what happened in the Dyn case, but rather addressing your average. Just as an example: have hosts in the botnet send queries to non-Dyn open resolvers on the internet, and in those queries spoof the source IPs of Dyn's DNS servers. Now all of those open resolvers start sending return traffic to Dyn. If those spoofed requests ask for EDNS0, the responses could be up to 4K. Compare that to the size of the query packet and you have a lot of leverage.
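Rough amplification math for that scenario. The ~60-byte query size is my assumption about a typical small spoofed DNS query; the 4K response ceiling is from the comment above:

```python
# Rough amplification factor for an EDNS0 reflection scenario.
query_bytes = 60            # small spoofed DNS query (assumed size)
response_bytes = 4096       # EDNS0 response of up to 4K

amplification = response_bytes / query_bytes
print(round(amplification, 1))   # 68.3 -> roughly 68x more bytes
                                 # hit the victim than the bot sent
```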
We already know Mirai has been able to reach over 1 Tbps, and we know Mirai was at least one of the cannons hitting Dyn. So 1.2 Tbps is definitely plausible. Mirai has decreased in size to a degree due to more public awareness, but it's still massive.
* us-east-1.amazonaws.com: split between internal, UltraDNS, DYN
* spotify.com: all internal nameservers now
* reddit.com: all Route53 now
* github.com: all Route53 now
* netflix.com: all Route53 now
* paypal.com: split between UltraDNS and DYN
No changes made:
* twitter.com: 100% with DYN
At least that's my understanding anyway.
Indeed, if you want the HTTP/HTTPS traffic to go through Cloudflare, the DNS must go through Cloudflare. There are generally two ways to set it up:
a) You move your DNS auth to Cloudflare and allow it to manage it.
b) You keep managing your domain yourself, and CNAME to Cloudflare. See: https://support.cloudflare.com/hc/en-us/articles/200168706-H...
The question you should ask is why did these companies use an external DNS in the first place?
(And I know it's not that simple, but that's probably the basic reasoning behind it.)
On the previous 3 Saturdays they lost between 40 and 60 domains.
https://status.heroku.com/incidents/965
"This outage exposed a critical weakness in our DNS hosting configuration. We are taking immediate steps to add additional DNS providers. This should allow us to avoid impact in the future, provided that at least one of our DNS providers is operational."
Shame on reddit, github, and netflix for learning literally fuck-all from this.
(Well, not surprised as much as disappointed.)
> Throw in a global directory of these devices (shodan.io) and you have the ability to search for these devices with specific firmwares, run every attack vector to compromise these systems, and, once connected, have them do _whatever_you_want_.

And that's only on the default port.
1. Device backdoors open port 23 (telnet), used to take over the IoT devices.
2. The IoT devices then attack through port 53 (DNS).
[p.s.] The question is precisely this: is it possible that a DDoS attack on DNS can be used to affect/mask a MITM attack?