## page was renamed from DNS/毒盛/Kaminsky手法/2020/saddns.net/4/4.3
#pragma section-numbers off

= saddns.net/4/4.3 =
<<TableOfContents()>>

== 4.3 Public-Facing Source Port Scan Method ==

Even though a source port can be directly probed by any attacker
IP in this case, e.g., as in unbound and dnsmasq, it is imperative to
bypass '''the per IP rate limit''' (present in Linux primarily) to achieve
faster scan speed. 

{{{
We develop three different probing methods that
can overcome the ICMP rate limit challenge.
}}}
1○ If the attacker owns multiple IP addresses, either multiple bot
machines or a single machine with an IPv6 address, then it is trivial
to bypass the per IP limit. 

IPv6 address allocation states that each
LAN is given a /64 prefix [33], effectively allowing any network to
use 264 public IP addresses. We have tested this from a machine
in a residential network that supports IPv6 and picked several IPs
within the /64 to send and receive traffic successfully.

2○ If an attacker owns only a single IPv4 address, it is still pos-
sible to ask for multiple addresses using DHCP. We verified that
multiple private IPv4 addresses can be obtained in a home network.
In addition, we have tested this in an educational network where
a single physical machine is able to acquire multiple public IPv4
addresses through this method as well.

3○ If an attacker owns a single IPv4 address and the above
method fails for some reason (e.g., statically assigned IPs), then
the last method is to leverage IP spoofing to bypass the per IP rate
limit, and the global rate limit as a side channel to infer whether
the spoofed probes have hit the correct source port or not, i.e., with
or without ICMP responses.

As have been shown in the context of TCP recently, 
global rate limit can introduce serious side channels [11, 12, 26]. 

{{{
Here we leverage the ICMP global rate limit to facilitate UDP port scans 
which we describe next.
}}}
Figure 3 illustrates this. 

In observing the maximum globally allowable burst of 50 ICMP packets in Linux, 
{{{
the attacker first sends 50 spoofed UDP probe packets each with a different source IP
(bypassing the per-IP rate limit). 
}}}

If the victim server does not have any source port open among the 50,
then 50 ICMP port unreachable messages will be triggered (but they are not directly observableto the attacker). 

If the victim server does have 𝑛 open ports, then only 50-𝑛 ICMP packets will be triggered
(as the 𝑛 UDP probingpackets will be silently discarded at the application layer).

{{{
Now, the attacker sends a verification packet using its real IP address, 
e.g., a UDP packet destined to a known closed port, such as 1. 
}}}
It will either get no response (if the global rate limit is drained), 
or an ICMP reply otherwise.

{{{
If no port is found in the first batch, 
the attacker waits for at least 50ms for the rate limit counter to recuperate, 
and then start thenext round. 
}}}
Effectively, the scanning speed will be capped at 1,000per second. 

It therefore takes 60+ seconds to enumerate the entire port range consisting of 65536 ports. 

Nevertheless, it is a winning battle as the attacker can simply repeat the experiment 
and the probability that one experiment will succeed increases drastically
we note that this is a simple Bernoulli trial). 

'''Time consideration.'''

This approach does have a strong timing requirement.
The only thing the attacker has to send 50 spoofed probing packets and the verification packet in a burst so that they are all processed within a 20𝑚𝑠 window;
otherwise, the victim may start recovering additional tokens. 

The other requirement is that the attacker has to wait long enough for the 50 max tokens to recover. 
If the network condition is not ideal, the attacker can simply wait longer than 50ms.
{{{
Binary search to narrowing down to an exact port. 
}}}
Assuming there is a single open port out of the 50 in a specific probing round,
we can then employ a simple binary search to quickly narrow down to the exact port. 
During each round of binary search, we always probe the left half of range first. 

If it is a match, i.e., 50 spoofed probing packets triggered 49 replies 
and the attacker can observe one reply to its verification packet,
then we continue to search its left half. 
Otherwise, we assume the port lies in the right half and will conduct a binary search there.

Note that we will need to send “padding packets” to ensure the global rate limit is drained
when none of the 50 guesses hit a correct port. 
Padding packets are spoofed packets destined to known closed UDP ports,
e.g., 1, that are guaranteed to trigger ICMP replies.

Handling noises. 

DNS servers usually serve multiple clients at the same time, 
creating multiple outstanding DNS queries and source ports. 

As a result, the source port scan will likely discover many irrelevant ports.
However, most such queries are transient, and the port scan process can quickly discover
an open source port disappearing during the binary search and return to the linear
search. In contrast, we assume that the attacker-triggered DNS
queries will last significantly longer, e.g., on the order of seconds instead of milliseconds (see §5).

Another source of noises comes from packet losses and reordering. 
This may lead to both false positives, e.g., loss probing packets
or their replies, reordering between verification and probing pack-
ets, and false negatives, e.g., lost of the verification packet or its
reply (although very rare in practice). To mitigate reordering (which
may happen frequently if the jitter is large), we insert a delay, which
is empirically determined to be larger than twice the jitter, between
probe packets and the verification packet. When false positives do
occur, they are handled automatically in the binary search process–
it will detect no real port being open and return to linear search.
Even though they can be handled, excessive false positives will
drain the per-IP rate limit quickly. Specifically, given the token is
recovered at the slow rate of one per second, a false positive rate
that is higher than that will force the scan to halt until the token is
recovered. Effectively, a per-IP token is a “pass to scan”. To solve
this problem, the attacker may use two or more real IPs to gain
more “passes”.

In addition, DNS servers themselves may be subject to random
UDP port probing and therefore generate ICMP unreachable mes-
sages. This would cause false negatives: we may mistakenly think
there is no open port but in fact there is because the verification
packet will not trigger any ICMP unreachable replies due to the
noise draining the rate limit. Fortunately, not all ICMP replies are
subject to rate limit. For example, the most commonly triggered
ICMP echo replies are not subject to the limit.

== 4.4 Private Source Port Scan Method ==
As described in §4.1, if connect() is performed on a UDP socket,
the port effectively becomes “private” to the remote peer, 
invalidating the previous method.

Our idea then is to send spoofed UDP packets with the source IP
of the upstream DNS server. 

In the example of a DNS resolver being
the victim, we can send UDP packets probing different source ports
with spoofed IP of the authoritative name server. If it hits the correct
source port, then no ICMP reply will be generated. Otherwise, there
will be. We can then use the same global ICMP rate limit as a side
channel to infer if such an ICMP message has been triggered. At
first glance, this method can work but at a low speed of one port
per second, due to the per-IP rate limit on ICMP messages.

Surprisingly, after we analyze the source code of the ICMP rate
limit implementation, we find that the global rate limit is checked
prior to the per-IP rate limit. This means that even if the per-IP
rate limit may eventually determine that no ICMP reply should be
sent, a packet is still subjected to the global rate limit check and one
token is deducted. Ironically, such a decision is consciously made
by Linux developers to avoid invoking the expensive check of the
per-IP rate limit [22], involving a search process to locate the per-IP
data structure.

upstream server. 

Due to per-IP rate limit, the victim server will
always generate only one ICMP reply (in steady state) as long as
there is at least one inactive port scanned, which is the case in both
the left and right side of the figure. In the case where the 50 probes
hit 𝑛 private open ports (to the upstream server), the global rate
limit counter still decrements to 𝑛 because the victim attempted to
generate 50 − 𝑛 ICMP replies. In contrast, when all 50 probes hit
inactive ports (left side of the figure), the counter decrements to 0.
The rest of the procedure is identical as before, where a binary

search can be launched to narrow down to a specific port.
Influence on public-facing source port scan. With this knowl-
edge, we can improve method 3○ in §4.3 as follows: instead of
spoofing 50 different IPs in each round of probing, we only need to
use a single spoofed IP (or a 2nd IP the attacker owns) instead of
many different IPs (which sometimes can be a hurdle).
Handling noises. We point out that there is inherently less noise
in this scan compared to the one on public-facing source ports.
This is because every source port is now effectively “open” to only
one single remote IP which is originally specified in connect().
Therefore, assuming the victim is a resolver, most of its queries (i.e.,
noise) will be destined to a different name server than a specific
attack target. Other noise conditions such as packet loss and re-
ordering still apply. Similarly, noise handling techniques also apply
(e.g., using more than one IP to alleviate the per-IP ICMP rate limit).


This effectively means that the per-IP rate limit can be disre-
garded for the purpose of our side channel based scan, as it only
determines if the final ICMP reply is generated but has nothing to
do with the global rate limit counter decrement. As a result, we can
continue to use roughly the same scan method as efficient as before,
achieving 1,000 ports per second. Figure 4 illustrates the slightly
modified scan workflow. Similar to Figure 3, the attacker first sends
50 probes where this time all of which uses the spoofed IP of the


Figure 4: Fast Port Scanning of a Private Source Port



Figure 4: Fast