TCP/IP Tutorial and Technical Overview

Table of Contents IP Routing
TCP/IP Tutorial and Technical Overview

2.2 Addressing

Internet addresses can be symbolic or numeric. The symbolic form is easier to read, for example: myname@ibm.com. The numeric form is a 32-bit unsigned binary value which is usually expressed in a dotted decimal format. For example, 9.167.5.8 is a valid Internet address. The numeric form is used by the IP software. The mapping between the two is done by the Domain Name System discussed in Domain Name System (DNS). We shall first look at the numeric form, which is called the IP address.

2.2.1 The IP Address

The standards for IP addresses are described in RFC 1166 -- Internet Numbers.

To be able to identify a host on the internet, each host is assigned an address, the IP address, or Internet Address. When the host is attached to more than one network, it is called multi-homed and it has one IP address for each network interface. The IP address consists of a pair of numbers:

IP address = <network number><host number>

The network number part of the IP address is centrally administered by the Internet Network Information Center (the InterNIC) and is unique throughout the Internet. (1)

IP addresses are 32-bit numbers usually represented in a dotted decimal form (as the decimal representation of four 8-bit values concatenated with dots). For example 128.2.7.9 is an IP address with 128.2 being the network number and 7.9 being the host number. The rules used to divide an IP address into its network and host parts are explained below.

The binary format of the IP address 128.2.7.9 is:

         10000000 00000010 00000111 00001001

IP addresses are used by the IP protocol (see Internet Protocol (IP)) to uniquely identify a host on the internet. IP datagrams (the basic data packets exchanged between hosts) are transmitted by some physical network attached to the host and each IP datagram contains a source IP address and a destination IP address. To send a datagram to a certain IP destination, the target IP address must be translated or mapped to a physical address. This may require transmissions on the network to find out the destination's physical network address (for example, on LANs the Address Resolution Protocol, discussed in Address Resolution Protocol (ARP), is used to translate IP addresses to physical MAC addresses).

The first bits of the IP address specify how the rest of the address should be separated into its network and host part.

The terms network address and netID are sometimes used instead of network number, but the formal term, used in RFC 1166, is network number. Similarly, the terms host address and hostID are sometimes used instead of host number.

There are five classes of IP addresses. These are shown in Figure - Assigned Classes of Internet Addresses.

Figure: Assigned Classes of Internet Addresses

Note: Two numbers out of each of the class A, class B and class C network numbers, and two host numbers out of every network are pre-assigned: the ``all bits 0'' number and the ``all bits 1'' number. These are discussed below in Special IP Addresses.

Class A addresses use 7 bits for the network number giving 126 possible networks (we shall see below that out of every group of network and host numbers, two have a special meaning). The remaining 24 bits are used for the host number, so each networks can have up to 2(superscript 24)-2 2 to the power 24 minus 2 (16,777,214) hosts.
Class B addresses use 14 bits for the network number, and 16 bits for the host number giving 16382 networks each with a maximum of 65534 hosts.
Class C addresses use 21 bits for the network number and 8 for the host number giving 2,097,150 networks each with up to 254 hosts.
Class D addresses are reserved for multicasting, which is used to address groups of hosts in a limited area. See Multicasting for more information on multicasting.
Class E addresses are reserved for future use.

It is clear that a class A address will only be assigned to networks with a huge number of hosts, and that class C addresses are suitable for networks with a small number of hosts. However, this means that medium-sized networks (those with more than 254 hosts or where there is an expectation that there may be more than 254 hosts in the future) must use Class B addresses. The number of small- to medium-sized networks has been growing very rapidly in the last few years and it was feared that, if this growth had been allowed to continue unabated, all of the available Class B network addresses would have been used by the mid-1990s. This is termed the IP Address Exhaustion problem. The problem and how it is being addressed are discussed in The IP Address Exhaustion Problem.

One point to note about the split of an IP address into two parts is that this split also splits the responsibility for selecting the IP address into two parts. The network number is assigned by the InterNIC, and the host number by the authority which controls the network. As we shall see in the next section, the host number can be further subdivided: this division is controlled by the authority which owns the network, and not by the InterNIC.

2.2.2 Subnets

Due to the explosive growth of the Internet, the use of assigned IP addresses became too inflexible to allow easy changes to local network configurations. These changes might occur when:

A new physical network is installed at a location.
Growth of the number of hosts requires splitting the local network into two or more separate networks.

To avoid having to request additional IP network addresses in these cases, the concept of subnets was introduced.

The host number part of the IP address is sub-divided again into a network number and a host number. This second network is termed a subnetwork or subnet. The main network now consists of a number of subnets and the IP address is interpreted as:

<network number><subnet number><host number>

The combination of the subnet number and the host number is often termed the ``local address'' or the ``local part''. ``Subnetting'' is implemented in a way that is transparent to remote networks. A host within a network which has subnets is aware of the subnetting but a host in a different network is not; it still regards the local part of the IP address as a host number.

The division of the local part of the IP address into subnet number and host number parts can be chosen freely by the local administrator; any bits in the local part can be used to form the subnet accomplished. The division is done using a subnet mask which is a 32 bit number. Zero bits in the subnet mask indicate bit positions ascribed to the host number, and ones indicate bit positions ascribed to the subnet number. The bit positions in the subnet mask belonging to the network number are set to ones but are not used. Subnet masks are usually written in dotted decimal form, like IP addresses.

The special treatment of ``all bits zero'' and ``all bits one'' applies to each of the three parts of a subnetted IP address just as it does to both parts of an IP address which has not been subnetted. See Special IP Addresses. For example, a subnetted Class B network, which has a 16-bit local part, could use one of the following schemes:

The first byte is the subnet number, the second the host number. This gives us 254 (256 minus 2 with the values 0 and 255 being reserved) possible subnets, each having up to 254 hosts. The subnet mask is 255.255.255.0.
The first 12 bits 15 are used for the subnet number and the last four for the host number. This gives us 4094 possible subnets (4096 minus 2) but only 14 hosts per subnet (16 minus 2). The subnet mask is 255.255.255.240. There are many other possibilities.

While the administrator is completely free to assign the subnet part of the local address in any legal fashion, the objective is to assign a number of bits to the subnet number and the remainder to the local address. Therefore, it is normal to use a contiguous block of bits at the beginning of the local address part for the subnet number because this makes the addresses more readable (this is particularly true when the subnet occupies 8 or 16 bits). With this approach, either of the subnet masks above are ``good'' masks, but masks like 255.255.252.252 and 255.255.255.15 are not.

2.2.2.1 Types of Subnetting

There are two types of subnetting: static and variable length. Variable length is the more flexible of the two. Which type of subnetting is available depends upon the routing protocol being used; native IP routing supports only static subnetting, as does the widely used RIP protocol. However, RIP Version 2 supports variable length subnetting as well. See Routing Information Protocol (RIP) for a description of RIP and RIP2. Routing Protocols discusses routing protocols in detail.

Static Subnetting

Static subnetting means that all subnets in the subnetted network use the same subnet mask. This is simple to implement and easy to maintain, but it implies wasted address space for small networks. For example, a network of four hosts that uses a subnet mask of 255.255.255.0 wastes 250 IP addresses. It also makes the network more difficult to reorganize with a new subnet mask. Currently, almost every host and router supports static subnetting.

Variable Length Subnetting

When variable length subnetting is used, the subnets that make up the network may use different subnet masks. A small subnet with only a few hosts needs a subnet mask that accommodates only these few hosts. A subnet with many hosts attached may need a different subnet mask to accommodate the large number of hosts. The possibility to assign subnet masks according to the needs of the individual subnets will help conserve network addresses. Also, a subnet can be split into two parts by adding another bit to the subnet mask. Other subnets in the network are unaffected by the change. Not every host and router supports variable length subnetting.

Only networks of the size needed will be allocated and routing problems will be solved by isolating networks with routers that support variable subnetting. A host that does not support this kind of subnetting would have to route to a router that supports variable subnetting.

Mixing Static and Variable Length Subnetting

At first sight, it appears that the presence of a host which only supports static subnetting would prevent variable length subnetting from being used anywhere in the network. Fortunately this is not the case. Provided that the routers between subnets with different subnet masks are using variable length subnetting, the routing protocols employed are able to hide the difference between subnet masks from the hosts in a subnet. Hosts can continue to use basic IP routing and offload all of the complexities of the subnetting to dedicated routers.

2.2.2.2 A Static Subnetting Example

Assume that our IP network has been assigned the class B IP network number 129.112. We have to implement multiple physical networks throughout our network. and some of the routers we will be using do not support variable length subnetting. We must therefore choose a subnet mask for the whole network. We have a 16-bit local address for our whole network and must divide it into two parts appropriately. At the moment, we do not anticipate having more than 254 physical networks, nor more than 254 hosts per network, so a logical subnet mask to use is 255.255.255.0 (which also has the advantage of being an ``easily readable'' one). This decision should be made with care, since it will be difficult to change it later. If the number of networks or hosts grows beyond the planned numbers, we may have to implement variable length subnetting to make the best use of the 65,534 local addresses we have.

Figure - A Subnet Configuration shows an example of an implementation with three subnets.

Figure: A Subnet Configuration - Three physical networks form one IP network. The two routers are performing slightly different tasks. Router 1 is acting as a router between subnets 1 and 3 and as a router between the whole of our network and the rest of the internet. Router 2 acts only as a router between subnets 1 and 2.

Let us now consider a different subnet mask: 255.255.255.240. The fourth octet is then divided in two parts:

The following table contains the valid subnets using this subnet mask:

Table: Subnet Values for Subnet Mask 255.255.255.240

For each of these subnet values, only 14 addresses (from 1 to 14) for hosts are available, because only the right part of the octet can be used and because addresses 0 and 15 (all bits set to one) have a special meaning as described in Special IP Addresses.

Thus, subnet number 9.67.32.16 will contain hosts whose IP addresses are in the range of 9.67.32.17 to 9.67.32.30, and subnet number 9.67.32.32 will contain hosts whose IP addresses are in the range of 9.67.32.33 to 9.67.32.46, etc.

2.2.2.3 IP Routing with Subnets

To route an IP datagram on the network, the general IP routing algorithm has the following form:

Figure: IP Routing without Subnets

To be able to differentiate between subnets, the IP routing algorithm changes and has the following form:

Figure: IP Routing with Subnets

Some implications of this algorithm are:

It is a change to the general IP algorithm. Therefore, to be able to operate this way, the particular gateway must contain the new algorithm. Some implementations may still use the general algorithm, and will not function within a subnetted network, although they can still communicate with hosts in other networks which are subnetted.
As IP routing is used in all of the hosts (and not just the routers), all of the hosts in the subnet must:
1. Have an IP algorithm that supports subnetting.
2. Have the same subnet mask (unless subnets are formed within the subnet).
If the IP implementation on any of the hosts does not support subnetting, that host will be able to communicate with any host in its own subnet but not with any machine on another subnet within the same network. This is because the host sees only one IP network and its routing cannot differentiate between an IP datagram directed to a host on the local subnet and a datagram that should be sent via a router to a different subnet.

In case one or more hosts do not support subnetting, an alternative way to achieve the same goal exists in the form of proxy-ARP, which doesn't require any changes to the IP routing algorithm for single-homed hosts, but does require changes on routers between subnets in the network. This is explained in more detail in Proxy-ARP or Transparent Subnetting.

All IBM TCP/IP implementations support subnetting but there is currently no support for proxy-ARP routers.

2.2.2.4 Obtaining a Subnet Mask

Usually, hosts will store the subnet mask to be used in a configuration file. However, sometimes this cannot be done, as for example in the case of a diskless workstation. The ICMP protocol includes two messages, address mask request and address mask reply, which allow hosts to obtain the correct subnet mask from a server. See Address Mask Request (17) and Address Mask Reply (18) for more information.

2.2.2.5 Addressing Routers and Multi-homed Hosts

Whenever a host has a physical connection to multiple networks or subnets, it is described as being multi-homed. All routers are multi-homed since their purpose is to join networks or subnets. A multi-homed host always has different IP addresses associated with each network adapter, since each adapter is in a different subnet or network.

There is one apparent exception to this rule: with some systems (for example VM and MVS) it is possible to specify the same IP address for multiple point-to-point links (such as channel-to-channel adapters) if the routing protocol used is limited to the basic IP routing algorithm. For example, a VM ``hypervisor'' system running TCP/IP connected to a token-ring LAN via a 3172 Interconnect Controller may run a number of ``guest'' VM and/or MVS systems. A very cost-effective solution for connecting these guests to the token-ring is to connect them to the hypervisor's TCP/IP with virtual channel-to-channel connections. The IP addresses can be chosen so that the channel-to-channel connected systems constitute their own subnet, in which case the hypervisor acts as a router. Because the routing protocols available on VM and MVS only support static subnetting, it may be difficult to find an additional subnet number if the address space is constrained. Therefore, the channel-to-channel connected systems may be given IP addresses in the same subnet as the token ring, in which case the hypervisor is taking the place of a bridge, and is not multi-homed. One disadvantage of this configuration is that other hosts on the LAN need static definitions to route datagrams for the hosts on the far side of the ``bridge'' via the hypervisor because the 3172 is not aware that it has this bridging responsibility. The ``transparency'' of bridges described in Bridges, Routers and Gateways is missing in this scenario.

2.2.3 Special IP Addresses

As noted above, any component of an IP address with a value ``all bits 0'' or all ``all bits 1'' has a special meaning

all bits 0: stands for ``this'': ``this'' host (IP address with <host number>=0) or ``this'' network (IP address with <network number>=0) and is only used when the real value is not known. This form is only used in source addresses when the host is trying to determine its IP addresses from a remote server. The host may know include its host number if known, but not its subnet or network number. See also BOOTstrap Protocol - BOOTP.
all bits 1: stands for ``all'': ``all'' networks or ``all'' hosts. For example, 128.2.255.255 (a class B address with a host number of 255.255) means all hosts on network 128.2. These are used in broadcast messages, as described below.

There is another address of special importance: the ``all bits 1'' class A network number 127 is reserved for the loopback address. Anything sent to an address with 127 as the value of the high order byte, for example 127.0.0.1, must not be routed via a network but must be routed directly from the IP implementation's output driver to its input driver. (2)

2.2.4 Unicasting, Broadcasting and Multicasting

The majority of IP addresses refer to a single recipient: these are called unicast addresses. However, as noted above, there are two special types of IP address which are used for addressing multiple recipients: broadcast addresses and multicast addresses. These addresses are used for sending messages to multiple recipients. Any protocol which is connectionless may send broadcast or multicast messages as well as unicast messages. A protocol which is connection-oriented can only use unicast addresses because the connection exists between a specific pair of hosts. See Transmission Control Protocol (TCP) for more information on connection-oriented protocols.

2.2.4.1 Broadcasting

There are a number of addresses which are used for IP broadcasting: all use the convention that ``all-bits 1'' indicates ``all''. Broadcast addresses are never valid as source addresses, only as destination addresses. The different types of broadcast addresses are listed here:

limited broadcast address: The address 255.255.255.255 (all bits 1 in all parts of the IP address) is used on networks which support broadcasting, such as token rings, and it refers to all hosts on the subnet. It does not require the host to know any IP configuration information at all. All hosts on the local network will recognize the address, but routers will never forward it.
There is one exception to this rule, called BOOTP forwarding. The BOOTP protocol uses the limited broadcast address to allow a diskless workstation to contact a boot server. BOOTP forwarding is a configuration option available on some routers, including the IBM 6611 and 2210 Network Processors to make an exception for UDP datagrams for ports 67 (used by the BOOTP protocol). Without this facility, a separate BOOTP server would be required on each subnet. However, this is not simple forwarding because the router also plays a part in the BOOTP protocol. See BOOTstrap Protocol - BOOTP for more information about BOOTP forwarding and User Datagram Protocol (UDP) for an explanation of UDP ports.
network-directed broadcast address: If the network number is a valid network number, the network is not subnetted and the host number is all ones (for example, 128.2.255.255), then the address refers to all hosts on the specified network. Routers should forward these broadcast messages unless configured otherwise. This is used in ARP requests (see Address Resolution Protocol (ARP)) on unsubnetted networks.
subnet-directed broadcast address: If the network number is a valid network number, the subnet number is a valid subnet number and the host number is all ones, then the address refers to all hosts on the specified subnet. Since the sender's subnet and the target subnet may have different subnet mask, the sender must somehow find out the subnet mask in use at the target. The actual broadcast is performed by the router which receives the datagram into the subnet.
all-subnets-directed broadcast address: If the network number is a valid network number, the network is subnetted and the local part is all ones (for example, 128.2.255.255), then the address refers to all hosts on all subnets in the specified network. In principle routers may propagate broadcasts for all subnets but are not required to do so. In practice, they do not; there are few circumstances where such a broadcast would be desirable, and it can lead to problems, particularly if a host has been incorrectly configured with no subnet mask. Consider the wasted resource involved if a host 9.180.214.114 in the subnetted Class A network 9 thought that it was not subnetted and used 9.255.255.255 as a ``local'' broadcast address instead of 9.180.214.255 and all of the routers in the network respected the request to forward the request to all clients.
If routers do respect all-subnets-directed broadcast address they use an algorithm called Reverse Path Forwarding to prevent the broadcast messages from multiplying out of control. See RFC 922 for more details on this algorithm.

2.2.4.2 Multicasting

Broadcasting has a major disadvantage: its lack of selectivity. If an IP datagram is broadcast to a subnet, every host on the subnet will receive it, and have to process it to determine whether the target protocol is active. If it is not, the IP datagram is discarded. Multicasting avoids this overhead by using groups of IP addresses.

Each group is represented by a 28-bit number, which is included in a Class D address. Recall that a class D address has the format:

So multicast group addresses are IP addresses in the range 224.0.0.0 to 239.255.255.255. For each multicast address there is a set of zero or more hosts which are listening to it. This set is called the host group. There is no requirement for any host to be a member of a group to send to that group. There are two kinds of host group:

permanent

The IP address is permanently assigned by IANA. The membership of a host group is not permanent: a host may leave or join the group at will. The list of IP addresses assigned permanent host groups is included in STD 2 - Assigned Internet Numbers. Important ones are:

224.0.0.0: Reserved base address
224.0.0.1: All systems on this subnet
224.0.0.2: All routers on this subnet
Some other examples used by the OSPF routing protocol (see Open Shortest Path First Protocol (OSPF) Version 2) are:
224.0.0.5: All OSPF routers
224.0.0.6: OSPF Designated Routers

An application may also retrieve a permanent host group's IP address from the domain name system (see Domain Name System) using the domain mcast.net, or determine the permanent group from an address by using a pointer query (see Mapping IP Addresses to Domain Names - Pointer Queries) in the domain 224.in-addr.arpa. A permanent group exists even if it has no members.

transient

Any group which is not permanent is transient and is available for dynamic assignment as needed. Transient groups cease to exist when their membership drops to zero.

Multicasting on a single physical network which supports the use of multicasting is simple. To join a group, a process running on a host must somehow inform its network device drivers that it is wishes to be a member of the specified group. The device driver software itself must map the multicast address to a physical multicast address and enable the reception of packets for that address. The device driver must also ensure that the receiving process does not receive any spurious datagrams by checking the destination address in the IP header before passing it to the IP layer.

For example, Ethernet supports multi-casting if the high-order byte of the 48-byte address is X'01' and IANA owns an Ethernet address block, which consists of the addresses between X'00005E000000' and X'00005EFFFFFF'. The lower half of this range has been assigned by IANA for multicast addresses, so on an Ethernet LAN there is a range of physical addresses between X'01005E000000' and X'01005E7FFFFF' which is used for IP multicasting. This range has 23 usable bits, so the 28-bit multicast addresses are mapped to the Ethernet addresses by considering the low-order 23 bits, that is 32 multicast addresses are mapped to each Ethernet address. Because of this non-unique mapping, filtering by the device driver is required. There are two other reasons why filtering might still be needed:

Some LAN adapters are limited to a finite number of concurrent multicast addresses and if this is exceeded they must receive all multicasts.
Other LAN adapters tend to filter according to a hash table value rather than the whole address, which means that there is a chance that two multicast addresses with the same hash value might be in use at the same time and the filter might ``leak''.

Despite this requirement for software filtering of multicast packets, multicasting still causes much less overhead for hosts that are not interested. In particular, those hosts that are not in any host group are not listening to any multicast addresses and all multicast messages are filtered by the network interface hardware.

Multicasting is not limited to a single physical network. There are two aspects to multicasting across physical networks:

A mechanism for deciding how widespread the multicast is (remember that unlike unicast addresses and broadcast addresses) multicast addresses cover the entire Internet.
A mechanism for deciding whether a multicast datagram needs to be forwarded to a particular network.

The first problem is easily solved: the multicast datagram has a Time To Live (TTL) value like every other, which is decremented with each hop to a new network. When the Time to Live field is decremented to zero, the datagram can go no further. The mechanism for deciding whether a router should forward a multicast datagram is called Internet Group Management Protocol (IGMP) or Internet Group Multicast Protocol. This is described further in Internet Group Management Protocol (IGMP). IGMP and multicasting are defined in RFC 1112 - Host extensions for IP multicasting.

2.2.5 The IP Address Exhaustion Problem

The number of networks on the Internet has been approximately doubling annually for a number of years. However, the usage of the Class A, B and C networks differs greatly: nearly all of the new networks assigned in the late 1980s were Class B, and in 1990 it became apparent that if this trend continued, the last Class B network number would be assigned during 1994. On the other hand, Class C networks were hardly being used.

The reason for this trend was that most potential users found a Class B network to be large enough for their anticipated needs, since it accommodates up to 65534 hosts, whereas a class C network, with a maximum of 254 hosts, severely restricts the potential growth of even a small initial network. Furthermore, most of the class B networks being assigned were small ones. There are relatively few networks that would need as many as 65,534 host addresses, but very few for which 254 hosts would be an adequate limit. In summary, although the Class A, Class B and Class C divisions of the IP address are logical and easy to use (because they occur on byte boundaries), with hindsight they are not the most practical because Class C networks are too small to be useful for most organizations while Class B networks are too large to be densely populated by any but the largest organizations.

Table - IP network number usage between 1990 and 1994 shows the usage of network numbers between 1991 and 1994.

Table: IP network number usage between 1990 and 1994 - Source: netinfo/ip_network_allocations.95Jan from anonymous FTP site rs.internic.net

Some points about this table require explanation.

Assigned: The number of network numbers in use. The Class C figures are somewhat inaccurate, because the figures do not include many class C networks in Europe which were allocated to RIPE and subsequently assigned but which are still recorded as allocated.
Allocated: This includes all of the assigned networks and additionally, those networks which have either been reserved by IANA (for example the 63 class A networks are all reserved by IANA) or which have been allocated to regional registries by IANA and which will subsequently be assigned by those registries. For example, IANA allocated 64,783 Class C networks in August 1992, and 65,959 in July 1993.

Note: IANA actually reports a network as either assigned or allocated, but this table treats allocated as a superset of assigned so that the reader can subtract the Allocated percentage from 100 to easily determine how much ``free'' space is left.

Another way to look at these numbers is to examine the proportion of the address space which has been used: the figures in the table do not show for example that the Class A address space is as big as the rest combined, or that a single Class A network can theoretically have as many hosts as 66,000 Class C networks. Figure - IP Address Space Usage shows the usage of the address space from this point of view. The full pie represents a 32-bit flat address space, that is 2(superscript 32) or 4,294,967,296 addresses. The Class A, B and C address spaces are divided as follows:

Assigned: The portion of the address space which is found in networks which have been assigned. The true assigned figure is actually much lower than this, because each network itself is likely to have a considerable amount of free space, but since this space cannot be used outside the organization which runs the network, it is effectively used.
The assigned space is shown with ``exploded'' slices: the combined area of the exploded slices represents the proportion of the IP address space in use.
Allocated: The portion of the address space which is found in networks which have been allocated but not assigned plus the portion of the address space lost to reserved network numbers like the Class A networks 0 (this network) and 127 (loopback).
The allocated space is shown with shaded but not exploded slices.
Unallocated: The remaining Class A, B and C space in each is ``free''; it is shown with an unshaded slice. The Class A, B and C slices are shown with progressively thinner borders. Class A starts at ``3 o'clock'' and moves counter-clockwise through Classes B, C and so on.
Class D: One sixteenth of the total space is absorbed by Class D multicast addresses. These are treated as being used and so the Class D slice is exploded.
Class E: The remaining sixteenth of the address space: that part corresponding to IP addresses with the four high-order bits set is reserved by IANA.

Figure: IP Address Space Usage

Examination of Table - IP network number usage between 1990 and 1994 shows that since 1990, the number of assigned Class B networks has been increasing at a much lower rate than the total number of assigned networks and that the anticipated exhaustion of the Class B network numbers has not occurred. The reason for this is that the policies of the InterNIC on network number allocation were changed in late 1990 to preserve the existing address space, in particular to avert the exhaustion of the Class B address space. The new policies can be summarized as follows.

The upper half of the Class A address space is reserved indefinitely to allow for the possibility of using it for transition to a new numbering scheme.
Class B networks are only assigned to organizations which can clearly demonstrate a need for them. The same is, of course, true for Class A networks. The requirements for Class B networks are that the requesting organization
- Has a subnetting plan which documents more than 32 subnets within its organizational network
  and
- Has more than 4096 hosts
Any requirements for a Class A network would be handled on an individual case basis.
Organizations which do not fulfill the requirements for a Class B network are assigned a consecutively numbered block of Class C network numbers.
The lower half of the Class C address space (network numbers 192.0.0 through 223.255.245) is divided into 8 blocks which are allocated to regional authorities as follows:

192.0.0 - 193.255.255
Multi-regional
194.0.0 - 195.255.255
Europe
196.0.0 - 197.255.255
Others
198.0.0 - 199.255.255
North America
200.0.0 - 201.255.255
Central and South America
202.0.0 - 203.255.255
Pacific Rim
204.0.0 - 205.255.255
Others
206.0.0 - 207.255.255
Others
The ranges defined as ``Others'' are to be where flexibility outside the constraints of regional boundaries is required. The range defined as ``multi-regional'' includes the Class C networks which had been assigned before this new scheme was adopted. The 192 networks were assigned by the InterNIC and the 193 networks had been previously allocated to RIPE in Europe.
The upper half of the Class C address space (208.0.0 to 223.255.255) remains unassigned and unallocated.
Where an organization has a range of class C network numbers, the range provided is assigned is a bit-wise contiguous range of network numbers, and the number of networks in the range is a power of 2. That is, all IP addresses in the range have a common prefix, and every address with that prefix is within the range. For example, a European organization requiring 1500 IP addresses would be assigned 8 Class C network numbers (2048 IP addresses) from the number space reserved for European networks (194.0.0 through 195.255.255) and the first of these network numbers would be divisible by eight. A range of addresses satisfying these rules would be 194.32.136 through 194.32.143, in which case the range would consist of all of the IP addresses with the 21-bit prefix 194.32.136, or B'110000100010000010001'.
The maximum number of network numbers assigned contiguously is 64, corresponding to a prefix of 18 bits. An organization requiring more than 4096 addresses but less than 16,384 addresses can request either a Class B or a range of Class C addresses. In general, the number of Class C networks assigned is the minimum required to provide the necessary number of IP addresses for the organization on the basis of a two-year outlook. However, in some cases, an organization may request multiple networks to be treated separately. For example, an organization with 600 hosts would normally be assigned four class C networks. However, if those hosts were distributed across 10 token-ring LANs with between 50 and 70 hosts per LAN, such an allocation would cause serious problems, since the organization would have to find 10 subnets within a 10-bit local address range. This would mean at least some of the LANs having a subnet mask of 255.255.255.192 which allows only 62 hosts per LAN. The intent of the rules is not to force the organization into complex subnetting of small networks, and the organization should request 10 different Class C numbers, one for each LAN.

The current rules are to be found in RFC 1466 - Guidelines for Management of IP Address Space. The reasons for the rules for the allocation of Class C network numbers will become apparent in the next section. The use of Class C network numbers in this way has averted the exhaustion of the Class B address space, but it is not a permanent term solution to the overall address space constraints that are fundamental to IP. The long-term solution is discussed in IP: The Next Generation (IPng).

2.2.6 Private Internets

Another approach to conservation of the IP address space is described in RFC 1597 - Address Allocation for Private Internets. Briefly, it relaxes the rule that IP addresses are globally unique by reserving part of the address space for networks which are used exclusively within a single organization and which do not require IP connectivity to the Internet. There are three ranges of addresses which have been reserved by IANA for this purpose:

10 A single Class A network

172.16 through 172.31 16 contiguous Class B networks

192.168.0 through 192.168.255 256 contiguous Class C networks

Any organization may use any addresses in these ranges without reference to any other organization. However, because these addresses are not globally unique, they cannot be referenced by hosts in another organization and they are not defined to any external routers. Routers in networks not using private addresses, particularly those operated by Internet service providers, are expected to quietly discard all routing information regarding these addresses. Routers in an organization using private addresses are expected to limit all references to private addresses to internal links; they should neither advertise routes to private addresses to external routers nor forward IP datagrams containing private addresses to via external routers. Hosts having only a private IP address do not have IP-layer connectivity to the Internet. This may be desirable and may even be a reason for using private addressing. All connectivity to external Internet hosts must be provided with application gateways.

2.2.7 Classless Inter-Domain Routing (CIDR)

There is a major problem with the use of a range of Class C addresses instead of a single Class B addresses: each network must be routed separately. Standard IP routing understands only the class A, B and C network classes. Within each of these types of network, subnetting can be used to provided better granularity of the address space within each network, but there is no way to specify that multiple class C networks are actually related. The result of this is termed the routing table explosion problem: a Class B network of 3000 hosts requires one routing table entry at each backbone router, but if the same network is addressed as a range of Class C networks, it requires 16 entries.

The solution to this problem is a scheme called Classless Inter-Domain Routing (CIDR). CIDR is a proposed standard protocol with a status of elective.

CIDR does not route according to the class of the network number (hence the term classless) but solely according to the high order bits of the IP address which are termed the IP prefix. Each CIDR routing entry contains a 32 bit IP address and a 32 bit network mask, which together give the length and value of the IP prefix. This can be represented as <IP_address network_mask>. For example <194.0.0.0 254.0.0.0> represents the 7 bit IP prefix B'1100001'. CIDR handles the routing for a group of networks with a common prefix with a single routing entry. This is the reason why multiple Class C network numbers assigned to a single organization have a common prefix. This process of combining multiple networks into a single entry is termed address aggregation or address summarization. It is also called supernetting because routing is based upon network masks which are shorter than the natural network mask of the IP address, in contrast to subnetting where the network masks are longer than the natural mask.

Unlike subnet masks, which are normally contiguous but may have a discontiguous local part, supernet masks are always contiguous.

If IP addresses are represented with a tree showing the routing topology, with each leaf of the tree representing a group of networks which are considered as a single unit (called a routing domain) and the IP addressing scheme is chosen so that each fork in this tree corresponds to an increase in the length of the IP prefix, then CIDR allows address aggregation to be performed very efficiently. For example, if a router in North America routes all European traffic via a single link, then a single routing entry for <194.0.0.0 254.0.0.0> includes the group of Class C network addresses assigned to Europe as described above. This single entry takes the place of all the entries for all of the assigned network numbers in this range which is a possible maximum of 2(superscript 17) or 131,072 numbers. At the European end of this link, there are routing entries with longer prefixes which map to the European network topology but this routing information is not needed at the American end of the link. CIDR uses a longest match is best approach, so if the router in the US needs to make an exception for one range of addresses, such as the 64 network range <195.1.64.0 255.555.192.0> it needs just one additional entry, since this entry overrides the more general (shorter) one for those networks it contains. It is apparent from this example that as the usage of the IP address space increases, particularly the Class C address space, the benefits of CIDR increase as well, provided that the assignment of addresses follows the network topology. The existing state of the IP address space does not follow such a scheme since it pre-dates the development of CIDR. However, new Class C addresses are being assigned in such a way as to enable CIDR, and this should have the effect of alleviating the routing table explosion problem in the near term. In the longer term, a restructuring of the IP address space along topological lines may be necessary. This would involve the re-numbering of a large number of networks, implying an enormous amount of implementation effort, and so would be a gradual process.

It is an over-simplification to assume that routing topology can be represented as a simple tree; although most routing domains have a single attachment which provides access to the rest of the Internet, there are also many domains which have multiple attachments. Routing domains of these two types are called single-homed and multi-homed. Furthermore, the topology is not static. Not only are new organizations joining the Internet at an ever-increasing rate, but existing organizations may change places within the topology, for example if they change between service providers for commercial or other reasons. Although such cases complicate the practical implementation of CIDR-based routing and reduce the efficiency of address aggregation that can be achieved, they do not invalidate the approach.

The current Internet address allocation policies and the assumptions on which those policies were based is described in RFC 1518 - An Architecture for IP Address Allocation with CIDR. They can be summarized as follows:

IP address assignment reflects the physical topology of the network and not the organizational topology; wherever organizational and administrative boundaries do not match the network topology, they should not be used for the assignment of IP addresses.
In general, network topology will closely follow continental and national boundaries and therefore IP addresses should be assigned on this basis.
There will be a relatively small set of networks which carry a large amount of traffic between routing domains and which will be interconnected in a non-hierarchical way and which will cross national boundaries. These are referred to as transit routing domains (TRDs). Each TRD will have a unique IP prefix. TRDs will not be organized in a hierarchical way where there is no appropriate hierarchy. However, wherever a TRD is wholly within a continental boundary, its IP prefix should be an extension of the continental IP prefix.
There will be many organizations which have attachments to other organizations which are for the private use of those two organizations and which do not carry traffic intended for other domains (transit traffic). Such private connections do not have a significant effect on the routing topology and can be ignored.
The great majority of routing domains will be single-homed. That is, they will be attached to a single TRD. They should be assigned addresses which begin with that TRD's IP prefix. All of the addresses for all single-homed domains attached to a TRD can therefore be aggregated into a single routing table entry for all domains outside that TRD.
Note: This implies that if an organization changes its Internet service provider, it should change all of its IP addresses. This is not the current practice, but the widespread implementation of CIDR is likely to make it much more common.
There are a number of address assignment schemes which can be used for multi-homed domains. These include:
- The use of a single IP prefix for the domain. External routers must have an entry for the organization which lies partly or wholly outside the normal hierarchy. Where a domain is multi-homed, but all of the attached TRDs themselves are topologically nearby, it would be appropriate for the domain's IP prefix to include those bits common to all of the attached TRDs. For example, if all of the TRD's were wholly within the United States, an IP prefix implying an exclusively North American domain would be appropriate.
- The use of one IP prefix for each attached TRD, with hosts in the domain having IP addresses containing the IP prefix of the most appropriate TRD. The organization appears to be a set of routing domains.
- Assigning an IP prefix from one of the attached TRDs. This TRD becomes a default TRD for the domain but other domains may explicitly route by one of the alternative TRDs.
- The use of IP prefixes to refer to sets of multi-homed domains having the TRD attachments. For example there may be an IP prefix to refer to single-homed domains attached to network A, one to refer to single-homed domains attached to network B and one to refer to dual-homed domains attached to networks A and B.
Each of these has various advantages, disadvantages and side effects. For example, the first approach tends to result in inbound traffic entering the target domain closer to the sending host than the second approach, and therefore in a larger proportion of the network costs being incurred by the receiving organization.
Because multi-homed domains may vary greatly in character and none of the above schemes is suitable for all such domains, there is no single policy which is best and RFC 1518 does not specify any rules for choosing between them.

2.2.7.1 CIDR Implementation

The implementation of CIDR in the Internet is primarily based on Border Gateway Protocol Version 4 (see Border Gateway Protocol Version 4 (BGP-4)). In future CIDR will also be implemented by a variant of the ISO standard Inter-Domain Routing Protocol (IDRP, ISO 10747), called IDRP for IP, which is closely related to BGP-4.

The implementation strategy, described in RFC 1520 - Exchanging Routing Information Across Provider Boundaries in the CIDR Environment involves a staged process through the routing hierarchy beginning with backbone routers. Network service providers are divided into four types:

Type 1: Those which cannot employ any default inter-domain routing.
Type 2: Those which use default inter-domain routing but which require explicit routes for a substantial proportion of the assigned IP network numbers.
Type 3: Those which use default inter-domain routing and supplement it with a small number of explicit routes.
Type 4: Those which perform all inter-domain routing using only default routes.

The CIDR implementation involves an implementation beginning with the Type 1 network providers, then the Type 2 and finally the Type 3 ones. CIDR has already been widely deployed in the backbone and over 9,000 class-based routes have been replaced by approximately 2,000 CIDR-based routes.

2.2.7.2 References

RFC 1467 - Status of CIDR Deployment in the Internet
RFC 1517 - Applicability Statement for the Implementation of Classless Inter-Domain Routing (CIDR)
RFC 1518 - An Architecture for IP Address Allocation with CIDR
RFC 1519 - Classless Inter-Domain Routing (CIDR): an Address Assignment and Aggregation Strategy
RFC 1520 - Exchanging Routing Information Across Provider Boundaries in the CIDR Environment

2.2.8 Domain Name System

The Domain Name System protocol is a standard protocol (STD 13). Its status is recommended. It is described in:

RFC 1034 - Domain names - concepts and facilities
RFC 1035 - Domain names - implementation and specification

The early internet configurations required users to use only numeric IP addresses. Very quickly, this evolved to the use of symbolic host names. For example, instead of typing TELNET 128.12.7.14 one could type TELNET eduvm9, and eduvm9 is then translated in some way to the IP address 128.12.7.14. This introduces the problem of maintaining the mappings between IP addresses and high-level machine names in a coordinated and centralized way.

Initially, host names to address mappings were maintained by the Network Information Center (NIC) in a single file (HOSTS.TXT) which was fetched by all hosts using FTP. This is called a flat namespace.

Due to the explosive growth in the number of hosts, this mechanism became too cumbersome (consider the work involved in the addition of just one host to the Internet) and was replaced by a new concept: Domain Name System. Hosts may continue to use a local flat namespace (the HOSTS.LOCAL file) instead of or in addition to the Domain Name System, but outside small networks, the Domain Name System is practically essential. The Domain Name System allows a program running on a host to perform the mapping of a high-level symbolic name to an IP address for any other host without the need for every host to have a complete database of host names.

For the remainder of this section we will examine how the Domain Name System works from the user's point of view. See Domain Name System (DNS) for more details about the implementation of domain name resolvers and servers and the types of records stored in the Domain Name System.

2.2.8.1 The Hierarchical Namespace

Consider the internal structure of a large organization. As the chief executive cannot do everything, the organization will probably be partitioned into divisions, each of them having autonomy within certain limits. Specifically, the executive in charge of a division has authority to make direct decisions, without permission from his chief executive.

Domain names are formed in a similar way, and will often reflect the hierarchical delegation of authority used to assign them. For example, consider the name

                   lcs.mit.edu

Here, lcs.mit.edu is the lowest-level domain name, a subdomain of mit.edu, which again is a subdomain of edu (education) which is called a top-level domain. We can also represent this naming concept by a hierarchical tree (see Figure - Hierarchical Namespace).

Figure: Hierarchical Namespace - This figure shows the chain of authority in assigning domain names. This tree is only a tiny fraction of real namespace.

Figure - The Generic Top-level Domains shows some of the top-level domains. The single domain above the ``top-level'' domains has no name and is referred to as the root domain. The complete structure is explained in the following sections.

2.2.8.2 Fully Qualified Domain Names (FQDNs)

When using the Domain Name System, it is common to work with only a part of the domain hierarchy, for example the ral.ibm.com domain. The Domain Name System provides a simple method of minimizing the typing necessary in this circumstance. If a domain name ends in a dot (for example, wtscpok.itsc.pok.ibm.com.) it is assumed to be complete. This is termed a Fully Qualified Domain Name (FQDN) or an absolute domain name. If, however it does not end in a dot (for example, wtscpok.itsc) it is incomplete and the DNS resolver (see below) may complete this, for example by appending a suffix such as .pok.ibm.com to the domain name. The rules for doing this are implementation dependent and locally configurable.

2.2.8.3 Generic Domains

The three-character top-level names are called the generic domains or the organizational domains.

Domain Name: Meaning
edu: Educational institutions
gov: Government institutions
com: Commercial organizations
mil: Military groups
net: Networks
int: International organizations
org: Other organizations

Figure: The Generic Top-level Domains

Since the Internet began in the United States, the organization of the hierarchical namespace initially had only US organizations at the top of the hierarchy, and it is still largely true that the generic part of the namespace contains US organizations. However, only the .gov and .mil domains are restricted to the US.

2.2.8.4 Country Domains

There are also top-level domains named for the each of the ISO 3166 international 2-character country codes (from ae for the United Arab Emirates to zw for Zimbabwe). These are called the country domains or the geographical domains. Many countries have their own second-level domains underneath which parallel the generic top-level domains. For example, in the United Kingdom, the domains equivalent to the generic domains .com and .edu are .co.uk and .ac.uk (``ac'' is an abbreviation for academic). There is a .us top-level domain, which is organized geographically by state (for example, .ny.us refers to the state of New York). See RFC 1480 for a detailed description of the .us domain.

2.2.8.5 Mapping Domain Names to IP Addresses

The mapping of names to addresses, a process called domain name resolution, is provided by independent, cooperating systems called name servers. A name server is a server program answering requests from a client called a name resolver.

Each name resolver is configured with a name server to use (and possibly a list of alternatives to contact if the primary is unavailable). Figure - Domain Name Resolution shows schematically how a program uses a name resolver to convert a host name to an IP address. A user provides a host name, and the user program uses a library routine, called a stub resolver, to communicate with a name server which resolves the host name to an IP address and returns it to the stub, which returns it to the main program. The name server may obtain the answer from its name cache, its own database or another name server.

Figure: Domain Name Resolution

2.2.8.6 Mapping IP Addresses to Domain Names - Pointer Queries

The Domain Name System provides for a mapping of symbolic names to IP addresses and vice versa. While it is a simple matter in principle to search the database for an IP address given its symbolic name because of the hierarchical structure, the reverse process cannot follow the hierarchy. Therefore, there is another namespace for the reverse mapping. It is found in the domain in-addr.arpa (``arpa'' because the Internet was originally the ARPA Internet). Because IP addresses are normally written in dotted decimal format, there is one layer of domains for each hierarchy. However, because domain names have the least-significant parts of the name first but dotted decimal format has the most significant bytes first, the dotted decimal address is shown in reverse order. For example, the domain in the domain name system corresponding to the IP address 129.34.139.30 is 30.139.34.129.in-addr.arpa. Given an IP address, the Domain Name System can be used to find the matching host name. A domain name query to find the host names associated with an IP address is called a ``pointer query''.

2.2.8.7 Other Uses for the Domain Name System

The Domain Name System is designed to be capable of storing a wide range of information. One of the most important of these is mail exchange information, which is used for electronic mail routing. This provides two facilities: transparent re-routing of mail to a different host than that specified and the implementation of mail gateways where electronic mail can be received by a mail gateway and re-directed using a different mail protocol from the intended recipient. This is explained in more detail in SMTP and the Domain Name System.

2.2.8.8 References

For more details on the implementation of the Domain Name System and the format of the messages between Name Servers, see Domain Name System (DNS). The following RFCs define the Domain Name System standard and the information kept in the system.

RFC 1032 - Domain Administrator's Guide
RFC 1033 - Domain Administrator Operations Guide
RFC 1034 - Domain Names - Concepts and Facilities
RFC 1035 - Domain Names - Implementation and Specification
RFC 1101 - DNS Encoding of networks names and other types
RFC 1183 - New DNS RR Definitions
RFC 1706 - DNS NSAP Resource Records

Table of Contents Internet Protocol (IP)