The InetProtUtils API provides functionality to handle URIs, such as extracting URI components, constructing and modifying a URI, text utilities, and URI component utilities required to work with URIs.
The clients that make use of this API must be familiar with the following standards:
The component has the following key concepts:
Universal Resource Identifier (URI) is a formatted string that identifies the location and name of a resource, for instance a web page address.
Let us consider the following URI:
http://user:[email protected]:80/top_path/foo.htm?checkme#ref
The URI can be split into components as follows:
In an URI, the user information, host and port information put together is called authority. Format for authority appears as the following:
[[email protected]]host[:port]
where, user-info may consist of a user name and, optionally, scheme-specific information about how to gain authorization to access the server, for example, password. port specifies the port number.
The InetProtUtils API (inetprotutils.dll) is used by the following components:
The following classes are used for creating, parsing and modifying the URI:
Class Name | Description |
---|---|
Creates URI from parts and modifies the URI. |
|
Modifies the authority and creates authority from parts. |
|
Retrieves information about the URI such as extract, compare, validate etc. (the non-modifying functionality). |
|
Retrieves information about the authority URI such as extract, compare, validate etc. |
|
Parses a text URI. |
|
Parses a text authority. |
The following classes are used for parsing delimited data in the URI:
Class Name | Description |
---|---|
Creates a delimited path, where components of the path are delimited by '/' . |
|
Creates a delimited query, where components of the query are delimited by '&'. |
|
Creates a delimited path segment, where components of the path segment are delimited by ';'. |
|
Parses path delimited by a '/'. |
|
Parses query delimited by a '&'. |
|
Parses path segments delimited by a ';'. |
|
Encodes and decodes excluded and reserved URI characters. It can be used with 8 bit and 16 bit descriptors. |
|
Parses HTTP header text. |
|
Provides extra utilities for URI/Authority classes such as creating URI from Unicode descriptor. |
The following classes are used for encoding and decoding WSP headers:
Class Name | Description |
---|---|
The class is used to encode one header field at a time with all its values and parameters. |
|
The class is used to separate a WSP buffer into WSP header name/value pairs. |
|
The class is used to convert WSP header data into binary strings. |
|
The class is used to convert the binary data into integers, strings, dates, etc. |
|
The class holds the name and value pair of WSP header field. |
The following diagram shows the InetProtUtils classes and their relationships.
Applications that use HTTP transport framework use InetProtUtils API. The utilities may also be useful for applications which deal with internet strings or URIs, such as mail or IM applications.
The following tasks can be performed using InetProtUtils:
Creating an URI
URI can be created in different ways. It can be created from parts, from file or by resolving two URIs using CUri8 class.
For more information on creating a URI, refer to Creating an URI tutorial.
Parsing an URI
To check if the components in the URI are syntactically correct as per the four main components and fragment identifier, parse the URI. Use TUriParser8 class to parse an URI.
For more information on parsing, refer to Parsing an URI tutorial.
URI classes namely, CUri8, TUriC8 and TUriParser8 provide functionalities for parsing generic and SIP URIs. CUri8 class provides a reference to a TUriC8 object so that the non-modifying functionality can be used. TUriC8 is used to obtain information about an URI.
Extracting URI components
TUriC8 class provides extracting methods to extract components from the URI. TAuthority8 class extracts the authority component.
For more information on getting URI components, refer to Extracting the URI Components tutorial.
Generating the filename from an URI
A fully qualified file name can be generated from a file URI object. TUriC8::GetFileNameL() provides this functionality. The path component encodes the file's location on the file system.
For more information, refer to Generating File Name from URI tutorial.
Validating a URI
Use TUriParser8::Validate() to check if components in a given URI are valid. This validates only SIP and SIPS scheme components.
For more information, refer to Validating an URI tutorial.
Parsing delimited data
URI components are separated into segments using delimiters. Parse this delimited data. TDelimitedXxxxParser8 classes provide the functionality to parse the delimited data, extract the current segment and parse the string for the next segment.
For more information, refer to Parsing Delimited Data tutorial.
Modifying the data and the delimiter
The delimiters and segments of data within an URI can be added, removed and parsed. This functionality is provided by CDelimitedXxxx8 classes.
For more information, refer to Modifying the Data and Delimiter tutorial.
Escape encoding and decoding unsafe characters in an URI
The reserved and unsafe data in the URI is escape encoded and decoded using EscapeUtils class. It also supports converting Unicode data (16-bit descriptor) into UTF8 data (8-bit descriptor) and vice-versa. For more information, refer to Escape Encoding and Decoding and Converting between Unicode and Utf8 tutorials.
Manipulating the URI data
The data in an URI can be manipulated using various text parsing utilities that are used typically in HTTP headers. InetProtTextUtils provides functionality to:
remove the leading/trailing contiguous white space characters from the descriptors
convert the decimal/hexadecimal value to descriptor and vice versa.
extract tokens from:
It supports both Unicode and UTF-8 formats. For more information on text manipulations, refer to Manipulating URI Data tutorial.
Using URI utilities
Additional functionality to create an URI from unicode descriptor is provided by UriUtils class. It also allows to translate unsafe characters to UTF8 and escape-encode.
For more information, refer to Using URI utilities.
Using datetime utilities
To store dates in universal times and parse the internet dates into TDateTime dates, use TInternetDate.
For more information, refer to Using Date and Time Utilities tutorial.
Escape encoding and decoding the WSP header
The WSP header information is encoded and decoded using CWspHeaderEncoder. It uses TWspPrimitiveEncoder to convert the data into binary strings.
For more information, refer to Encoding and Decoding WSP Header.