c# - 在 C# / .NET 中获取 url 的域名

Question

编码：

string sURL = "http://subdomain.website.com/index.htm";
MessageBox.Show(new System.Uri(sURL).Host);

给我“subdomain.website.com”

但我需要任何 url 或 web 链接的主域“website.com”。

我怎么做？

score 16 · Accepted Answer

您可以这样做以仅获取主机名的最后两段：

string[] hostParts = new System.Uri(sURL).Host.Split('.');
string domain = String.Join(".", hostParts.Skip(Math.Max(0, hostParts.Length - 2)).Take(2));

或这个：

var host = new System.Uri(sURL).Host;
var domain = host.Substring(host.LastIndexOf('.', host.LastIndexOf('.') - 1) + 1);

此方法会发现至少包含两个域名部分，但也会包含两个或更少字符的中间部分：

var host = new System.Uri(sURL).Host;
int index = host.LastIndexOf('.'), last = 3;
while (index > 0 && index >= last - 3)
{
    last = index;
    index = host.LastIndexOf('.', last - 1);
}
var domain = host.Substring(index + 1);

这将处理诸如localhost、example.com和之类的域example.co.uk。这不是最好的方法，但至少它使您免于构建庞大的顶级域列表。

score 4 · Accepted Answer

你可以试试这个。如果您在数组中定义它，这可以处理多种根域。

string sURL = "http://subdomain.website.com/index.htm";
var host = new System.Uri(sURL).Host.ToLower();

string[] col = { ".com", ".cn", ".co.uk"/*all needed domain in lower case*/ };
foreach (string name in col)
{
    if (host.EndsWith(name))
    {
        int idx = host.IndexOf(name);
        int sec = host.Substring(0, idx - 1).LastIndexOf('.');
        var rootDomain = host.Substring(sec + 1);
    }
}

score 3 · Accepted Answer

试试正则表达式？

using System.Text.RegularExpressions;

string sURL = "http://subdomain.website.com/index.htm";
string sPattern = @"\w+.com";

// Instantiate the regular expression object.
Regex r = new Regex(sPattern, RegexOptions.IgnoreCase);

// Match the regular expression pattern against a text string.
Match m = r.Match(sUrl);
if (m.Success)
{
    MessageBox.Show(m.Value);
}

c# - 在 C# / .NET 中获取 url 的域名

3 回答 3

试试正则表达式？

Related

Reference