As the title suggests, I'm trying to retrieve the domain from a string using a JavaScript regular expression.

Take the following strings:

String                                  ==>     Return
"google"                                ==>     null
"google.com"                            ==>     "google.com"
"www.google.com"                        ==>     "www.google.com"
"ftp://ftp.google.com"                  ==>     "ftp.google.com"
"http://www.google.com"                 ==>     "www.google.com"
"http://www.google.com/"                ==>     "www.google.com"
"https://www.google.com/"               ==>     "www.google.com"
"https://www.google.com.sg/"            ==>     "www.google.com.sg"
"https://www.google.com.sg/search/"     ==>     "www.google.com.sg"
"*://www.google.com.sg/search/"         ==>     "www.google.com.sg"

I have read "Regex to find domain name without www - Stack Overflow" and "Extract root domain name from string - Stack Overflow", but they are too complicated, so I tried writing my own regular expression:

var re = new RegExp("[\\w]+[\\.\\w]+"); // same as the literal /[\w]+[\.\w]+/
re.exec(document.URL);

It works for "google.com", "www.google.com", and "www.google.com.sg", but it returns "http" for "http://google.com/", "http://www.google.com/", and so on.
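The behaviour described above can be reproduced in isolation. The following is a minimal sketch, assuming literal strings in place of `document.URL` so it runs outside a browser; `firstMatch` is a hypothetical helper name, not part of the original attempt. The pattern has no notion of a scheme, so the leftmost run of word characters wins, and matching stops at `:` because it is neither a word character nor a dot.

```javascript
// The pattern from the question, written as a regex literal.
const re = /[\w]+[\.\w]+/;

// Hypothetical helper: return the matched text, or null when nothing matches.
function firstMatch(input) {
  const m = re.exec(input);
  return m === null ? null : m[0];
}

console.log(firstMatch("google.com"));             // "google.com"
console.log(firstMatch("http://www.google.com/")); // "http"
console.log(firstMatch("google"));                 // "google", not null
```

The last call shows a second mismatch with the table above: a bare word with no dot still matches, because the pattern never requires a dot.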

Since I'm new to regular expressions, I can't figure out what's wrong. Any ideas?

Thanks in advance!

2 Answers

Use this regex:

/(?:[\w-]+\.)+[\w-]+/

Here is a regex demo.

Sample:

>>> var regex = /(?:[\w-]+\.)+[\w-]+/
>>> regex.exec("google.com")
... ["google.com"]
>>> regex.exec("www.google.com")
... ["www.google.com"]
>>> regex.exec("ftp://ftp.google.com")
... ["ftp.google.com"]
>>> regex.exec("http://www.google.com")
... ["www.google.com"]
>>> regex.exec("http://www.google.com/")
... ["www.google.com"]
>>> regex.exec("https://www.google.com/")
... ["www.google.com"]
>>> regex.exec("https://www.google.com.sg/")
... ["www.google.com.sg"]
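
To get the return values listed in the question (a plain string, or null when there is no domain), the regex can be wrapped in a small function. This is a minimal sketch; `extractDomain` is a hypothetical name chosen for illustration.

```javascript
// The regex from this answer: one or more "label." groups, then a final label.
const domainRegex = /(?:[\w-]+\.)+[\w-]+/;

// Hypothetical helper: return the first dotted name in the input, or null.
function extractDomain(input) {
  const match = domainRegex.exec(input);
  return match === null ? null : match[0];
}

console.log(extractDomain("google"));                            // null
console.log(extractDomain("google.com"));                        // "google.com"
console.log(extractDomain("https://www.google.com.sg/search/")); // "www.google.com.sg"
console.log(extractDomain("*://www.google.com.sg/search/"));     // "www.google.com.sg"
```

Because at least one dot is required, "google" yields null. The pattern returns the first dotted name anywhere in the string, so a URL whose host has no dot but whose path does (for example "http://localhost/index.html") would yield "index.html".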
answered 2014-08-15T08:08:01.647

You can use this regex in JavaScript; the domain is captured in group 1:

\b(?:(?:https?|ftp):\/\/)?([^\/\n]+)\/?

RegEx Demo
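
With this pattern the full match includes the scheme and trailing slash, so the domain has to be read from the capture group. A minimal sketch, assuming a hypothetical helper named `extractHost`:

```javascript
// The regex from this answer: optional scheme, then everything up to the next "/".
const hostRegex = /\b(?:(?:https?|ftp):\/\/)?([^\/\n]+)\/?/;

// Hypothetical helper: return capture group 1, or null when nothing matches.
function extractHost(input) {
  const match = hostRegex.exec(input);
  return match === null ? null : match[1];
}

console.log(extractHost("http://www.google.com/"));            // "www.google.com"
console.log(extractHost("https://www.google.com.sg/search/")); // "www.google.com.sg"
console.log(extractHost("ftp://ftp.google.com"));              // "ftp.google.com"
console.log(extractHost("google"));                            // "google"
```

Unlike the table in the question, this returns "google" rather than null for a bare word, since the pattern does not require a dot.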

answered 2014-08-15T08:20:34.577