js 正則匹配（去掉html標籤）

時間 2019-11-11

原文原文鏈接

1，獲得網頁上的連接地址：html

string matchString = @"<a[^>]+href=\s*(?:'(?<href>^']+)'|""(?<href>[^""]+)""|(?<href>[^>\s]+))\s*[^>]*>";

2，獲得網頁的標題：函數

string matchString = @"<title>(?<title>.*)</title>";

3，去掉網頁中的全部的html標記：code

string temp = Regex.Replace(html, "<[^>]*>", ""); //html是一個要去除html標記的文檔htm

4, string matchString = @"<title>([\S\s\t]*?)</title>";
5, js去掉全部html標記的函數：rem

function delHtmlTag(str)
{
      return str.replace(/<[^>]+>/g,"");//去掉全部的html標記
}

6. 統計字數文檔

t = $('.remarktext').html().replace(/<[^>]+>/g,"").length;

相關標籤/搜索