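I'm trying to classify articles by stored keywords. Each category has a list of keywords, and I want to assign an article to the category with the highest keyword count.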
    For Each keyword As String In category.Keywords
        category.tempCount += Regex.Matches(article.Item("title").InnerXml, Regex.Escape(keyword)).Count
        category.tempCount += Regex.Matches(article.Item("description").InnerXml, Regex.Escape(keyword)).Count
    Next
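This is done for each category and run for each article. I'm trying to sort the list to tell which category is the best fit for the article. However, more than one category may be the best, and it's also possible that none of the categories fit. So running this doesn't help me: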
    Categories.Sort(
        Function(article1 As ArticleCategory, article2 As ArticleCategory)
            Return article1.tempCount.CompareTo(article2.tempCount)
        End Function)
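One possible way to handle both ties and the no-match case is to skip the sort, find the highest count, and keep every category that reaches it. The following is only a minimal sketch under stated assumptions: `CategoryScore` and `BestCategories` are hypothetical names made up for illustration (a stand-in for `ArticleCategory` with just the fields needed here), and it assumes `tempCount` has already been filled in for the current article.

```vbnet
Imports System
Imports System.Collections.Generic
Imports System.Linq

Module BestCategorySketch
    ' Hypothetical stand-in for ArticleCategory: only the fields this sketch needs.
    Class CategoryScore
        Public Name As String
        Public tempCount As Integer
    End Class

    ' Returns every category tied for the highest count,
    ' or an empty list when no keyword matched at all.
    Function BestCategories(ByVal scores As List(Of CategoryScore)) As List(Of CategoryScore)
        If scores.Count = 0 Then Return New List(Of CategoryScore)()
        Dim best As Integer = scores.Max(Function(c As CategoryScore) c.tempCount)
        If best = 0 Then Return New List(Of CategoryScore)()
        Return scores.Where(Function(c As CategoryScore) c.tempCount = best).ToList()
    End Function

    Sub Main()
        ' Made-up sample data: Sports and Politics tie at 3.
        Dim scores As New List(Of CategoryScore) From {
            New CategoryScore With {.Name = "Sports", .tempCount = 3},
            New CategoryScore With {.Name = "Politics", .tempCount = 3},
            New CategoryScore With {.Name = "Tech", .tempCount = 1}
        }
        For Each c As CategoryScore In BestCategories(scores)
            Console.WriteLine(c.Name)
        Next
    End Sub
End Module
```

With this shape, an empty result means "no category fits", and a result with several entries means a tie, so the caller can decide what to do in each case.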
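Maybe I'm going about this all wrong, but so far I think I'm on the right track. (I also have a default comparison in the Category class, and that doesn't work either.)
I get an exception on the sort, most likely because the counts are not unique.
The exception I get is InvalidOperationException: Failed to compare two elements in the array. That happens when I use the comparer I built into ArticleClass: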
    Imports System.Xml

    Class ArticleCategory
        Implements IComparer(Of ArticleCategory)

        Public ReadOnly key As Int32
        Public ReadOnly Name As String
        Public ReadOnly Keywords As List(Of String)
        Public tempCount As Integer = 0

        ' Builds a category from its XML element (ckey, name, keywords).
        Public Sub New(ByVal category As XmlElement)
            key = System.Web.HttpUtility.UrlDecode(category.Item("ckey").InnerXml)
            Name = System.Web.HttpUtility.UrlDecode(category.Item("name").InnerXml)
            ' Split the comma-separated keyword list and drop empty entries.
            Dim tKeywords As Array = System.Web.HttpUtility.UrlDecode(category.Item("keywords").InnerXml).Split(",")
            Dim nKeywords As New List(Of String)
            For Each keyword As String In tKeywords
                If Not keyword.Trim = "" Then
                    nKeywords.Add(keyword.Trim)
                End If
            Next
            Keywords = nKeywords
        End Sub

        'This should be removed if you're using my solution.
        Public Function Compare(ByVal x As ArticleCategory, ByVal y As ArticleCategory) As Integer Implements System.Collections.Generic.IComparer(Of ArticleCategory).Compare
            Return String.Compare(x.tempCount, y.tempCount)
        End Function
    End Class
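A note on the exception, assuming it comes from calling `Sort()` with no arguments: `List(Of T).Sort()` uses the default comparer, which expects the elements themselves to implement `IComparable(Of T)` (or `IComparable`). Implementing `IComparer(Of T)` on the element class does not satisfy that, so duplicate counts are probably not the cause. Separately, `String.Compare` on the counts compares them as text when implicit conversions are allowed, so 10 would sort before 9. The sketch below shows one way the class could be made sortable by default. `ComparableCategory` is a hypothetical, trimmed-down class for illustration and leaves out the XML parsing.

```vbnet
Imports System
Imports System.Collections.Generic

' Hypothetical, trimmed-down category: the XML parsing is left out.
Class ComparableCategory
    Implements IComparable(Of ComparableCategory)

    Public Name As String
    Public tempCount As Integer

    ' Compares the counts numerically; a Nothing argument sorts first.
    Public Function CompareTo(ByVal other As ComparableCategory) As Integer _
        Implements IComparable(Of ComparableCategory).CompareTo
        If other Is Nothing Then Return 1
        Return tempCount.CompareTo(other.tempCount)
    End Function
End Class

Module SortSketch
    Sub Main()
        ' Made-up sample data with a duplicate count (A and C both have 2).
        Dim categories As New List(Of ComparableCategory) From {
            New ComparableCategory With {.Name = "A", .tempCount = 2},
            New ComparableCategory With {.Name = "B", .tempCount = 5},
            New ComparableCategory With {.Name = "C", .tempCount = 2}
        }
        categories.Sort()      ' ascending by tempCount; equal counts are fine
        categories.Reverse()   ' highest count first
        Console.WriteLine(categories(0).Name)   ' prints B
    End Sub
End Module
```

Sorting this way still leaves ties and the all-zero case for the caller to check, for example by comparing the first element's count with the second's and with zero.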