0

How can we parse a word document using asp.net and c#?

The question I have is with respect to parsing a word document, say a resume, to read the contents like name and email address. For this, I have an idea, but not sure of how to implement it. Can someone help me on this? My idea is:

  • Take the first line, if there are two or more words (other than curriculum vitae)separated by a single space or using a period(.) followed by a space, it is a name.

  • Take the second line, if there are two or more words separated by a single space or using a period(.) followed by a space, it is a name.

  • Consider the last line, if there are two or more words separated by a single space or using a period(.) followed by a space, it is a name.

  • Consider the second last line, if there are two or more words separated by a single space or using a period(.) followed by a space, it is a name.

I know how to read telephone number and email using patters, but I don't know how to implement finding out the name. Is there any idea?

4

1 回答 1

0

你可以试试Spire.Doc。我在我的一些项目中使用了它。

于 2013-07-04T04:26:00.037 回答