2

我正在使用java wordnet 接口 JWI来尝试生成单词的上位词,将它们从特定实体概括为高阶概念/类型。

在我的部分代码中,我想确保一个单词在 wordnet 中注册,因为有时我输入 like isa,这对应于is a但 wordnet 无法识别它,并且如果它看到这个,我的程序会崩溃。这就是我现在正在尝试的方式。

public void getHypernyms( String inut_word ) throws IOException
{   

    // get the synset of 'input_word'
    //IIndexWord idxWord = dict . getIndexWord (inut_word, POS . NOUN ) ;
    IIndexWord idxWord = dict.getIndexWord( inut_word, POS.VERB );

    if( idxWord != null && idxWord.size() > 0)

    IWordID wordID = idxWord.getWordIDs().get (0); // 1st meaning
    IWord word = dict.getWord( wordID );
    ISynset synset = word.getSynset();

    // get the hypernyms
    List < ISynsetID > hypernyms = synset.getRelatedSynsets( Pointer.HYPERNYM );

    if( hypernyms.size() > 0)
    {
        // print out each hypernyms id and synonyms
        List < IWord > words;
        for( ISynsetID sid : hypernyms ) 
        {
            words = dict.getSynset( sid ).getWords ();
            System.out.print( sid + " {");
            for( Iterator <IWord> i = words.iterator(); i.hasNext(); ) 
            {
                System.out.print( i.next().getLemma() );
                if( i.hasNext() )
                    System.out.print(", ");
            }
            System.out.println("}");
        }
    }
    else
    {
        System.out.println( inut_word );
    }


}

但日食警告我method size() is not defined for type IIndexWord

我认为这意味着我需要调整@override尺寸,不是吗?但是我以前从来没有真正这样做过,怎么做呢?

java.util.List.size 具体来说。

我试图像这样实现这种方法,类似的,就像一个魅力。

public String getStem(String word)
{
    WordnetStemmer stem =  new WordnetStemmer( dict );

    List<String> stemmed_words = stem.findStems(word,  POS.VERB);

    if( stemmed_words != null && stemmed_words.size() > 0)
        return stemmed_words.get(0);
    else
        return word;
}
4

2 回答 2

1

在@Titus 的大力帮助下,我明白了

public void getHypernyms( String input_word ) throws IOException
{   
    System.out.println( "input_word : " + input_word);
    // get the synset of 'input_word'
    //IIndexWord idxWord = dict . getIndexWord (inut_word, POS . NOUN ) ;
    IIndexWord idxWord = dict.getIndexWord( input_word, POS.VERB );

    if(dict.getIndexWord( input_word, POS.VERB ) == null)
    {
        System.out.println( "got ya'!");
    }
    else
    {

        System.out.println( "idxWord: " + idxWord );
        IWordID wordID = idxWord.getWordIDs().get(0); // 1st meaning
        IWord word = dict.getWord( wordID );
        ISynset synset = word.getSynset();

        // get the hypernyms
        List < ISynsetID > hypernyms = synset.getRelatedSynsets( Pointer.HYPERNYM );

        if( hypernyms.size() > 0)
        {
            // print out each hypernyms id and synonyms
            List < IWord > words;
            for( ISynsetID sid : hypernyms ) 
            {
                words = dict.getSynset( sid ).getWords ();
                System.out.print( sid + " {");
                for( Iterator <IWord> i = words.iterator(); i.hasNext(); ) 
                {
                    System.out.print( i.next().getLemma() );
                    if( i.hasNext() )
                        System.out.print(", ");
                }
                System.out.println("}");
            }
        }


    }

}
于 2015-01-30T06:57:10.053 回答
0

我认为这是导致问题的线

 if( idxWord != null && idxWord.size() > 0)

我看了看文档,似乎IIndexWord没有size()方法。为避免空指针异常和索引越界异常,您可以进行如下检查:

if(idxWord != null && idxWord.getWordIDs() != null && idxWord.getWordIDs().size() > 0)
于 2015-01-30T06:23:59.580 回答