By W.Zh May 2015
Exception in thread “main” java.lang.UnsupportedClassVersionError: org/apache/solr/util/SolrCLI : Unsupported major.minor version 51.0
After unzip the solr, you will try to start using:
JAVA version is less than 7
1. Use the Topic Modelling. reflect the text content of the articles to the dimensions of the topic, then you can try to calculate the similarity.
2. Use the Cosine Similarity, Steps like this:
（1）use the TF-IDF to find out the key words of tow articles
（2）combine the the two key words set into one set, and get the frequency of the each keys.
（3）create the frequency vectors for two articles.
（4）caculate the Cosine Similarity of each vector, then the bigger , the similar they two.
TF-IDF (term frequency–inverse document frequency)