(c)<br>1. Calculate the similarity according to tdidf, and set the threshold to filter out the websites lower than the threshold, and sort the remaining websites by PageRank as the result.<br>2. Filter the websites with lower PageRank according to the threshold set by PageRank, and sort the remaining websites by tdidf as the result.<br>3. According to the PageRank value and tdidf similarity value, appropriate weighting is carried out, and two factors are comprehensively considered. And give the total weight to establish a probability model, so that the heavy weight has a greater probability to rank first, while the small weight has a chance to be searched, increasing the diversity.<br>
正在翻译中..