The traditional TextRank algorithm is based on a graph model, and the extracted abstract generated by it does not deviate from the source text, but there may be issues such as polysemy. The BERT model can capture contextual semantics well. It can directly convert sentences in text into vector form, which can better solve the problem of polysemy. BiLSTM can obtain further information, IDCNN can better handle overfitting problems compared to traditional convolutional neural networks, and CRF can better solve the problem of sentence coherence. The model proposed in this article has been compared with other extraction based text summarization methods on the TTNews dataset, and experiments have shown that the method proposed in this article can obtain meaningful text summaries with significant optimization.
|