Wikitext-2: A Sub-dataset of Wikitext-103

This dataset is a subset of Wikitext-103, used mainly to test how well language models train on a small dataset. Recent neural network sequence models with softmax classifiers have achieved their best language modeling performance only with very large hidden states and