Chinese Multi-class Classification with BERT

Imported directly from my own work notes; since I work at a foreign company, they are written in English.

Steps:

  1. git clone https://github.com/google-research/bert
  2. prepare the data and download a pre-trained Chinese model (chinese_L-12_H-768_A-12)
  3. modify code in run_classifier.py
    1. add a new processor

         
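The original code snippet did not survive the import. Below is a sketch of what such a processor might look like, assuming a TSV dataset with one `label<TAB>text` row per line and three classes; adjust `get_labels()` to your data. `InputExample` is stubbed so the sketch runs standalone — inside run_classifier.py it already exists, and the processor should subclass `DataProcessor` and reuse its `_read_tsv` helper.

```python
import csv
import os


class InputExample(object):
  """Stand-in for run_classifier.InputExample (already defined in the repo)."""

  def __init__(self, guid, text_a, text_b=None, label=None):
    self.guid = guid
    self.text_a = text_a
    self.text_b = text_b
    self.label = label


class MultiClassProcessor(object):
  """New processor for a multi-class TSV dataset: `label<TAB>text` per line.

  In run_classifier.py, subclass DataProcessor instead of object.
  """

  def get_train_examples(self, data_dir):
    return self._create_examples(
        self._read_tsv(os.path.join(data_dir, "train.tsv")), "train")

  def get_dev_examples(self, data_dir):
    return self._create_examples(
        self._read_tsv(os.path.join(data_dir, "dev.tsv")), "dev")

  def get_test_examples(self, data_dir):
    return self._create_examples(
        self._read_tsv(os.path.join(data_dir, "test.tsv")), "test")

  def get_labels(self):
    # Replace with your real label set; a three-class task is assumed here.
    return ["0", "1", "2"]

  def _read_tsv(self, input_file):
    # DataProcessor._read_tsv in the repo does essentially this.
    with open(input_file, "r", encoding="utf-8") as f:
      return [line for line in csv.reader(f, delimiter="\t")]

  def _create_examples(self, lines, set_type):
    examples = []
    for i, line in enumerate(lines):
      guid = "%s-%s" % (set_type, i)
      # Test files may lack gold labels; fall back to a dummy label.
      label = line[0] if set_type != "test" else "0"
      examples.append(InputExample(guid=guid, text_a=line[1], label=label))
    return examples
```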

    2. add the processor in main function

         
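This snippet was also lost on import. Registering the processor is a one-line change to the `processors` dict in `main()` of run_classifier.py, keyed by the lowercase value passed as `--task_name`. The other entries are the ones shipped with the repo, stubbed here so the fragment runs standalone:

```python
# Existing processor classes in run_classifier.py, stubbed for illustration.
class ColaProcessor: ...
class MnliProcessor: ...
class MrpcProcessor: ...
class XnliProcessor: ...
class MultiClassProcessor: ...  # the new processor class


# Inside main(), map the lowercase --task_name value to the processor class:
processors = {
    "cola": ColaProcessor,
    "mnli": MnliProcessor,
    "mrpc": MrpcProcessor,
    "xnli": XnliProcessor,
    "multiclass": MultiClassProcessor,  # new entry
}
```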

   

Train and predict

  1. train

    python run_classifier.py \
      --task_name=multiclass \
      --do_train=true \
      --do_eval=true \
      --data_dir=/home/wxl/bertProject/bertTextClassification/data \
      --vocab_file=/home/wxl/bertProject/chinese_L-12_H-768_A-12/vocab.txt \
      --bert_config_file=/home/wxl/bertProject/chinese_L-12_H-768_A-12/bert_config.json \
      --init_checkpoint=/home/wxl/bertProject/chinese_L-12_H-768_A-12/bert_model.ckpt \
      --max_seq_length=128 \
      --train_batch_size=16 \
      --learning_rate=2e-5 \
      --num_train_epochs=100.0 \
      --output_dir=/home/wxl/bertProject/bertTextClassification/outputThree/

       

    If the run succeeds, you will see evaluation results like the following:

       

       

       

  2. predict

    python run_classifier.py \
      --task_name=multiclass \
      --do_predict=true \
      --data_dir=/home/wxl/bertProject/bertTextClassification/data \
      --vocab_file=/home/wxl/bertProject/chinese_L-12_H-768_A-12/vocab.txt \
      --bert_config_file=/home/wxl/bertProject/chinese_L-12_H-768_A-12/bert_config.json \
      --init_checkpoint=/home/wxl/bertProject/bertTextClassification/outputThreeV1 \
      --max_seq_length=128 \
      --output_dir=/home/wxl/bertProject/bertTextClassification/mulitiPredictThreeV1/
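Prediction writes `test_results.tsv` to `--output_dir`, with one tab-separated row of per-class probabilities per input example. A minimal sketch for turning those rows into predicted class indices (the file name matches run_classifier.py's output; the helper function itself is my own):

```python
import csv


def read_predictions(results_path):
    """Return the argmax class index for each row of test_results.tsv."""
    predicted = []
    with open(results_path, "r", encoding="utf-8") as f:
        for row in csv.reader(f, delimiter="\t"):
            probs = [float(p) for p in row]
            predicted.append(probs.index(max(probs)))  # argmax = class index
    return predicted
```

The class indices follow the order returned by the processor's `get_labels()`.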
