Tomcat7項目遷移到Tomcat8中文亂碼問題

時間 2019-12-02

標籤 tomcat7 tomcat 項目遷移 tomcat8 中文亂碼問題欄目 Tomcat 简体版

原文原文鏈接

我打算開始使用Tomcat8了,先解決中文亂碼問題,在解決其它的問題!
我的推薦:修改server.xml方式
對於SpringMVC報的錯誤我稍後在補充問題html

1.問題描述

Tomcat 7下項目切換到Tomcat 8後，出現亂碼。不管Google仍是百度，多數解決方法是server.xml設置URIEncoding=」UTF-8」,這種配置爲了解決GET請求的中文亂碼問題。對於Tomcat 7下遇到亂碼問題，這樣配置是正確的；可是對」Tomcat 7正常，切換到Tomcat 8」亂碼的狀況無效。java

2. 解決方案[我的不喜歡]

Tomcat8的server.xml配置Connector節點添加屬性URIEncoding=」ISO-8859-1」。
參考： http://tomcat.apache.org/migration-8.htmlgit

3. 官方文檔解釋

https://tomcat.apache.org/tomcat-7.0-doc/config/http.htmlapache

URIEncoding
This specifies the character encoding used to decode the URI bytes, after %xx decoding the URL. If not specified, ISO-8859-1 will be used.tomcat

https://tomcat.apache.org/tomcat-8.0-doc/config/http.htmlless

URIEncoding
This specifies the character encoding used to decode the URI bytes, after %xx decoding the URL. If not specified, UTF-8 will be used unless the org.apache.catalina.STRICT_SERVLET_COMPLIANCE system property is set to true in which case ISO-8859-1 will be used.編碼

Tomcat7對URI默認編碼是ISO-8859-1
Tomcat8對URI默認編碼是UTF-8spa

4. 關於編碼問題

4.1 Tomcat7這個URI默認的編碼帶來不少問題，下面這個應該很常見：

 
 
 
 
  
  
  
  
 
 
 
 
new String(value.getBytes("ISO-8859-1"), param);

若是server.xml配置上URIEncoding=」UTF-8」就不須要了。進而項目直接遷移到Tomcat8，不修改server.xml，或者再次加上URIEncoding=」UTF-8」也是不會有問題。code

4.2 Tomcat8是否是就是由於開發者服務端轉碼麻煩，URI默認的編碼改成」UTF-8」。

對於在Tomcat8開發項目，就簡單不少，不須要上面的那段繁瑣的代碼。可是從Tomcat7遷移上來的項目，要麼，去掉上面的代碼，要麼server.xml添加URIEncoding=」ISO-8859-1」，浪費Tomcat8一番美意。server

4.3 固然，經過判斷參數值是否亂碼，進行編碼也是很不錯的

 
 
 
 
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
 
 
 
 
import java.util.regex.Matcher;import java.util.regex.Pattern;public class ChineseUtill { private static boolean isChinese(char c) { Character.UnicodeBlock ub = Character.UnicodeBlock.of(c); if (ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS || ub == Character.UnicodeBlock.CJK_COMPATIBILITY_IDEOGRAPHS || ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS_EXTENSION_A || ub == Character.UnicodeBlock.GENERAL_PUNCTUATION || ub == Character.UnicodeBlock.CJK_SYMBOLS_AND_PUNCTUATION || ub == Character.UnicodeBlock.HALFWIDTH_AND_FULLWIDTH_FORMS) { return true; } return false; } public static boolean isMessyCode(String strName) { Pattern p = Pattern.compile("\\s*|\t*|\r*|\n*"); Matcher m = p.matcher(strName); String after = m.replaceAll(""); String temp = after.replaceAll("\\p{P}", ""); char[] ch = temp.trim().toCharArray(); float chLength = 0 ; float count = 0; for (int i = 0; i < ch.length; i++) { char c = ch[i]; if (!Character.isLetterOrDigit(c)) { if (!isChinese(c)) { count = count + 1; } chLength++;  } } float result = count / chLength ; if (result > 0.4) { return true; } else { return false; } } public static String toChinese(String msg){ if(isMessyCode(msg)){ try { return new String(msg.getBytes("ISO8859-1"), "UTF-8"); } catch (Exception e) { } } return msg ;  }}