org.wltea.analyzer.core
类 CharacterUtil
java.lang.Object
org.wltea.analyzer.core.CharacterUtil
public class CharacterUtil
- extends java.lang.Object
字符集识别工具类
从类 java.lang.Object 继承的方法 |
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
CHAR_USELESS
public static final int CHAR_USELESS
- 另请参见:
- 常量字段值
CHAR_ARABIC
public static final int CHAR_ARABIC
- 另请参见:
- 常量字段值
CHAR_ENGLISH
public static final int CHAR_ENGLISH
- 另请参见:
- 常量字段值
CHAR_CHINESE
public static final int CHAR_CHINESE
- 另请参见:
- 常量字段值
CHAR_OTHER_CJK
public static final int CHAR_OTHER_CJK
- 另请参见:
- 常量字段值
CharacterUtil
public CharacterUtil()
identifyCharType
public static int identifyCharType(char input)
- 识别字符类型
- 参数:
input
-
- 返回:
- int CharacterUtil定义的字符类型常量
regularize
public static char regularize(char input)
- 进行字符规格化(全角转半角,大写转小写处理)
- 参数:
input
-
- 返回:
- char