九州大学 研究者情報
研究者情報 (研究者の方へ)入力に際してお困りですか?
基本情報 研究活動 教育活動 社会活動
鈴木 孝彦(すずき たかひこ) データ更新日:2018.08.29



主な研究テーマ
統計数値の不整合と関連テキストのテキストマイニング
キーワード:Benfordの法則 属性選択 SVM
2017.08~2019.03.
特徴語抽出による感性分析
キーワード:感性分析、特徴語、語のベクトル化
2015.05~2017.05.
日本語WordNetにおける意味構造の研究
キーワード:ワードネット オントロジー テキストマイニング
2012.05~2016.05.
混合道路交通における低速車両の移動ロケーションデータの収集と分析方法の研究
キーワード:ロケーションデータ Web マイニング 低速車両 
2012.04~2017.04.
研究業績
主要原著論文
1. Takahiko Suzuki, Sachio Hirokawa, Koki Miyata, Difficulty of Words and Their Ambiguity Estimated from the Result of Word Sense Disambiguation
, Proc. KICSS 2016, 2016.10, When learning a new word in language learning, there are two problems. One is how difficult the word itself is. The second is, in what kind of situation, it will be used. There is a research that defined quantitative ambiguity of words based on the structure of WordNet, then investigated the relationship between the ambiguity and the difficulty level of words. In this paper, we re-define ambiguity of word occurrences in text by using the result of word sense disambiguation technique. We analyze the relationship between the ambiguity of words and their difficulty level. We compare the result with those in the previous research. Utilizing knowledge and training data affect the relationship between the difficulty and ambiguity of words..
2. Takahiko Suzuki, Sachio Hirokawa, Takuya Hirao, Nao Wariishi, Kyota Hashimoto, Evaluation of Integrity of WordNet by Combining Word Similarity and Random Forest, AROB 2016, 2016.01, The Japanese WordNet contains erroneous synonyms. We have been exploiting the detection of the erroneous synonyms in the Japanese WordNet. In a previous study, it is shown that the combination of word2vec and decision tree is promising in distinguishing synonyms and related words from unrelated words. In this paper, we introduce an error detection method based on the combination of word2vec, WordNet structure and Random Forest. We discuss current results and possible improvement..
3. Takahiko Suzuki, Sachio Hirokawa, Nao Wariishi, Takuya Hirao, Vector Similarity of Related Words and Synonyms in the Japanese WordNet, Information Engineering Express, IIAI, 1, 4, 21, 2015.12, Word2vec is a tool that produces vector representations of words from a large amount of text data. In this study, we show that only a part of the vector space produced by word2vec is sufficient to represent the collective sense of a set of related words in Japanese WordNet. Furthermore, we show that there is a subspace of the vector space that does not relate to the collective sense of related words and synonyms. We construct a compact decision tree by using the vectors to distinguish whether a given word belongs to the set of related words..
4. Takahiho Suzuki, Sachio Hirokawa, Nao Wariishi, Takuya Hirao, Vector Similarity of Related Words in the Japanese WordNet, Proc ESKM 2015 IIAI, 21, 2015.08, Word2vec is a tool that produces vector representation of words from a large amount of text data. In this paper, we show that only a part of the vector space produced by word2vec is enough to represent the collective sense of a set of related words in the Japanese WordNet. Further, we will show that there is a subspace in the vector space which do not relate to the collective sense. We construct a compact decision tree by using the vectors in order to distinguish whether a given word belongs to the set of related words..
5. Takahiko Suzuki, Sachio Hirokawa, Takuya Hirao, Kohki Miyata, Detection Method for Misplacement of Synonyms in the Japanese WordNet, 15, 2, 26-35, 2014.12, Lexical database the Japanese WordNet is a useful tool in natural language processing. However, it is officially announced that the Japanese WordNet contains 5% errors. In this paper, we discuss error detection methods in the Japanese WordNet..
6. Takahiko Suzuki, Sachio Hirokawa, Takuya Hirao, Kohki Miyata, Detection of Misplacement of Synonyms in the Japanese WordNet, Proc. of International Conference on Advanced Applied Informatics (AAI 2014),
Kitakyushu, Japan, 2014.8.31-9.4.
, 2014.09, Lexical database the Japanese WordNet is a useful tool in natural language processing. However, it is officially announced that the Japanese WordNet contains 5% errors. In this paper, we classify errors in the Japanese WordNet and discuss error detection methods..
7. Takahiko Suzuki, Kohki Miyata, Sachio Hirokawa, Difficulty and Ambiguity of Verbs
-Analysis based on Synsets in Japanese WordNet-, IIAI AAI AIT 2013, 2013.12, When foreign students learn Japanese words, they encounter two types of problems. The first is the difficulty of the word itself. The second problem is related to the situation which it is used. The meaning of the word may differ as the situation changes. It is necessary to understand the ambiguity of the word. In this paper, we propose three simple formulas representing ambiguity of words based on Synset structure in Japanese WordNet. Then we analyze the relationship between the ambiguity and the difficulty of Japanese words, particularly verbs. We use vocabulary level of Japanese-Language Proficiency Test (old version) as the difficulty measure. The result shows that easy (not difficult) words are more ambiguous than difficult words..
主要学会発表等
学会活動
学会大会・会議・シンポジウム等における役割
2014.08.31~2014.09.04, IAIAI AAI ESKM 2014, 座長(Chairmanship).
2013.08.31~2014.09.04, IAIAI AAI LTLE 2013, 座長(Chairmanship).
2013.08.31~2013.09.04, IIAI ESKM 2013, スタッフ.
その他の研究活動
海外渡航状況, 海外での教育研究歴
Wisconsin University, UnitedStatesofAmerica, 2006.10~2006.10.

九大関連コンテンツ