Updated on 2024/07/28

Information

 

写真a

 
SHIMIZU TOSHIYUKI
 
Organization
Library Associate Professor
Library (Concurrent)
Data-Driven Innovation Initiative (Concurrent)
Graduate School of Integrated Frontier Sciences Department of Library Science(Concurrent)
Title
Associate Professor
Contact information
メールアドレス
Profile
He is conducting research on data and metadata management, including maintenance, sharing, and retrieval. In particular, he is working on management and retrieval of research data. Also, as one of the applications, he is working on metadata management for earth science data.
External link

Degree

  • Ph.D. in Informatics

Research History

  • 2007年4月-2008年3月 日本学術振興会特別研究員(DC2) 2008年4月-2009年3月 日本学術振興会特別研究員(PD)/アリゾナ大学客員研究員 2009年4月-2011年3月 京都大学大学院情報学研究科特定助教(産官学連携) 2011年4月-2011年7月 京都大学大学院情報学研究科特定助教 2011年7月-2021年7月 京都大学大学院情報学研究科助教   

Research Interests・Research Keywords

  • Research theme: Management and retrieval of research data

    Keyword: research data, metadata, dataset search, data distribution, retrieval support

    Research period: 2022.4

  • Research theme: Utilization of scholarly data

    Keyword: literature search, citations, papers, presentation slides

    Research period: 2012.4 - 2018.3

  • Research theme: Management of scientific metadata

    Keyword: earth science data, metadata, data release, keyword recommendation, dataset search

    Research period: 2009.4

  • Research theme: Searching on XML documents

    Keyword: XML, index, information retrieval

    Research period: 2004.4 - 2013.3

Awards

  • 若手功績賞

    2024.3   日本データベース学会  

  • 第2回データ工学と情報マネジメントに関するフォーラム(DEIM2010) 優秀インタラクティブ賞

    2010.3  

  • 第143回データベースシステム研究会(DBWS2007) 学生奨励賞

    2007.7  

  • 情報処理学会東海支部学生論文奨励賞

    2006.5  

  • 第16回データ工学ワークショップ(DEWS2005) 優秀論文賞

    2005.7  

Papers

  • 自然言語処理モデルを用いた行政文書の同一判定に係るしきい値の調査 Reviewed

    新原 俊樹, 内山 英昭, 清水 敏之, 甲斐 尚人

    日本情報ディレクトリ学会誌   22   10 - 17   2024.3

     More details

    Language:Japanese   Publishing type:Research paper (scientific journal)  

  • Does Student-Submission Allocation Affect Peer Assessment Accuracy? Reviewed

    Hideaki Ohashi, Toshiyuki Shimizu, Masatoshi Yoshikawa

    IEICE Transactions on Information and Systems   E105D ( 5 )   888 - 897   2022.5

     More details

    Language:English   Publishing type:Research paper (scientific journal)  

    Peer assessment in education has pedagogical benefits and is a promising method for grading a large number of submissions. At the same time, student reliability has been regarded as a problem; consequently, various methods of estimating highly reliable grades from scores given by multiple students have been proposed. Under most of the existing methods, a nonadaptive allocation pattern, which performs allocation in advance, is assumed. In this study, we analyze the effect of student-submission allocation on score estimation in peer assessment under a nonadaptive allocation setting. We examine three types of nonadaptive allocation methods, random allocation, circular allocation and group allocation, which are considered the commonly used approaches among the existing nonadaptive peer assessment methods. Through simulation experiments, we show that circular allocation and group allocation tend to yield lower accuracy than random allocation. Then, we utilize this result to improve the existing adaptive allocation method, which performs allocation and assessment in parallel and tends to make similar allocation result to circular allocation. We propose the method to replace part of the allocation with random allocation, and show that the method is effective through experiments.

    DOI: 10.1587/transinf.2021DAP0002

  • Adaptive Balanced Allocation for Peer Assessments Reviewed

    Hideaki OHASHI, Yasuhito ASANO, Toshiyuki SHIMIZU, Masatoshi YOSHIKAWA

    IEICE Transactions on Information and Systems   E103.D ( 5 )   939 - 948   2020.5

     More details

    Language:Others   Publishing type:Research paper (scientific journal)  

    DOI: 10.1587/transinf.2019dap0004

  • An analysis and comparison of keyword recommendation methods for scientific data Reviewed

    Youichi Ishida, Toshiyuki Shimizu, Masatoshi Yoshikawa

    International Journal on Digital Libraries   2020.2

     More details

    Language:Others   Publishing type:Research paper (scientific journal)  

    DOI: 10.1007/s00799-020-00279-3

  • Flexible and Fast Similarity Search for Enriched Trajectories Reviewed

    Hideaki Ohashi, Toshiyuki Shimizu, Masatoshi Yoshikawa

    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS   E100D ( 9 )   2081 - 2091   2017.9

     More details

    Language:English   Publishing type:Research paper (scientific journal)  

    In this study, we focus on a method to search for similar trajectories. In the majority of previous works on searching for similar trajectories, only raw trajectory data were used. However, to obtain deeper in-sights, additional time-dependent trajectory features should be utilized depending on the search intent. For instance, to identify similar combination plays in soccer games, such additional features include the movements of the team players. In this paper, we develop a framework to flexibly search for similar trajectories associated with time-dependent features, which we call enriched trajectories. In this framework, weights, which represent the relative importance of each feature, can be flexibly given by users. Moreover, to facilitate fast searching, we first propose a lower bounding measure of the DTW distance between enriched trajectories, and then we propose algorithms based on this lower bounding measure. We evaluate the effectiveness of the lower bounding measure and compare the performances of the algorithms under various conditions using soccer data and synthetic data. Our experimental results suggest that the proposed lower bounding measure is superior to the existing measure, and one of the proposed algorithms, which is based on the threshold algorithm, is suitable for practical use.

    DOI: 10.1587/transinf.2016EDP7482

  • DIASにおけるデータ公開と課題 Reviewed

    絹谷 弘子, 清水 敏之, 吉川 正俊, 喜連川 優, 小池 俊雄

    情報知識学会誌   24 ( 3 )   254 - 274   2014.10

     More details

    Language:Japanese  

    DOI: 10.2964/jsik_2014_027

  • 整合性を考慮した注釈伝播 Reviewed

    青戸 了, 清水 敏之, 吉川 正俊

    日本データベース学会論文誌   10 ( 1 )   49 - 54   2011.6

     More details

    Language:Japanese  

  • 履歴管理システムにおける多粒度アノテーション伝播 Reviewed

    青戸 了, 清水 敏之, 増田 耕一, 吉川 正俊

    日本データベース学会論文誌   9 ( 1 )   64 - 69   2010.6

     More details

    Language:Japanese  

  • Metadata management for integration and analysis of earth observation data Reviewed

    Akira Takahashi, Masashi Tatedoko, Toshiyuki Shimizu, Hiroko Kinutani, Masatoshi Yoshikawa

    Journal of Software   5 ( 2 )   168 - 178   2010.2

     More details

    Language:English   Publishing type:Research paper (scientific journal)  

    Earth observation technologies have developed rapidly during the past few decades. Substantial amounts of earth observation data have been acquired and are currently stored in the literature and databases for various research fields such as climatology, oceanography, agriculture, and ecology. Analyzing and integrating such data might produce valuable data products to promote better understanding of the global environment and to help solve global environmental issues. However, most institutions store and manage their earth observation data independently, with little metadata. Scientists have to struggle to search for valuable data from information outside their research domains and seek uses for these. This paper introduces a conceptual model of earth observation data. Utilizing a model to express earth observation items associated with ontologies, the model is a simple quintuple with information extracted from conventional data models, and it is used to uniquely determine portions of earth observation data, which enables flexible annotations to these data. We also introduce our systems to manage the metadata and user interfaces to encourage users to add annotations to earth observation data that can help scientists discover and understand useful information that can support their research. © 2010 Academy Publisher.

    DOI: 10.4304/jsw.5.2.168-178

  • XSemantic: An Extension of LCA Based XML Semantic Search Reviewed

    Umaporn Supasitthimethee, Toshiyuki Shimizu, Masatoshi Yoshikawa, Kriengkrai Porkaew

    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS   E92D ( 5 )   1079 - 1092   2009.5

     More details

    Language:English   Publishing type:Research paper (scientific journal)  

    One of the most convenient ways to query XML data is a keyword search because it does not require any knowledge of XML structure or learning a new user interface. However, the keyword search is ambiguous. The users may use different terms to search for the same information. Furthermore, it is difficult for a system to decide which node is likely to be chosen as a return node and how much information should be included in the result. To address these challenges, we propose an XML semantic search based on keywords called XSemantic. On the one hand, we give three definitions to complete in terms of semantics. Firstly, the semantic term expansion, our system is robust from the ambiguous keywords by using the domain ontology. Secondly, to return semantic meaningful answers,, we automatically infer the return information from the user queries and take advantage of the shortest path to return meaningful connections between keywords. Thirdly, we present the semantic ranking that reflects the degree of similarity as well as the semantic relationship so that the search results with the higher relevance are presented to the users first. On the other hand, in the LCA and the proximity search approaches, we investigated the problem of information included in the search results. Therefore, we introduce the notion of the Lowest Common Element Ancestor (LCEA) and define our simple rule without any requirement on the schema information such as the DTD or XML Schema. The first experiment indicated that XSemantic not only properly infers the return information but also generates compact meaningful results. Additionally, the benefits of our proposed semantics are demonstrated by the second experiment.

    DOI: 10.1587/transinf.E92.D.1079

  • XML情報検索における利得と閲覧コストに基づく検索結果の取得と評価 Reviewed

    清水 敏之, 吉川 正俊

    情報処理学会論文誌. データベース   1 ( 1 )   1 - 12   2008.6

     More details

    Language:Japanese  

  • 関係データベースを用いたXML情報検索システムの開発 Reviewed

    清水 敏之, 寺田 憲正, 吉川 正俊

    情報処理学会論文誌. データベース   48 ( 11 )   224 - 234   2007.6

     More details

    Language:Japanese  

  • XMLデータの管理とストリーム処理に関する技術 Reviewed

    清水敏之, 鬼塚真, 江田毅晴, 江田毅晴, 吉川正俊

    電子情報通信学会論文誌 D   J90-D ( 2 )   159 - 184   2007.2

     More details

    Language:Japanese  

  • Full-text and structural indexing of XML documents on B+-tree Reviewed

    T Shimizu, M Yoshikawa

    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS   E89D ( 1 )   237 - 247   2006.1

     More details

    Language:English   Publishing type:Research paper (scientific journal)  

    XML query processing is one of the most active areas of database research. Although the main focus of past research has been the processing of structural XML queries, there are growing demands for a full-text search for XML documents. In this paper, we propose XICS (XML Indices for Content and Structural search), which aims at high-speed processing of both full-text and structural queries in XML documents. An important design principle of our indices is the use of a B+-tree. To represent the structural information of XML trees, each node in the XML tree is labeled with an identifier. The identifier contains an integer number representing the path information from the root node. XICS consist of two types of indices, the COB-tree (COntent B+-tree) and the STB-tree (STructure B+-tree). The search keys of the COB-tree are a pair of text fragments in the XML document and the identifiers of the leaf nodes that contain the text, whereas the search keys of the STB-tree are the node identifiers. By using a node identifier in the search keys, we can retrieve only the entries that match the path information in the query. The STB-tree can filter nodes using structural conditions in queries, while the COB-tree can filter nodes using text conditions. We have implemented a COB-tree and an STB-tree using GiST and examined index size and query processing time. Our experimental results show the efficiency of XICS in query processing.

    DOI: 10.1093/ietisy/e89-d.1.237

▼display all

Presentations

  • 2001年PRMUアルゴリズムコンテスト「道路交通標識の認識」実施報告とその入賞アルゴリズムの紹介

    岩井 儀雄, 松村 朱里, 伊藤 陽生, 清水 敏之, 不殿 健治, 横地 裕次, 山崎 隆一, 山本 隆史, 村山 健二, 池田 聖

    電子情報通信学会技術研究報告. PRMU, パターン認識・メディア理解  2001.12 

     More details

    Language:Japanese  

    Country:Japan  

    2001 PRMU Algorithm Contest "Recognition of Traffic Signs" Summary Report and Its Prize Winning Algorithm
    The Special Interest Group on Pattern Recognition and Media Understanding (PRMU) has been promoting an annual algorithm contest on pattern recognition and media understanding since 1997. The fifth contest focuses on pattern classification and segmentation of traffic signs. The problems of the contest are divided into three levels according to difficulties of classification ; segmented CG images as level 1, segmented real images as level 2, and real scenes as level 3. The summary of the contest is reported first, and then each prize-winning algorithm is described in detail by the original prize-winner.

  • A mapping scheme of XML documents into relational databases using schema-based path identifiers

    Kei Fujimoto, Toshiyuki Shimizu, Dao Dinh Kha, Masatoshi Yoshikawa, Toshiyuki Amagasa

    International Workshop on Challenges in Web Information Retrieval and Integration (WIRI 2005)  2005.4 

     More details

    Language:English  

    Country:Japan  

    In this paper we propose a mapping scheme of XML documents into relational databases. The scheme enables us to store, retrieve and update XML documents efficiently. When storing XML documents in relational databases, XML tree structures must be preserved explicitly. To this end, a label is assigned to nodes in the XML tree. In general, document retrieval and update performance is affected by node labeling schemes. We use SPIDER (Schema based Path IDentifiER), a labeling scheme for XML documents utilizing DTDs that makes retrieval and update more efficient. SPIDER only identifies paths from root node to a node. Thus, multiple nodes appearing in the same path cannot be distinguished by only using SPIDER. We introduced Sibling Dewey Order to identify such nodes. Generally, when a new node is inserted into XML documents, some other nodes need to be relabeled to preserve the order of nodes. In our method, only Sibling Dewey Order is relabeled; SPIDER is not affected. Since the range of relabeling is small, it is possible to update documents efficiently.
    We stored documents utilizing SPIDER in a relational database and then translated various XPath expressions into SQL using SPIDER. We perform experiments and demonstrate that the proposed scheme outpeforms conventional methods both in retrieval and update.

  • Full-text and structural XML indexing on B+-tree

    Toshiyuki Shimizu, Masatoshi Yoshikawa

    16th International Conference on Database and Expert Systems Applications (DEXA 2005)  2005.8 

     More details

    Language:English  

    Country:Denmark  

    XML query processing is one of the most active areas of database research. Although the main focus of past research has been the processing of structural XML queries, there are growing demands for a full-text search for XML documents. In this paper, we propose XICS (XML Indices for Content and Structural search), novel indices built on a B+-tree, for the fast processing of queries that involve structural and full-text searches of XML documents. To represent the structural information of XML trees, each node in the XML tree is labeled with an identifier. The identifier contains an integer number representing the path information from the root node. XICS consist of two types of indices, the COB-tree (COntent B+-tree) and the STB-tree (STructure B+-tree). The search keys of the COB-tree are a pair of text fragments in the XML document and the identifiers of the leaf nodes that contain the text, whereas the search keys of the STB-tree are the node identifiers. By using a node identifier in the search keys, we can retrieve only the entries that match the path information in the query. Our experimental results show the efficiency of XICS in query processing.

  • XMLを用いた論文検索システム

    藤本圭, 杉木健二, 清水敏之, 松原茂樹, 天笠俊之, 波多野賢治, 吉川正俊

    ディジタル図書館  2005.8 

     More details

    Language:Japanese  

    Country:Japan  

  • Implementation of a high-speed and high-precision XML information retrieval system on relational databases

    Kei Fujimoto, Toshiyuki Shimizu, Norimasa Terada, Kenji Hatano, Yu Suzuki, Toshiyuki Amagasa, Hiroko Kinutani, Masatoshi Yoshikawa

    4th International Conference on Initiative for the Evaluation of XML Retrieval (INEX 2005)  2005.11 

     More details

    Language:English  

    Country:Germany  

    This paper describes an XML information retrieval system that we have developed. It is based on a vector space model, and implemented on top of XRel, a relational XML database system that has been developed in our research group. When a query is processed, a large number of fragments are retrieved, because a single XML document usually contains many XML fragments. Keeping all XML fragments degrades retrieval precision and increases query processing time, because some XML fragments are not appropriate as a query target. In existing methods, retrieval targets are manually selected by human experts when an XML collection is stored in the system. Such manual selection is not feasible when many kinds of XML documents are stored in the system. To cope with the problem we propose a method for automatically selecting document-centric fragments by introducing three measurements, namely, period ratio, number of different words, and empirical rules. By deleting inappropriate data-centric fragments from results of keyword query, we can improve the accuracy and performance of our system. Through performance evaluations, we confirmed the improvement of retrieval precision and query processing speed.

  • Synvie:ブログの仕組みを利用したマルチメディアコンテンツ配信システム

    山本大介, 清水敏之, 大平茂輝, 長尾確

    情報処理学会研究報告  2006.1 

     More details

    Language:Japanese  

    Country:Japan  

  • Synvie:ブログの仕組みを利用したマルチメディアコンテンツ配信システム

    山本大介, 清水敏之, 大平茂輝, 長尾確

    情報処理学会全国大会講演論文集  2006.3 

     More details

    Language:Japanese  

    Country:Japan  

  • 内容と構造を指定したXML文書検索

    寺田憲正, 清水敏之, 吉川正俊

    電子情報通信学会技術研究報告  2006.7 

     More details

    Language:Japanese  

    Country:Japan  

  • 内容と構造を指定したXML文書検索

    寺田憲正, 清水敏之, 吉川正俊

    情報処理学会研究報告  2006.7 

     More details

    Language:Japanese  

    Country:Japan  

  • Kikori-KS: An Effective and Efficient Keyword Search System for Digital Libraries in XML

    Toshiyuki Shimizu, Norimasa Terada, Masatoshi Yoshikawa

    9th International Conference on Asian Digital Libraries (ICADL 2006)  2006.11 

     More details

    Language:English  

    Country:Japan  

  • Development of an XML information retrieval system for queries on contents and structures

    Toshiyuki Shimizu, Norimasa Terada, Masatoshi Yoshikawa

    Second International Conference on Informatics Research for Development of Knowledge Society Infrastructure (ICKS 2007)  2007.1 

     More details

    Language:English  

    Country:Japan  

    We have developed an XML information retrieval system which can process queries by keywords or queries by combination of keywords and structural conditions. Queries by keywords are simple yet useful because users are not required to understand XML query languages or XML schema. While issuing queries by combination of keywords and structural conditions requires users to understand query languages and the underlying XML schema, we can restrict the target document fragments and the search conditions using structures in XML. The system was implemented on top of a relational XML database system developed by our group. The system can process both types of queries under a common relational schema. By carefully designing the database schema, the system handles a huge number of document fragments efficiently. For queries by keywords, we have developed a user-friendly interface for displaying search results. Our experiments using INEX test collection show that the system achieved relatively high precision and can process keyword set queries in acceptable search time.

  • XML Information Retrieval Considering Physical Page Layout of Logical Elements

    Toshiyuki Shimizu, Masatoshi Yoshikawa

    10th International Workshop on the Web and Databases (WebDB 2007)  2007.6 

     More details

    Language:English  

    Country:China  

  • XML情報検索システムの検索結果に対する入れ子を考慮した順序付け

    清水敏之, 吉川正俊

    電子情報通信学会技術研究報告  2007.6 

     More details

    Language:Japanese  

    Country:Japan  

  • A ranking scheme for XML information retrieval based on benefit and reading effort

    Toshiyuki Shimizu, Masatoshi Yoshikawa

    10th International Conference on Asian Digital Libraries (ICADL 2007)  2007.12 

     More details

    Language:English  

    Country:Viet Nam  

    XML information retrieval (XML-IR) systems search for relevant document fragments in XML documents for given queries. In top-k search, users control the size of output by an integer k. In XML-IR, however, each output element varies widely in size. Consequently, total output size of top-k elements is uncontrollable by simply giving an integer k. In addition, search results may have nesting elements. If a system orders result elements simply by their relevance, we may browse the same content more than once due to the nestings. To handle these problems, we propose a new ranking method that enables us to browse search results of XML-IR systems efficiently by introducing the concepts of benefit and reading effort. We also propose an evaluation metrics based on benefit and reading effort, and compared the metrics with existing XML-IR metrics by experiments.

  • Ranking and Presenting Search Results in an RDB-based XML Search Engine

    Kenji Hatano, Toshiyuki Shimizu, Jun Miyazaki, Yu Suzuki, Hiroko Kinutani, Masatoshi Yoshikawa

    Pre-Proceedings of INEX 2007  2007.12 

     More details

    Language:English  

    Country:Japan  

  • An XML Fragment Retrieval Method with Image and Text using Textual Information Retrieval Techniques

    Yu Suzuki, Masahiro Mitsukawa, Kenji Hatano, Toshiyuki Shimizu, Jun Miyazaki, Hiroko Kinutani

    Pre-Proceedings of INEX 2007  2007.12 

     More details

    Language:English  

    Country:Japan  

  • A new ranking scheme and result representation for XML information retrieval based on benefit and reading effort

    Masatoshi Yoshikawa, Toshiyuki Shimizu

    International Conference on Informatics Education and Research for Knowledge-Circulating Society (ICKS 2008)  2008.1 

     More details

    Language:English  

    Country:Japan  

    Elements of XML documents greatly vary in size and may nest each other. XML information. retrieval (XML-IR) systems are required to take these nature of XML documents into consideration. Because of these natures, top-k search is not suitable for XML-IR. We introduce new concepts of benefit anal reading effort of elements. By using these concepts, we propose a new dynamic ranking method that enables us to browse search results of XML-IR systems efficiently. We also study how to display the search results of XML-IR Systems. If XML document was original composed of paper image pages, such as scholarly articles or books, it would be natural to display result elements by overlaying them on the physical layout of pages in the user interfaces. We propose methods for displaying the result of XML search of scholarly articles and ranking methods based on page units.

  • Dynamic Focused Retrieval of XML Documents and Its Evaluation

    Toshiyuki Shimizu, Masatoshi Yoshikawa

    Workshop on Novel Methodologies for Evaluation in Information Retrieval  2008.3 

     More details

    Language:English  

    Country:United Kingdom  

  • An Extension of LCA based XML Keyword Search

    Umaporn Supasitthimethee, Toshiyuki Shimizu, Masatoshi Yoshikawa, Kriengkrai Porkaew

    International Workshop on Information-explosion and Next Generation Search (INGS 2008)  2008.4 

     More details

    Language:English  

    Country:China  

    One of the most convenient ways to query XML data is a keyword search because it does not require any knowledge about XML structure and without the need to learn a new user interface. However, keyword search interface is very flexible. It is hard for a system to decide which node is likely to be chosen as a return node and how much information should be included in the result. To address this challenge, we propose an extension of LCA based XML keyword search. First, to determine a return node, we provide a query syntax that the users can tell the system which node they are really interested in. In case that the users do not explicitly specify return information, our system will automatically analyze and choose appropriate return. nodes by inferring from user keywords. Second, to return a meaningful result, we investigate the problem of the return information in the LCA and the proximity search approaches. To this end, we introduce the Lowest Element Node (LEN) and define our simple rules without any requirement on the schema information such. as DTD or XML Schema. Our experiment results indicate that our system not only infers the right return nodes but also generates compact and meaningful results.

  • Development Toward Integrated Management of Earth System Data - 2. A Document Centric Metadata Registration Tool -

    Hiroko Kinutani, Masafumi Ono, Toshiyuki Shimizu, Masashi Tatedoko, Masatoshi Yoshikawa

    6th GEWEX and 2nd iLEAPS Joint Science Conferences, Melbourne, Australia, August 24-28, 2009. (poster)  2009.8 

     More details

    Language:English  

    Country:Australia  

  • Wikipediaとドメインオントロジーの統合利用による地球科学データ推薦手法

    立床雅司, 齊藤昭則, 清水敏之, 吉川正俊

    電子情報通信学会技術研究報告  2009.8 

     More details

    Language:Japanese  

    Country:Japan  

  • A Document Centric Metadata Registration Tool Constructing Earth Environmental Data Infrastructure

    Mika Ichino, Hiroko Kinutani, Masafumi Ono, Toshiyuki Shimizu, Masatoshi Yoshikawa, Kooiti Masuda, Kazuyo Fukuda, Haruko Kawamoto

    AGU Fall Meeting 2009 (Session IN23B), San Francisco, California, USA, December 14-18, 2009. (poster)  2009.12 

     More details

    Language:English  

    Country:United States  

  • Meaningful Interrelated Object Tree for XML Keyword Search

    Umaporn Supasitthimethee, Toshiyuki Shimizu, Masatoshi Yoshikawa, Kriengkrai Porkaew

    2nd International Conference on Computer and Automation Engineering (ICCAE 2010)  2010.2 

     More details

    Language:English  

    Country:Singapore  

    In the research field of XML retrieval with keyword-based approach, a variant of Lowest Common Ancestors (LCAs) have been widely accepted to provide how keywords are connected by ancestor relationship. However, returning a whole subtree or a partial subtree based on LCA nodes is insufficient for identifying how subtrees are conceptually related under different tree structure such as ID/IDREF. On the other hand, storing XML documents in the graph model can define richer relationships that the tree model cannot but the cost of enumerating result is very high. In this paper, we propose a novel Smallest Lowest Object Tree (SLOT) which keywords are connected through physical connections. In addition, to capture conceptual connections, we also propose the Smallest Interrelated Object Tree (SIOT) which extends ID/IDREF relationships based on SLOT. Finally, our experiment indicates that the proposed approach returns more effective and more semantic results for users.

  • 履歴情報管理システムのための多粒度アノテーション伝播

    青戸 了, 清水 敏之, 増田 耕一, 吉川 正俊

    第2回データ工学と情報マネジメントに関するフォーラム (DEIM 2010)  2010.3 

     More details

    Language:Others  

    Country:Japan  

  • 地球観測データに対するメタデータ登録システムの構築―データセットドキュメントメタファを利用したメタデータ登録―

    絹谷弘子, 清水敏之, 吉川正俊, 喜連川優

    情報処理学会全国大会講演論文集  2010.3 

     More details

    Language:Japanese  

    Country:Japan  

  • XMLデータに対するキーワード検索結果の分類とそれに基づく理解支援

    本村 徹太郎, 清水 敏之, 吉川 正俊

    第2回データ工学と情報マネジメントに関するフォーラム (DEIM 2010)  2010.3 

     More details

    Language:Others  

    Country:Japan  

  • A retrieval method for earth science data based on integrated use of Wikipedia and domain ontology

    Masashi Tatedoko, Toshiyuki Shimizu, Akinori Saito, Masatoshi Yoshikawa

    21st International Conference on Database and Expert Systems Applications (DEXA 2010)  2010.9 

     More details

    Language:English  

    Country:Spain  

    Due to the recent advancement in observation technologies and progress in information technologies, the total amount of earth science data has increased at an explosive pace. However, it is not easy to search and discover earth science data because earth science requires high degree of expertness. In this paper, we propose a retrieval method for earth science data which can be used by non-experts such as scientists from other field, or students interested in earth science. In order to retrieve relevant data sets from a query, which may not include technical terminologies, supplementing terms are extracted by utilizing knowledge bases
    Wikipedia and domain ontology. We evaluated our method using actual earth science data. The data, the queries, and the relevance assessments for our experiments were made by the researchers of earth science. The results of our experiments show that our method has achieved good recall and precision. © 2010 Springer-Verlag.

  • 多様な地球科学データに対する俯瞰・検索システムの開発

    清水敏之, 絹谷弘子, 吉川正俊

    電子情報通信学会技術研究報告  2010.11 

     More details

    Language:Japanese  

    Country:Japan  

  • DIASにおける多分野研究者連携による地球科学データ公開に向けた協働

    絹谷弘子, 清水敏之, 吉川正俊, 喜連川優

    電子情報通信学会技術研究報告  2010.11 

     More details

    Language:Japanese  

    Country:Japan  

  • XMLデータに対するキーワード検索結果の理解支援

    本村徹太郎, 清水敏之, 吉川正俊

    情報処理学会研究報告(CD-ROM)  2010.12 

     More details

    Language:Japanese  

    Country:Japan  

  • データ変遷において生じる注釈の不整合に関する分析

    青戸 了, 清水 敏之, 吉川 正俊

    第3回データ工学と情報マネジメントに関するフォーラム (DEIM 2011)  2011.3 

     More details

    Language:Others  

    Country:Japan  

  • キーワード検索における検索結果の拡張

    田邊 翼, 清水 敏之, 吉川 正俊

    第3回データ工学と情報マネジメントに関するフォーラム (DEIM 2011)  2011.3 

     More details

    Language:Others  

    Country:Japan  

  • 地球科学データの可視化ファイル共有システムDagik

    吉田 大紀, 齊藤 昭則, 津川 卓也, 秋谷 祐亮, 清水 敏之, 吉川 正俊

    日本地球惑星科学連合2011年大会 02033 (ACG032-17)  2011.5 

     More details

    Language:Japanese  

    Country:Japan  

  • DIASにおける地球科学データ公開

    絹谷 弘子, 清水 敏之, 吉川 正俊, 喜連川 優, 小池 俊雄

    日本地球惑星科学連合2011年大会(JpGU Meeting 2011), ACG032-P06  2011.5 

     More details

    Language:Japanese  

    Country:Japan  

  • Alternative query generation for XML keyword search and its optimization

    Tetsutaro Motomura, Toshiyuki Shimizu, Masatoshi Yoshikawa

    22nd International Conference on Database and Expert Systems Applications (DEXA 2011)  2011.8 

     More details

    Language:English  

    Country:France  

    Much work has been done in XML keyword search since users can obtain various information from XML databases without specific knowledge of the database schema and/or the knowledge about the query languages. Moreover, certain researches have suggested methods of returning some information that would help users understand search results. In this paper, we define alternative queries, which can be considered as different aspects of XML keyword search results. In XML keyword search, a keyword may match an unexpected text value or element name, then incorrect results that do not correspond to the users' search intentions may be retrieved. When we generate alternative queries, it does not seem useful to generate alternative queries for all the results since they include several results retrieved by several interpretations. Thus, we propose a method of generating alternative queries from results classified by interpretations. We also propose a stack-based algorithm for generating alternative queries. Finally, the experimental results reveal that our proposal generates alternative queries efficiently. © 2011 Springer-Verlag Berlin Heidelberg.

  • Propagation of multi-granularity annotations

    Ryo Aoto, Toshiyuki Shimizu, Masatoshi Yoshikawa

    22nd International Conference on Database and Expert Systems Applications (DEXA 2011)  2011.9 

     More details

    Language:English  

    Country:France  

    Data origin or processing information and the metadata that is useful in understanding data can be associated with data by using annotation. Provenance knowledge preserved by annotation is managed by continuously propagating the annotations through the workflow. Models for explicitly associating annotations are generally used for annotation-based provenance management, and techniques for propagating annotations have been proposed. There is also a model for implicitly associating annotations - the annotations are associated with data with arbitrary granularity by using queries. We call the implicit model "multi-granularity annotation" model. Multi-granularity annotation enables flexible association of information. However, no provenance management methods using multi-granularity annotations have been reported. We have developed a method for propagating multi-granularity annotations. We define rules for annotation propagation for each relational algebra operation, and they are used to recalculate the scopes of annotations associated with data. We also addressed the loss of information needed to preserve annotation associations during data derivation and the lack of static data annotations by extending the operations and the association method. Experiments showed that our method requires less space usage and execution time than conventional annotation management methods. © 2011 Springer-Verlag Berlin Heidelberg.

  • Development of a data browsing system for geoscience data using geobrowsers

    Akinori Saito, Takuya Tsugawa, Yusuke Akiya, Toshiyuki Shimizu, Masatoshi Yoshikawa

    The 1st ICSU World Data System Conference, PS6-11  2011.9 

     More details

    Language:English  

    Country:Japan  

    Development of a data browsing system for geoscience data using geobrowsers

  • 重要度と時空間近接度を統合した地球科学データのランキング手法

    冨田典也, 清水敏之, 齊藤昭則, 吉川正俊

    情報処理学会シンポジウムシリーズ(CD-ROM)  2011.10 

     More details

    Language:Japanese  

    Country:Japan  

    重要度と時空間近接度を統合した地球科学データのランキング手法

  • 地球科学データと学術論文の関連付け

    梅原 佳宏, 清水 敏之, 吉川 正俊

    第4回データ工学と情報マネジメントに関するフォーラム (DEIM 2012)  2012.3 

     More details

    Language:Others  

    Country:Japan  

  • XMLキーワード検索における要素の特性を考慮した検索結果の構築

    田邊 翼, 清水 敏之, 吉川 正俊

    第4回データ工学と情報マネジメントに関するフォーラム (DEIM 2012)  2012.3 

     More details

    Language:Others  

    Country:Japan  

  • キーワードの役割と要素の特性を考慮したXML検索

    田邊翼, 清水敏之, 吉川正俊

    情報処理学会シンポジウムシリーズ(CD-ROM)  2012.11 

     More details

    Language:Japanese  

    Country:Japan  

  • Effective keyword-based XML retrieval using the data-centric and document-centric features

    Tsubasa Tanabe, Toshiyuki Shimizu, Masatoshi Yoshikawa

    8th Asia Information Retrieval Societies Conference (AIRS 2012)  2012.12 

     More details

    Language:English  

    Country:China  

    Extensible Markup Language (XML) is used for not only describing structured documents but also for describing data just for generating XML from relational data. The former is called document-centric XML, and the latter is called data-centric XML. From studies on retrieving data-centric XML by using keyword searches, methods based on LCA have been proposed, while from studies on retrieving document-centric XML, methods based on information retrieval that focus on the granularity of XML elements have been proposed. However, documents generally have both data-centric and document-centric elements, so there are cases in which desired results cannot be returned by using existing research. We propose a method for constructing suitable search results for XML documents that include both data-centric and document-centric elements by considering a user's query intention and element features (data-centric or document-centric). Our experiments show that both data-centric and document-centric elements need to be considered for actual XML documents. © Springer-Verlag 2012.

  • 論文とスライドの部分対応付け手法

    西田 惇人, 清水 敏之, 吉川 正俊

    第5回データ工学と情報マネジメントに関するフォーラム (DEIM 2013)  2013.3 

     More details

    Language:Others  

    Country:Japan  

  • 構造を利用した文書間の細粒度対応付け手法

    辻尾 尚樹, 清水 敏之, 吉川 正俊

    第5回データ工学と情報マネジメントに関するフォーラム (DEIM 2013)  2013.3 

     More details

    Language:Others  

    Country:Japan  

  • 地球科学データと学術論文の対応付け獲得に向けた事例調査

    梅原 佳宏, 清水 敏之, 吉川 正俊

    第5回データ工学と情報マネジメントに関するフォーラム (DEIM 2013)  2013.3 

     More details

    Language:Others  

    Country:Japan  

  • プレゼンテーションスライドからの構成抽出

    坂本 祥之, 清水 敏之, 吉川 正俊

    第5回データ工学と情報マネジメントに関するフォーラム (DEIM 2013)  2013.3 

     More details

    Language:Others  

    Country:Japan  

  • Supporting Keyword Selection in Generating Earth Science Metadata

    Toshiyuki Shimizu, Tomo Sueki, Masatoshi Yoshikawa

    37th Annual IEEE Computer Software and Applications Conference (COMPSAC 2013)  2013.7 

     More details

    Language:English  

    Country:Japan  

    Vast amounts of earth science data have been stored and managed in various projects. Because earth science requires considerable experience and expertise, we generally generate metadata for earth science datasets. In such metadata, keyword information is important. Controlled keywords such as GCMD Science Keyword are used for metadata. However, selection of suitable keywords is not easy. We propose methods to suggest keywords from summary texts of datasets. We considered simple string matching and inferring by Labeled LDA, and apply them to the actual earth science metadata.

  • 論文とプレゼンテーションスライドの部分対応付けにおける粒度の検証

    川上 優平, 清水 敏之, 吉川 正俊

    第6回データ工学と情報マネジメントに関するフォーラム (DEIM 2014)  2014.3 

     More details

    Language:Others  

    Country:Japan  

  • 地球科学データに対するタグと検索語の推薦手法

    石田 陽一, 清水 敏之, 吉川 正俊

    第6回データ工学と情報マネジメントに関するフォーラム (DEIM 2014)  2014.3 

     More details

    Language:Others  

    Country:Japan  

  • 共引用テキストを利用した論文間の関係抽出

    正元 修平, 清水 敏之, 難波 英嗣, 吉川 正俊

    第6回データ工学と情報マネジメントに関するフォーラム (DEIM 2014)  2014.3 

     More details

    Language:Others  

    Country:Japan  

  • プレゼンテーションスライドに対する分割手法の提案と講義スライドへの応用

    坂本 祥之, 清水 敏之, 吉川 正俊

    第6回データ工学と情報マネジメントに関するフォーラム (DEIM 2014)  2014.3 

     More details

    Language:Others  

    Country:Japan  

  • Semantic Knowledge Base Construction from Domain-Specified Metadata

    Jiyi Li, Toshiyuki Shimizu, Masatoshi Yoshikawa

    第6回データ工学と情報マネジメントに関するフォーラム (DEIM 2014)  2014.3 

     More details

    Language:Others  

    Country:Japan  

    Semantic Knowledge Base Construction from Domain-Specified Metadata

  • データ提供者のポリシーを考慮したDIASによる地球観測事業データプロダクト公開

    絹谷弘子, 中原陽子, Jiyi Li, 清水敏之, 吉川 正俊, 喜連川 優, 小池 俊雄

    日本地球惑星科学連合2014年大会(JpGU 2014)  2014.5 

     More details

    Language:Others  

    Country:Japan  

  • Development of DIAS Metadata System

    Masatoshi Yoshikawa, Toshiyuki Shimizu, Jiyi LI, Yoko Nakahara, Hiroko Kinutani

    日本地球惑星科学連合2014年大会(JpGU 2014)  2014.5 

     More details

    Language:Others  

    Country:Japan  

  • DIAS data release and the cooperation among data centers by metadata

    Hiroko Kinutani, Toshiyuki Shimizu, Jiyi Li, Masatoshi Yoshikawa

    The 11th Annual Meeting of Asia Oceania Geosciences Society (AOGS), IG32-ST31-AS53-A011  2014.8 

     More details

    Language:Others  

    Country:Japan  

  • A Method for Fine-Grained Document Alignment Using Structural Information

    Naoki Tsujio, Toshiyuki Shimizu, Masatoshi Yoshikawa

    16th Asia-Pacific Web Conference (APWeb 2014)  2014.9 

     More details

    Language:English  

    Country:China  

    It is useful to understand the corresponding relationships between each part of related documents, such as a conference paper and its modified version published as a journal paper, or documents in different versions. However, it is hard to associate corresponding parts which have been heavily modified only using similarity in their content. We propose a method of aligning documents considering not only content information but also structural information in documents. Our method consists of three steps; baseline alignment considering document order, merging, and swapping. We used papers which have been presented at a domestic conference and an international conference, then obtained their alignments by using several methods in our evaluation experiments. The results revealed the effectiveness of the use of document structures.

  • Axis-Based Alignment of Scholarly Papers and Its Presentation Slides Considering Document Structure

    Yuhei Kawakami, Atsuto Nishida, Toshiyuki Shimizu, Masatoshi Yoshikawa

    16th International Conference on Asia-Pacific Digital Libraries (ICADL 2014)  2014.11 

     More details

    Language:English  

    Country:Thailand  

    Recently, most researchers make a presentation with presentation slides to introduce a paper in academic conferences. We can often retrieve and browse papers and presentation slides through websites. We consider that we can obtain information efficiently by using both of them, and we propose a method to align papers and its presentation slides at the fine granularity. Though there are some existing works on this alignment, our system tried to achieve better accuracy for this problem by proposing the two approaches: 1) the adjustment by axis alignments; and 2) two-step alignment. The content similarity between each slide and paragraph is unstable due to the small amount of texts in the slides. Therefore we also calculate the content similarity in section-level and consider the ancillary alignment in section-level. Also, we succeed to obtain better alignments by adjusting the alignment score or narrowing down the alignment candidates. Finally, for each slide, we calculate the scores of each paragraph, and determine the alignment according to the scores by associating the slide to a sequence of paragraphs. We created a small dataset manually and conducted an experiment to confirm the effectiveness of the proposed method.

  • 文献検索エンジン結果ページへの引用関係に基づくグラフの付与

    正元修平, 清水敏之, 吉川正俊

    情報処理学会シンポジウムシリーズ(CD-ROM)  2014.11 

     More details

    Language:Japanese  

    Country:Japan  

  • 地球科学データに対するキーワード推薦手法

    石田 陽一, 清水 敏之, 吉川正俊

    ARG 第5回Webインテリジェンスとインタラクション研究会 (WI2研究会)  2014.11 

     More details

    Language:Others  

    Country:Japan  

  • 講義プレゼンテーションスライド部分対応付けを用いた学習支援

    坂本祥之, 清水敏之, 吉川正俊

    情報処理学会全国大会講演論文集  2015.3 

     More details

    Language:Japanese  

    Country:Japan  

  • 論文とプレゼンテーションスライドの部分対応付けにおける軸対応補正の適用

    川上優平, 清水敏之, 吉川正俊

    情報処理学会全国大会講演論文集  2015.3 

     More details

    Language:Japanese  

    Country:Japan  

  • 引用関係に基づくグラフの付与による文献検索エンジンの高機能化

    正元修平, 清水敏之, 吉川正俊

    情報処理学会全国大会講演論文集  2015.3 

     More details

    Language:Japanese  

    Country:Japan  

  • 分野特徴語を利用した地球科学データに対するキーワード推薦手法

    石田陽一, 清水敏之, 吉川正俊

    情報処理学会全国大会講演論文集  2015.3 

     More details

    Language:Japanese  

    Country:Japan  

  • Improving Document Similarity Computation by Model Performance Prediction

    Jiyi Li, Toshiyuki Shimizu, Yasuhito Asano, Masatoshi Yoshikawa

    第7回データ工学と情報マネジメントに関するフォーラム (DEIM 2015)  2015.3 

     More details

    Language:Others  

    Country:Japan  

  • Document Similarity Computation by Combining Multiple Representation Models

    Jiyi Li, Toshiyuki Shimizu, Masatoshi Yoshikawa

    16th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD 2015)  2015.6 

     More details

    Language:English  

    Country:Japan  

    Evaluating semantic similarity of text document pairs is an active research topic. Various models of document representation have been proposed. Each kind of representation model concentrates on a different kind of information from other kind of models. However, it is difficult for a single model to perform well in all scenarios because of the variety of textual documents. Leveraging these models to complement each other is possible to improve the performance. In this paper, we first make an analysis on the relations among document semantic similarity, human ratings and model performance. Based on the observations, we propose a rational solution of selecting different representation models and fusing the results of these models to compute document similarity for a given document collection. We leverage the performance and relations of different models to select proper models. Our fusion approach proposes a regression function with both nonlinear and linear factors and dynamic weights based on the similarities by various models. We report the effectiveness of our work based on a rated news document collection. The particular version of our general approach for this collection can integrate the information from both brief entity knowledge and detailed word content.

  • A dynamic-static approach of model fusion for document similarity computation

    Jiyi Li, Yasuhito Asano, Toshiyuki Shimizu, Masatoshi Yoshikawa

    16th International Conference on Web Information Systems Engineering (WISE 2015)  2015.11 

     More details

    Language:English  

    Country:United States  

    The semantic similarity of text document pairs can be used for valuable applications. There are various existing basic models proposed for representing document content and computing document similarity. Each basic model performs difference in different scenarios. Existing model selection or fusion approaches generate improved models based on these basic models on the granularity of document collection. These improved models are static for all document pairs and may be only proper for some of the document pairs. We propose a dynamic idea of model fusion, and an approach based on a Dynamic-Static Fusion Model (DSFM) on the granularity of document pairs, which is dynamic for each document pair. The dynamic module in DSFM learns to rank the basic models to predict the best basic model for a given document pair. We propose a model categorization method to construct ideal model labels of document pairs for learning in this dynamic module. The static module in DSFM is based on linear regression. We also propose a model selection method to select appropriate candidate basic models for fusion and improve the performance. The experiments on public document collections which contain paragraph pairs and sentence pairs with human-rated similarity illustrate the effectiveness of our approach.

  • Enrichment of Academic Search Engine Results Pages by Citation-Based Graphs

    Shuhei Shogen, Toshiyuki Shimizu, Masatoshi Yoshikawa

    11th Asia Information Retrieval Societies Conference (AIRS 2015)  2015.12 

     More details

    Language:English  

    Country:Australia  

    Researchers' readings of academic papers make their research more sophisticated and objective. In this paper, we describe a method of supporting scholarly surveys by incorporating a graph based on citation relationships into the results page of an academic search engine. Conventional academic search engines have a problem in that users have difficulty in determining which academic papers are relevant to their needs because it is hard to understand the relationship between the academic papers that appear in the search results pages. Our method helps users to make judgments about the relevance of papers by clearly visualizing the relationship. It visualizes not only academic papers on the results page but also papers that have a strong citation relationship with them. We carefully considered the method of visualization and implemented a prototype with which we conducted a user study simulating scholarly surveys. We confirmed that our method improved the efficiency of scholarly surveys through the user study.

  • A Keyword Recommendation Method Using CorKeD Words and Its Application to Earth Science Data

    Youichi Ishida, Toshiyuki Shimizu, Masatoshi Yoshikawa

    11th Asia Information Retrieval Societies Conference (AIRS 2015)  2015.12 

     More details

    Language:English  

    Country:Australia  

    In various research domains, data providers themselves annotate their own data with keywords from a controlled vocabulary. However, since selecting keywords requires extensive knowledge of the domain and the controlled vocabulary, even data providers have difficulty in selecting appropriate keywords from the vocabulary. Therefore, we propose a method for recommending relevant keywords in a controlled vocabulary to data providers. We focus on a keyword definition, and calculate the similarity between an abstract text of data and the keyword definition. Moreover, considering that there are unnecessary words in the calculation, we extract CorKeD (Corpus-based Keyword Decisive) words from a target domain corpus so that we can measure the similarity appropriately. We conduct an experiment on earth science data, and verify the effectiveness of extracting the CorKeD words, which are the terms that better characterize the domain.

  • 動的属性付き軌跡データの類似検索

    大橋 英明, 清水 敏之, 吉川 正俊

    第8回データ工学と情報マネジメントに関するフォーラム (DEIM 2016)  2016.3 

     More details

    Language:Others  

    Country:Japan  

  • 共引用情報を利用した論文関係グラフの提示

    中野 優, 清水 敏之, 吉川 正俊

    第8回データ工学と情報マネジメントに関するフォーラム (DEIM 2016)  2016.3 

     More details

    Language:Others  

    Country:Japan  

  • NEXTI at NTCIR-12 IMine-2 Task

    Hidetsugu Nanba, Tetsuya Sakai, Noriko Kando, Atsushi Keyaki, Koji Eguchi, Kenji Hatano, Toshiyuki Shimizu, Yu Hirate, Atsushi Fujii

    12th NTCIR Conference on Evaluation of Information Access Technologies  2016.6 

     More details

    Language:English  

    Country:Japan  

  • Flexible Similarity Search for Enriched Trajectories

    Hideaki Ohashi, Toshiyuki Shimizu, Masatoshi Yoshikawa

    11th International Workshop on Spatial and Spatiotemporal Data Mining (SSTDM 2016)  2016.12 

     More details

    Language:English  

    Country:Spain  

    In this study, we focus on a method of searching for similar trajectories. In most previous works on searching for similar trajectories, only raw trajectory data have been used. However, to obtain deeper insights, additional time-dependent trajectory features should be utilized depending on the search intent. For instance, to identify soccer players who have similar dribbling patterns, such additional features include the correlations between players' speeds and directions. In addition, when finding similar combination plays, the additional features include the team players' movements. In this paper, we develop a framework to flexibly search for similar trajectories associated with time-dependent features, called enriched trajectories. In this framework, weights, which represent the relative importance of each feature, can be flexibly input. Moreover, to facilitate fast searching, we propose a lower bounding measure of the DTW distance between enriched trajectories. We evaluate the effectiveness of the lower bounding measure using soccer data and synthetic data. Our experimental results suggest that the proposed lower bounding measure is superior to the existing measure and works very well.

  • A Visualization of Relationships Among Papers Using Citation and Co-citation Information

    Yu Nakano, Toshiyuki Shimizu, Masatoshi Yoshikawa

    18th International Conference on Asia-Pacific Digital Libraries (ICADL 2016)  2016.12 

     More details

    Language:English  

    Country:Japan  

    When we conduct scholarly surveys, we occasionally encounter difficulties in grasping the vast amount of related papers. Because academic papers have relationships, such as citing and cited relationships, we considered utilizing them for supporting scholarly surveys. In this paper, we propose a method for visualizing relationships among papers, and we construct paper graphs using two types of relationships, namely, citation and co-citation. Moreover, we quantify the strengths of citations and co-citations based on their frequency and the positions of co-citations, and show both types of relationships together in a graph. We constructed paper graphs using papers in the database field and discussed their usefulness.

  • 多変量拡張軌跡の時空間ペナルティを考慮した類似検索

    大橋 英明, 清水 敏之, 吉川 正俊

    第9回データ工学と情報マネジメントに関するフォーラム (DEIM 2017)  2017.3 

     More details

    Language:Others  

    Country:Japan  

  • 基準例を用いた主観評価からのバイアス軽減に向けて

    大橋英明, 清水敏之, 吉川正俊

    情報処理学会関西支部支部大会講演論文集(CD-ROM)  2017.9 

     More details

    Language:Japanese  

    Country:Japan  

  • サーベイ論文の節構造を用いた論文に対するタグ付けの検討

    中野優, 清水敏之, 吉川正俊

    電子情報通信学会技術研究報告  2017.9 

     More details

    Language:Japanese  

    Country:Japan  

  • A View-based Programmable Architecture for Controlling and Integrating Decentralized Data.

    Yasuhito Asano, Soichiro Hidaka, Zhenjiang Hu, Yasunori Ishihara, Hiroyuki Kato, Hsiang-Shang Ko, Keisuke Nakano, Makoto Onizuka, Yuya Sasaki, Toshiyuki Shimizu, Kanae Tsushima, Masatoshi Yoshikawa

    CoRR  2018.3 

     More details

    Language:English  

    Country:Japan  

  • MOOCにおける相互評価のための適応的な割り当て手法

    大橋 英明, 清水 敏之, 吉川 正俊

    第10回データ工学と情報マネジメントに関するフォーラム (DEIM 2018)  2018.3 

     More details

    Language:Others  

    Country:Japan  

  • 統制語彙における階層構造を考慮した地球科学データに対するキーワード推薦

    城戸 譲次, 清水 敏之, 吉川 正俊

    第11回データ工学と情報マネジメントに関するフォーラム (DEIM 2019)  2019.3 

     More details

    Language:Others  

    Country:Japan  

  • ビューに基づくデータクリーニング方式の提案

    大森 弘樹, 清水 敏之, 吉川 正俊

    第11回データ工学と情報マネジメントに関するフォーラム (DEIM 2019)  2019.3 

     More details

    Language:Others  

    Country:Japan  

  • KL情報量を用いた事後正則化の検討とクラウドソーシングへの応用

    大橋 英明, 清水 敏之, 吉川 正俊

    第11回データ工学と情報マネジメントに関するフォーラム (DEIM 2019)  2019.3 

     More details

    Language:Others  

    Country:Japan  

  • Give and Take: Adaptive Balanced Allocation for Peer Assessments.

    Hideaki Ohashi, Yasuhito Asano, Toshiyuki Shimizu, Masatoshi Yoshikawa

    25th International Computing and Combinatorics Conference (COCOON 2019)  2019.7 

     More details

    Language:English  

    Country:China  

  • Toward a view-based data cleaning architecture.

    Toshiyuki Shimizu, Hiroki Omori, Masatoshi Yoshikawa

    CoRR  2019.10 

     More details

    Language:English  

    Country:Japan  

  • 科学メタデータクリーニングのための自律分散型データ共有・更新システムに向けて

    清水 敏之, 加藤 弘之, 吉川 正俊

    第12回データ工学と情報マネジメントに関するフォーラム (DEIM 2020)  2020.3 

     More details

    Language:Others  

    Country:Japan  

  • エンティティ解決手法を応用したデータクリーニングのための不整合検出

    大森 弘樹, 清水 敏之, 吉川 正俊

    第12回データ工学と情報マネジメントに関するフォーラム (DEIM 2020)  2020.3 

     More details

    Language:Others  

    Country:Japan  

  • オンラインDEIM2020開催の試み

    宮崎 純, 合田 和生, 大塚 真吾, 横山 昌平, 吉田 尚史, 北山 大輔, 清水 敏之, 戸田 浩之, 山口 実靖

    情報・システムソサイエティ誌  2020.5 

     More details

    Language:Japanese  

    Country:Japan  

  • Keyword Recommendation Methods for Earth Science Data Considering Hierarchical Structure of Vocabularies.

    Toshiyuki Shimizu, Joji Kido, Masatoshi Yoshikawa

    ACM/IEEE Joint Conference on Digital Libraries (JCDL 2020)  2020.8 

     More details

    Language:English  

    Country:China  

  • 自律分散型データ共有更新システムにおける共有ポリシー実装のためのアーキテクチャの検討

    沖野 雄哉, 若林 勇弥, 清水 敏之, 加藤 弘之, 吉川 正俊

    第13回データ工学と情報マネジメントに関するフォーラム (DEIM 2021)  2021.3 

     More details

    Language:Others  

    Country:Japan  

  • 自律分散型データ共有・更新システムにおける共有レポジトリの実装について

    若林 勇弥, 沖野 雄哉, 清水 敏之, 加藤 弘之, 吉川 正俊

    第13回データ工学と情報マネジメントに関するフォーラム (DEIM 2021)  2021.3 

     More details

    Language:Others  

    Country:Japan  

  • 制約脆弱データに対するデータクリーニングのための不整合候補検出

    大森 弘樹, 清水 敏之, 吉川 正俊

    第13回データ工学と情報マネジメントに関するフォーラム (DEIM 2021)  2021.3 

     More details

    Language:Others  

    Country:Japan  

  • Bidirectional Collaborative Frameworks for Decentralized Data Management International conference

    Yasuhito Asano, Yang Cao, Soichiro Hidaka, Zhenjiang Hu, Yasunori Ishihara, Hiroyuki Kato, Keisuke Nakano, Makoto Onizuka, Yuya Sasaki, Toshiyuki Shimizu, Masato Takeichi, Chuan Xiao, Masatoshi Yoshikawa

    Communications in Computer and Information Science  2022.1 

     More details

    Language:English  

    Country:Denmark  

    Along with the continuous evolution of data management systems for the new market requirements, we are moving from centralized systems towards decentralized systems, where data are maintained in different sites with autonomous storage and computation capabilities. There are two fundamental issues with such decentralized systems: local privacy and global consistency. By local privacy, the data owner wishes to control what information should be exposed and how it should be used or updated by other peers. By global consistency, the systems wish to have a globally consistent and integrated view of all data. In this paper, we report the progress of our BISCUITS (Bidirectional Information Systems for Collaborative, Updatable, Interoperable, and Trusted Sharing) project that attempts to systematically solve these two issues in distributed systems. We present a new bidirectional transformation-based approach to control and share distributed data, propose several distributed architectures for data integration via bidirectional updatable views, and demonstrate the applications of these architectures in ride-sharing alliances and gig job sites.

  • 科学メタデータにおけるデータクリーニングのための不整合検出の効率化

    中林 和也, 清水 敏之, 大手 信人

    第14回データ工学と情報マネジメントに関するフォーラム (DEIM 2022)  2022.3 

     More details

    Language:Others  

    Country:Japan  

  • Dejimaを用いた自律分散型データ共有・更新システムの実装

    清水 敏之, 加藤 弘之, 吉川 正俊

    第14回データ工学と情報マネジメントに関するフォーラム (DEIM 2022)  2022.3 

     More details

    Language:Others  

    Country:Japan  

  • Mahalo Button: Building the Network of Gratitude for Sharing the Dataset Usage

    Asanobu Kitamoto, Yoko Nakahara, Hiroyuki Shimai, Toshiyuki Shimizu

    日本地球惑星科学連合2022年大会(JpGU 2022)  2022.5 

     More details

    Language:Others  

    Country:Japan  

  • データの利用成果を収集・共有するには?―Mahalo Buttonの活用事例

    北本 朝展, 中原 陽子, 清水 敏之, 島井 博行, 吉川 正俊

    研究データ利活用協議会(RDUF)公開シンポジウム  2022.11 

     More details

    Language:Others  

    Country:Other  

  • 論文のどこに,どんな形式でデータが引用されているか?ーDIASでの実態調査

    中原 陽子, 北本 朝展, 清水 敏之, 島井 博行, 吉川 正俊

    研究データ利活用協議会(RDUF)公開シンポジウム  2022.11 

     More details

    Language:Others  

    Country:Other  

  • Data Citation and Mahalo Button: Collecting and Sharing Dataset Usage in DIAS

    Asanobu Kitamoto, Yoko Nakahara, Toshiyuki Shimizu, Hiroyuki Shimai, Masatoshi Yoshikawa

    日本地球惑星科学連合2023年大会(JpGU 2023)  2023.5 

     More details

    Language:Others  

    Country:Japan  

  • Investigation of ChatGPT Use in Research Data Retrieval International conference

    Motokazu Yamasaki, Yoichi Tomiura, Toshiyuki Shimizu

    25th International Conference on Asia-Pacific Digital Libraries (ICADL 2023)  2023.12 

     More details

    Language:English  

    Country:Taiwan, Province of China  

  • 流通を考慮した研究データのメタデータ付与支援

    篠原 桐, 清水 敏之, 冨浦 洋一

    第16回データ工学と情報マネジメントに関するフォーラム (DEIM 2024)  2024.2 

     More details

    Language:Others  

    Country:Japan  

  • 研究内容からの研究データ検索

    山﨑 基数, 冨浦 洋一, 清水 敏之

    第16回データ工学と情報マネジメントに関するフォーラム (DEIM 2024)  2024.2 

     More details

    Language:Others  

    Country:Japan  

  • 論文調査の支援に向けた研究 —最新の読むべき論文の判別方法—

    佐々木 陸, 清水 敏之, 冨浦 洋一

    第16回データ工学と情報マネジメントに関するフォーラム (DEIM 2024)  2024.3 

     More details

    Language:Others  

    Country:Japan  

▼display all

Professional Memberships

  • Database Society of Japan

  • Institute of Electronics, Information and Communication Engineers

  • Information Processing Society of Japan

  • ACM

Committee Memberships

  • 情報処理学会データベースシステム研究会   Organizer   Domestic

    2022.4 - 2025.3   

  • 電子情報通信学会データ工学研究専門委員会   専門委員   Domestic

    2019.6 - 2025.6   

  • 情報処理学会関西支部   Organizer   Domestic

    2017.5 - 2019.5   

  • 情報処理学会データベースシステム研究会   Steering committee member   Domestic

    2011.4 - 2015.3   

Academic Activities

  • トラックチェア

    第16回データ工学と情報マネジメントに関するフォーラム(DEIM 2024)  ( Japan ) 2024.2 - 2024.3

     More details

    Type:Competition, symposium, etc. 

  • 幹事(DBSリエゾン)

    第16回データ工学と情報マネジメントに関するフォーラム(DEIM 2024)  ( Japan ) 2024.2 - 2024.3

     More details

    Type:Competition, symposium, etc. 

  • Local Co-chair International contribution

    The 18th Korea-Japan (Japan-Korea) Database Workshop (KJDB 2023)  ( Japan ) 2023.10

     More details

    Type:Competition, symposium, etc. 

  • 幹事(DBSリエゾン)

    第15回データ工学と情報マネジメントに関するフォーラム(DEIM 2023)  ( Japan ) 2023.3

     More details

    Type:Competition, symposium, etc. 

  • Screening of academic papers

    Role(s): Peer review

    2023

     More details

    Type:Peer review 

    Proceedings of International Conference Number of peer-reviewed papers:10

  • プログラム編集委員

    第14回データ工学と情報マネジメントに関するフォーラム(DEIM 2022)  ( Japan ) 2022.2 - 2022.3

     More details

    Type:Competition, symposium, etc. 

  • Screening of academic papers

    Role(s): Peer review

    2022

     More details

    Type:Peer review 

    Number of peer-reviewed articles in Japanese journals:2

    Proceedings of International Conference Number of peer-reviewed papers:1

  • 情報処理学会論文誌

    2021.6 - Present

     More details

    Type:Academic society, research group, etc. 

  • ローカル共同委員長

    第13回データ工学と情報マネジメントに関するフォーラム(DEIM 2021)  ( Japan ) 2021.3

     More details

    Type:Competition, symposium, etc. 

  • Screening of academic papers

    Role(s): Peer review

    2021

     More details

    Type:Peer review 

    Number of peer-reviewed articles in Japanese journals:1

    Proceedings of International Conference Number of peer-reviewed papers:2

  • ローカル共同委員長

    第12回データ工学と情報マネジメントに関するフォーラム(DEIM 2020)  ( Japan ) 2020.3

     More details

    Type:Competition, symposium, etc. 

  • Screening of academic papers

    Role(s): Peer review

    2020

     More details

    Type:Peer review 

    Number of peer-reviewed articles in foreign language journals:1

    Proceedings of International Conference Number of peer-reviewed papers:2

  • Registration Chair International contribution

    The 6th IEEE International Conference on Big Data and Smart Computing (IEEE BigComp 2019)  ( Japan ) 2019.2 - 2019.3

     More details

    Type:Competition, symposium, etc. 

  • Screening of academic papers

    Role(s): Peer review

    2019

     More details

    Type:Peer review 

    Number of peer-reviewed articles in Japanese journals:1

    Proceedings of International Conference Number of peer-reviewed papers:3

  • 運営委員

    第9回ソーシャルコンピューティングシンポジウム (SoC 2018)  ( Japan ) 2018.6

     More details

    Type:Competition, symposium, etc. 

  • Screening of academic papers

    Role(s): Peer review

    2018

     More details

    Type:Peer review 

    Number of peer-reviewed articles in foreign language journals:1

    Number of peer-reviewed articles in Japanese journals:3

    Proceedings of International Conference Number of peer-reviewed papers:1

  • プログラム委員(表彰担当)

    第9回データ工学と情報マネジメントに関するフォーラム(DEIM 2017)  ( Japan ) 2017.3

     More details

    Type:Competition, symposium, etc. 

  • Screening of academic papers

    Role(s): Peer review

    2017

     More details

    Type:Peer review 

    Number of peer-reviewed articles in foreign language journals:1

    Number of peer-reviewed articles in Japanese journals:1

  • 電子情報通信学会和文論文誌D データ工学と情報マネジメント特集

    2016.6 - 2021.5

     More details

    Type:Academic society, research group, etc. 

  • プログラム委員(表彰担当)

    第8回データ工学と情報マネジメントに関するフォーラム(DEIM 2016)  ( Japan ) 2016.2 - 2016.3

     More details

    Type:Competition, symposium, etc. 

  • Screening of academic papers

    Role(s): Peer review

    2016

     More details

    Type:Peer review 

    Number of peer-reviewed articles in Japanese journals:2

    Proceedings of International Conference Number of peer-reviewed papers:3

  • International Relationship Co-Chair International contribution

    The 7th International Workshop with Mentors on Databases, Web and Information Management for Young Researchers (iDB Workshop 2015)  ( Japan ) 2015.8

     More details

    Type:Competition, symposium, etc. 

  • 実行委員

    情報処理学会第77回全国大会  ( Japan ) 2015.3

     More details

    Type:Competition, symposium, etc. 

  • Screening of academic papers

    Role(s): Peer review

    2015

     More details

    Type:Peer review 

    Number of peer-reviewed articles in foreign language journals:1

    Number of peer-reviewed articles in Japanese journals:1

    Proceedings of domestic conference Number of peer-reviewed papers:1

  • International Relationship Chair International contribution

    The 6th International Workshop with Mentors on Databases, Web and Information Management for Young Researchers (iDB Workshop 2014)  ( Japan ) 2014.7 - 2014.8

     More details

    Type:Competition, symposium, etc. 

  • Screening of academic papers

    Role(s): Peer review

    2014

     More details

    Type:Peer review 

    Number of peer-reviewed articles in foreign language journals:1

    Number of peer-reviewed articles in Japanese journals:2

  • 実行委員(ローカルアレンジメント担当)

    第6回Webとデータベースに関するフォーラム(WebDB Forum 2013)  ( Japan ) 2013.11

     More details

    Type:Competition, symposium, etc. 

  • Local Arrangement Co-Chair International contribution

    The 5th International Conference on Social Informatics (SocInfo 2013)  ( Japan ) 2013.11

     More details

    Type:Competition, symposium, etc. 

  • 情報処理学会論文誌:データベース

    2013.4 - 2017.3

     More details

    Type:Academic society, research group, etc. 

  • Screening of academic papers

    Role(s): Peer review

    2013

     More details

    Type:Peer review 

    Proceedings of International Conference Number of peer-reviewed papers:1

  • 実行委員(出版担当)

    第5回Webとデータベースに関するフォーラム(WebDB Forum 2012)  ( Japan ) 2012.11

     More details

    Type:Competition, symposium, etc. 

  • Screening of academic papers

    Role(s): Peer review

    2012

     More details

    Type:Peer review 

    Number of peer-reviewed articles in foreign language journals:1

  • Screening of academic papers

    Role(s): Peer review

    2011

     More details

    Type:Peer review 

    Number of peer-reviewed articles in foreign language journals:2

    Number of peer-reviewed articles in Japanese journals:2

    Proceedings of International Conference Number of peer-reviewed papers:2

    Proceedings of domestic conference Number of peer-reviewed papers:2

  • Screening of academic papers

    Role(s): Peer review

    2010

     More details

    Type:Peer review 

    Number of peer-reviewed articles in foreign language journals:2

    Number of peer-reviewed articles in Japanese journals:2

    Proceedings of International Conference Number of peer-reviewed papers:1

  • Screening of academic papers

    Role(s): Peer review

    2009

     More details

    Type:Peer review 

    Number of peer-reviewed articles in foreign language journals:2

    Proceedings of International Conference Number of peer-reviewed papers:2

  • Screening of academic papers

    Role(s): Peer review

    2008

     More details

    Type:Peer review 

    Number of peer-reviewed articles in Japanese journals:2

▼display all

Research Projects

  • 熊本県の事例研究に基づく地方公共団体の模範となる文書管理モデルの確立

    2023 - 2025

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research  Grant-in-Aid for Scientific Research (B)

      More details

    Authorship:Coinvestigator(s)  Grant type:Scientific research funding

  • 地球環境デジタル基盤の構築とその高度化

    2022.4 - 2031.3

    Research commissions

      More details

    Authorship:Coinvestigator(s)  Grant type:Other funds from industry-academia collaboration

  • 地球環境デジタル基盤の構築とその高度化

    2022.4

    国立研究開発法人 海洋研究開発機構 

      More details

    Authorship:Coinvestigator(s) 

    https://diasjp.net/en/

  • 問合せに着目したデータの理解支援に関する研究

    2018 - 2022

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research  Grant-in-Aid for Scientific Research (C)

      More details

    Authorship:Principal investigator  Grant type:Scientific research funding

  • 半構造データに対する付加情報の管理と検索への利用

    2010 - 2013

    Grants-in-Aid for Scientific Research  Grant-in-Aid for Young Scientists (B)

      More details

    Authorship:Principal investigator  Grant type:Scientific research funding

  • 集合知の体系化による知の創出支援基盤技術の開発

    2009 - 2010

    Grants-in-Aid for Scientific Research  Grant-in-Aid for Scientific Research on Priority Areas

      More details

    Authorship:Coinvestigator(s)  Grant type:Scientific research funding

  • Web上の知識資源の統合利用基盤技術に関する研究

    2008 - 2011

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research  Grant-in-Aid for Scientific Research (B)

      More details

    Authorship:Coinvestigator(s)  Grant type:Scientific research funding

  • 仮想部分文書クラスタを用いたXMLサーチエンジンに関する研究

    2007 - 2008

    Japan Society for the Promotion of Science  Grants-in-Aid for Scientific Research  Grant-in-Aid for JSPS Fellows

      More details

    Authorship:Principal investigator  Grant type:Scientific research funding

  • 仮想部分文書クラスタを用いたXMLサーチエンジンに関する研究

    2007 - 2008

    Japan Society for the Promotion of Science  Research Fellowships for Young Scientists

      More details

    Grant type:Joint research

▼display all

Educational Activities

  • [統合新領域学府ライブラリーサイエンス専攻]
    データベース演習, LSS特別講義2, 情報セキュリティ論, ライブラリーサイエンスPTLⅠ, 特別研究Ⅰ, 特別研究Ⅱ

Class subject

  • ライブラリーサイエンスPTLⅠ

    2024.10 - 2025.3   Second semester

  • 特別研究Ⅱ

    2024.4 - 2025.3   Full year

  • 特別研究Ⅰ

    2024.4 - 2025.3   Full year

  • LSS特別講義2

    2024.4 - 2024.9   First semester

  • データベース演習

    2024.4 - 2024.9   First semester

  • 情報セキュリティ論

    2024.4 - 2024.6   Spring quarter

  • ライブラリーサイエンスPTLⅠ

    2023.10 - 2024.3   Second semester

  • LSS特別講義2

    2023.10 - 2024.3   Second semester

  • 特別研究Ⅰ

    2023.4 - 2024.3   Full year

  • 特別研究Ⅱ

    2023.4 - 2024.3   Full year

  • データベース演習

    2023.4 - 2023.9   First semester

  • 情報セキュリティ論

    2023.4 - 2023.6   Spring quarter

  • ライブラリーサイエンスPTLⅠ

    2022.10 - 2023.3   Second semester

  • LSS特別講義2

    2022.10 - 2022.12   Fall quarter

  • 特別研究Ⅰ

    2022.4 - 2023.3   Full year

  • 特別研究Ⅱ

    2022.4 - 2023.3   Full year

  • 図書館情報技術論

    2022.4 - 2022.9   First semester

  • 統合新領域最先端セミナー

    2022.4 - 2022.9   First semester

  • データベース演習

    2022.4 - 2022.9   First semester

  • 情報セキュリティ論

    2022.4 - 2022.6   Spring quarter

  • 情報セキュリティ論

    2021.12 - 2022.2   Winter quarter

  • データベース演習

    2021.10 - 2022.3   Second semester

  • LSS特別講義2

    2021.10 - 2021.12   Fall quarter

▼display all

FD Participation

  • 2023.11   Role:Speech   Title:GakuNin RDMデータ活用セミナー : これからの研究データ管理を探る

    Organizer:[Undergraduate school/graduate school/graduate faculty]

  • 2023.9   Role:Participation   Title:M2Bシステムの使い方 ~Moodleのバージョンアップによる変更点を中心に紹介します~(9/13)

    Organizer:University-wide

  • 2022.4   Role:Participation   Title:令和4年度 第1回全学FD(新任教員の研修)The 1st All-University FD (training for new faculty members) in FY2022

    Organizer:University-wide

  • 2021.12   Role:Participation   Title:医学系学府教育FD「学術論文の購読と投稿とこれから」

    Organizer:[Undergraduate school/graduate school/graduate faculty]

Travel Abroad

  • 2008.4 - 2009.3

    Staying countory name 1:United States   Staying institution name 1:アリゾナ大学