About Literature Database

The Literature Database is a result of our research and development efforts in AI security. It collects literature related to AI security, including academic papers and blog posts published on the internet, and features an automatic labeling system for efficient organization. The figure below provides an overview of the Literature Database. This article briefly introduces its key features.

Figure 1. Overview of the Literature Database.

The Literature Database offers the following three main functions:

  1. Collection of literature related to AI security
  2. Content-based labeling of literature
  3. Automatic posting of labeling information

We launched the Literature Database in March 2025 and are continuously engaged in research and development to address the following technical challenges:

  • Accuracy of assigned labels
    Labels are assigned by large language models (LLMs), so they are not guaranteed to be academically accurate. We plan to further improve the accuracy of the labeling process.
  • Spelling variations
    The same concept can be written in several ways, for example with different English expressions or abbreviations. A key focus going forward is to develop methods to unify these variations.
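
To make the spelling-variation challenge concrete, the following is a minimal sketch of one possible approach: normalize case and separators, then map known variants to a single canonical label. The alias table and the function names `normalize_label` and `unify_labels` are hypothetical and are not taken from the Literature Database itself.

```python
import re

# Hypothetical alias table (an assumption for illustration): each key is a
# normalized variant, each value is the canonical label it should map to.
ALIASES = {
    "llm": "large language model",
    "llms": "large language model",
    "large language models": "large language model",
    "adversarial examples": "adversarial example",
}


def normalize_label(label: str) -> str:
    """Lowercase the label, collapse spaces/hyphens/underscores, apply aliases."""
    key = re.sub(r"[\s\-_]+", " ", label.strip().lower())
    return ALIASES.get(key, key)


def unify_labels(labels):
    """Return canonical labels in first-seen order, without duplicates."""
    seen = []
    for label in labels:
        canonical = normalize_label(label)
        if canonical not in seen:
            seen.append(canonical)
    return seen
```

A fixed alias table only covers variants that are already known, so in practice it would probably need to be combined with some other matching method for unseen variants.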

This article has introduced the automatic labeling feature of the Literature Database. Because the database is based on ongoing research and development, many challenges remain. We are committed to improving the Literature Database continually.

Statistical Information

The Literature Database provides statistical information on the literature collected daily. Specifically, we collect information about the publication year, the country of the first author’s affiliation, and the conferences where the works were presented. To view this information, please visit the Literature Database Statistics page.

The publication year is obtained from the metadata of the collected literature. The country of the first author’s affiliation is extracted by LLMs from the text of the first page of each collected paper. For conference information, we look up each paper’s title in a database of conference papers to identify the conference where it was presented.
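
The country extraction step could be sketched roughly as follows. This is only an illustration under stated assumptions: the prompt wording, the `KNOWN_COUNTRIES` allow-list, the 4000-character cutoff, and the `ask_llm` callable (any function that takes a prompt string and returns the model's text reply) are all hypothetical and do not describe the actual implementation.

```python
from typing import Callable, Optional

# Hypothetical allow-list (an assumption); a real one would cover all countries.
KNOWN_COUNTRIES = {"Japan", "United States", "Germany", "China"}


def build_prompt(first_page_text: str) -> str:
    """Build a prompt asking for the first author's affiliation country only."""
    return (
        "Return only the country of the first author's affiliation, "
        "or UNKNOWN if it cannot be determined.\n\n"
        + first_page_text[:4000]  # assumed cutoff to keep the prompt short
    )


def extract_country(
    first_page_text: str, ask_llm: Callable[[str], str]
) -> Optional[str]:
    """Ask the model for a country; accept the reply only if it is a known country."""
    answer = ask_llm(build_prompt(first_page_text)).strip()
    return answer if answer in KNOWN_COUNTRIES else None
```

Checking the reply against a fixed list is one simple way to discard free-form or invented answers, at the cost of leaving some papers without a country.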

The Literature Database Statistics page displays the number of papers by country on a world map, along with graphs of the number of papers by publication year. A search form lets you filter the results by keyword, publication year, and conference.
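
As an illustration of the kind of filtering and counting behind such a page, here is a minimal sketch. The `Paper` record, the function names, and the sample entries are invented placeholders, not the database's actual schema or data.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Optional


@dataclass
class Paper:
    # Hypothetical record layout (an assumption, not the real schema).
    title: str
    year: int
    country: Optional[str]      # None when extraction found no country
    conference: Optional[str]   # None when no conference was matched


def filter_papers(papers, keyword=None, year=None, conference=None):
    """Keep the papers that match every filter that was supplied."""
    result = []
    for p in papers:
        if keyword is not None and keyword.lower() not in p.title.lower():
            continue
        if year is not None and p.year != year:
            continue
        if conference is not None and p.conference != conference:
            continue
        result.append(p)
    return result


def count_by_country(papers):
    """Paper counts per country, skipping papers with no country."""
    return Counter(p.country for p in papers if p.country is not None)


def count_by_year(papers):
    """Paper counts per publication year."""
    return Counter(p.year for p in papers)


# Placeholder entries for demonstration only.
PAPERS = [
    Paper("Example paper on prompt injection", 2024, "Japan", "Conf A"),
    Paper("Example paper on model extraction", 2024, "Germany", None),
    Paper("Another prompt injection example", 2023, None, "Conf A"),
]

matches = filter_papers(PAPERS, keyword="prompt injection")
by_country = count_by_country(matches)
by_year = count_by_year(PAPERS)
```

Papers with no extracted country are left out of the country counts rather than assigned a guess, which fits the idea that the statistics indicate trends rather than exact figures.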

Please note that because some of the information is extracted using LLMs, the statistics page should be read only as an indication of trends in the literature registered in the database.