World's first geographic multimodal premiered in China
2024-09-19
Chinese scientists unveiled the world's first multimodal geographic science model, "Sigma Geography," which can answer professional geographic questions, analyze geographic literature, query geographic data resources, and even create thematic maps.
Compared to general-purpose large language models, it boasts an approximately 31 percent increase in accuracy within the field of geography, the research team claimed.
Sigma Geography was jointly developed by the Institute of Geographic Sciences and Natural Resources Research, the Institute of Tibetan Plateau Research, and the Institute of Automation of the Chinese Academy of Sciences.
The research team established a comprehensive geographic corpus covering four major categories and sixteen subcategories, providing 32.3 billion geographic tokens for the large model's self-supervised learning. They also crafted over 40,000 high-quality geographic instructions for model fine-tuning.
"Compared to general-purpose large language models, Sigma Geography is more familiar with the language patterns, professional terminology, and domain knowledge of geography, leading to an accuracy improvement of around 31 percent on the geographic benchmark test set," said Su Fenzhen, deputy director of the Institute of Geographic Sciences and Natural Resources Research.
Su compared Sigma Geography with Chat-GPT in answering geographic professional queries and found that Sigma Geography provided more accurate and comprehensive responses, tailoring answers to the roles of different questioners.
The research team explained that this was achieved through their innovative user profile, precise discrimination and response technology. Consequently, Sigma Geography can fully consider the cognitive and expressive differences in geographic scientific knowledge systems among geography enthusiasts, geography students, and researchers, providing tailored solutions to geographic professional questions that align with users' knowledge structures through a combination of text and images.
In addition, Sigma Geography can retrieve different geographic elements based on generated text answers, match them with geographic landscape photos, thematic maps, or schematic diagrams for presentation to questioners, and follow user instructions to complete processes such as data acquisition, information analysis, and map production, ultimately generating the specialized geographic charts users need.
Sigma Geography has supported the publication of over ten high-level academic papers in journals such as Nature sub-journals, The Innovation, and Earth's Future, according to the research team.
In the future, the research team plans to advance the development of large-scale map models and geographic reasoning engines, aiming to enable geographic science large models to act as geointelligent scientists, Su said.
They also intend to build a collaborative platform for geographic research, allowing every scientist and research team to possess their exclusive geographic models for collaborative work with millions of scientists through shared data, models, research ideas, and more.