Le Lézard
Classified in: Science and technology
Subject: Product/Service

New Argonne-supported code mines microscopy images in scientific articles


Deep learning is a form of artificial intelligence transforming society by teaching computers to process information using artificial neural networks that mimic the human brain. It is now used in facial recognition, self-driving cars and even in the playing of complex games like Go. In general, the success of deep learning has depended on using large datasets of labeled images for training purposes.

A potential gold mine of labeled images resides within the scientific literature, with over a million articles published each year. Most have many figures woven into the text. To date, these figures have not been amenable to deep learning models. This is, in part, due to their complex layouts. Each figure typically contains multiple embedded images, graphs and illustrations. Also lacking has been an adequate means to search the literature for images matching specific content.

Addressing this challenge, researchers at the U.S. Department of Energy's (DOE) Argonne National Laboratory and Northwestern University have created the EXSCLAIM! software tool. The name stands for extraction, separation and caption-based natural language annotation of images.

"Images generated by electron microscopes down to the billionths of a meter are one of the most important kinds of figures in materials science literature," said Maria Chan, scientist in Argonne's Center for Nanoscale Materials, a DOE Office of Science user facility. ?"These images are essential to the understanding and development of new materials in many different fields. Our goal with EXSCLAIM! is to unlock the untapped potential of these imaging data."

What sets EXSCLAIM! apart is its unique focus on a query-to-dataset approach similar to how a prompt is used with generative AI tools such as ChatGPT and DALL-E. It is thus capable of extracting individual images with very specific content from figures, as it both classifies the image content and recognizes the degree of magnification. It can then create descriptive labels for each image. This innovative software tool is expected to become a valuable asset for scientists researching new materials at the nanoscale.

"While existing methods often struggle with the compound layout problem, EXSCLAIM! employs a new approach to overcome this," said lead author Eric Schwenker, a former Argonne graduate student. ?"Our software is effective at identifying sharp image boundaries, and it excels in capturing irregular image arrangements."

EXSCLAIM! has already demonstrated its effectiveness by constructing a self-labeled electron microscopy dataset of more than 280,000 nanostructure images. While initially developed around materials microscopy images, EXSCLAIM! is adaptable to any scientific field that produces high volumes of papers with images. The software thus promises to revolutionize the use of published scientific images across various disciplines.

"Researchers now have a powerful image-mining tool to advance their understanding of complex visual information," Chan said.

This research was supported by the DOE Office of Basic Energy Sciences, Laboratory Directed Research and Development funding from Argonne and a DOE Early Career Award. The team used high performance computing resources at Argonne's Laboratory Computing Resource Center, Argonne's Joint Laboratory for System Evaluation and the National Energy Research Scientific Computing Center, a DOE Office of Science user facility at DOE's Lawrence Berkeley National Laboratory.

This research first appeared in Patterns. In addition to Chan and Schwenker, authors include Weixin Jiang, Trevor Spreadbury, Nicola Ferrier and Oliver Cossairt.


These press releases may also interest you

at 20:51
Ecogensus, a pioneering resource technology company, today announced its expansion into innovative materials derived from mixed wastes, unlocking the potential value of billions of tons of unused resources. These versatile new materials are...

at 20:48
Hawaiian Airlines today published its annual Corporate Kuleana (Responsibility) Report highlighting Environmental, Social and Governance (ESG) initiatives to fly and grow more sustainably....

at 19:56
BigSpring, the AI-powered revenue acceleration platform, is proud to share its latest product offering to accelerate sales for companies with a global ecosystem of teams and partners. Focused on rapidly building proficiency on the latest innovations,...

at 19:46
BigSpring, a member of the EDISON Alliance and a World Economic Forum Global Innovator, has committed to equip one hundred thousand people by 2025 with digital proficiency to enhance productivity in both work and life....

at 19:31
Alleva, a leading provider of electronic medical record (EMR) solutions, attended the first-ever East Coast TCIV conference last week in Palm Beach, Florida. The company's CEO, Steve McCall, Director of Sales, John David, and Manager of Strategic...

at 19:27
Mathpresso, creator of Asia's leading learning app, QANDA, today announced that it has been recognized by TIME's World's Top EdTech Companies in 2024. Presented by TIME and Statista Inc., this accolades acknowledges companies that exhibit financial...



News published on and distributed by: