Patent Databases Research Guide

Free Patent Data Sources


SureChEMBL is a publicly available, large-scale resource containing compounds extracted from the full text, images, and attachments of patent documents. The data are extracted from the patent literature daily by an automated text- and image-mining pipeline. The database currently contains 17 million compounds extracted from 14 million patent documents. Direct downloads of patent-compound associations are also available on the SureChEMBL FTP page.