SureChEMBL is a publicly available large-scale resource containing compounds extracted from the full text, images, and attachments of patent documents. The data are extracted from the patent literature according to an automated text- and image-mining pipeline on a daily basis. Currently, the database contains 28 million compounds extracted from 28 million patent documents. Direct downloads of patent-compound associations are also available on their FTP page.