The non-redundant patent sequence databases have been created at two levels from the patent class (PAT) in EMBL-Bank and from the patent protein databases. Level-1 clusters group sequences that are 100% identical over the same length; Level-2 clusters group sequences that are identical and also belong to the same patent family (the same invention). Level-2 sequence clusters have been enriched with biological information and additional data from the patent documents.
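The difference between the two levels can be illustrated with a minimal Python sketch. It is not the pipeline used to build the databases: the `PatentSequence` record, its field names, the accession and family identifiers, and the case-insensitive comparison are all assumptions made for illustration.

```python
from collections import defaultdict
from typing import NamedTuple


class PatentSequence(NamedTuple):
    accession: str   # hypothetical entry identifier
    family_id: str   # hypothetical patent family identifier
    sequence: str    # nucleotide or protein sequence


def level1_clusters(entries):
    """Group entries whose sequences are identical over their full length."""
    clusters = defaultdict(list)
    for entry in entries:
        # Assumption: comparison ignores letter case.
        clusters[entry.sequence.upper()].append(entry.accession)
    return dict(clusters)


def level2_clusters(entries):
    """Group entries that are identical AND share a patent family."""
    clusters = defaultdict(list)
    for entry in entries:
        key = (entry.sequence.upper(), entry.family_id)
        clusters[key].append(entry.accession)
    return dict(clusters)


# Made-up example records, not real database entries.
entries = [
    PatentSequence("A1", "FAM1", "ACGT"),
    PatentSequence("A2", "FAM1", "acgt"),
    PatentSequence("A3", "FAM2", "ACGT"),
    PatentSequence("A4", "FAM2", "ACGTA"),
]

l1 = level1_clusters(entries)
l2 = level2_clusters(entries)
```

In this toy data, A1, A2 and A3 share one Level-1 cluster because their sequences are identical, while at Level 2 the entry A3 is separated from A1 and A2 because it belongs to a different patent family. A4 stays on its own at both levels because its sequence differs in length.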
Users can run a similarity and homology search tool, e.g. FASTA, to search sequences against the non-redundant databases, and use EBI Search (NRNL1, NRNL2, NRPL1 and NRPL2) for text querying.