The SequenceBase Research Portal contains patents and biological sequences from several rich databases: USGENE, WOGENE, GENESEQ (along with its fast-updating companion, GENESEQ FASTAlert), and GENBANK. The amount of data in each of them is overwhelming! Because the data comes from different sources, the same sequence can sometimes be found in two or more databases. Still, searching across all of these databases remains important, for the following reasons.
The Venn diagrams we created from our data show why a prudent searcher needs to use the SequenceBase Research Portal with all of its data sources for a comprehensive search.
There are two circles on the diagram below: the red one represents the number of sequences in the USGENE and WOGENE databases combined, and the green one indicates how many sequences there are in GENESEQ:
While there is significant overlap in sequences available in USGENE+WOGENE and GENESEQ, there is still a large number of sequences available in USGENE+WOGENE that are not available in GENESEQ and vice versa.
The second Venn diagram shows that even with the large dataset in GENBANK, there are still huge numbers of sequences available in USGENE+WOGENE+GENESEQ that are not available in GENBANK:
As you can see, although the databases share some identical sequences, each of them contains a large number of unique sequences that cannot be found in any other database. To ensure that you haven't missed any key sequences and that your results are complete, we advise running searches across all the databases.
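The overlap argument can be illustrated with plain set operations. The following is only a minimal sketch: the identifiers are made-up placeholders rather than real database contents, and `venn_counts` is a hypothetical helper, not part of the SequenceBase Research Portal.

```python
# Hypothetical placeholder identifiers; these are NOT real database records.
usgene_wogene = {"SEQ001", "SEQ002", "SEQ003", "SEQ004"}
geneseq = {"SEQ003", "SEQ004", "SEQ005"}
genbank = {"SEQ004", "SEQ006", "SEQ007"}


def venn_counts(a, b):
    """Return the sizes of the three regions of a two-circle Venn diagram."""
    return {
        "only_a": len(a - b),  # present in a, missing from b
        "both": len(a & b),    # the overlap
        "only_b": len(b - a),  # present in b, missing from a
    }


# First diagram: USGENE+WOGENE versus GENESEQ.
first = venn_counts(usgene_wogene, geneseq)

# Second diagram: USGENE+WOGENE+GENESEQ versus GENBANK.
patent_dbs = usgene_wogene | geneseq
second = venn_counts(patent_dbs, genbank)

# Sequences that a GENBANK-only search would never return.
missed_by_genbank_only = patent_dbs - genbank

print(first)
print(second)
print(sorted(missed_by_genbank_only))
```

Whenever the `only_a` and `only_b` regions are non-empty, searching a single source leaves part of the data unseen, which is the point the diagrams make.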
Feel free to sign up to the SequenceBase Research Portal!