• NERC Data Catalogue Service
  •  
  •  
  •  

Molluscan Shell Matrix Proteins

The database contains fasta sequences from UniProt and associated metadata for molluscan shell matrix proteins (SMPs). The database only contains SMPs that have been experimentally validated to be present in molluscan shell matrices (based on the publication(s) attached to the UniProtID). Metadata includes information on functional domains present in the sequence, as detected by InterproScan.

With the advent of Next Generation Sequencing technologies, it is computationally resource intensive to run sequence similarity algorithms on all published data. Moreover, it is impractical to sort through hundreds of sequence similarity search results when working with non-model organisms, since pre-established functional annotations of sequences are generally not available. Therefore, this database was created in order to provide a targeted molluscan biomineralization dataset for sequence similarity algorithms (such as BLAST).

Database created as part of doctoral research, funded under Marie Curie Innovative Training Networks (ITN) - Calcium in the Changing Environment (CACHE - Grant agreement 605051).

Simple

Date (Creation)
2019-01-24
Date (Revision)
2019-01-24
Date (Publication)
2019-01-24
Date (released)
2019-01-24
Edition
1.0
Unique resource identifier
https://doi.org/10.5285/c42314b9-089e-48e7-b08e-8f664f5dc71c
Codespace
doi
Unique resource identifier
GB/NERC/BAS/PDC/01132
Codespace
https://data.bas.ac.uk/
Other citation details
Please cite this item as: Yarra, T. (2019). Molluscan Shell Matrix Proteins (Version 1.0) [Data set]. UK Polar Data Centre, British Antarctic Survey, Natural Environment Research Council, UK Research & Innovation. https://doi.org/10.5285/c42314b9-089e-48e7-b08e-8f664f5dc71c
Credit
No credit.
Status
completed Completed
Author
  British Antarctic Survey, Natural Environment Research Council, UK Research & Innovation - Yarra, Tejaswi ( Researcher )
Point of contact
  NERC EDS UK Polar Data Centre
British Antarctic Survey, High Cross, Madingley Road , Cambridge , Cambridgeshire , CB3 0ET , United Kingdom
+44 (0)1223 221400
https://www.bas.ac.uk/team/business-teams/information-services/uk-polar-data-centre/
Maintenance and update frequency
asNeeded As needed
Maintenance note
completed Completed
Global Change Master Directory (GCMD) Science Keywords
  • EARTH SCIENCE > Biosphere > Animal Taxonomy > Mollusks
Theme
  • Biomineralization
  • Molluscs
  • SMPs
  • Shell Matrix Proteins
  • Shell formation
GEMET - INSPIRE themes, version 1.0
  • Habitats and biotopes
Access constraints
otherRestrictions Other restrictions
Other constraints
no limitations to public access
Access constraints
otherRestrictions Other restrictions
Other constraints
no limitations
Use constraints
license License
Other constraints
Open Government Licence v3.0
Use constraints
otherRestrictions Other restrictions
Other constraints
Data released under Open Government Licence V3.0:
Unique resource identifier
url
Codespace
url
Association Type
crossReference Cross reference
Spatial representation type
textTable Text, table
Metadata language
engEnglish
Character set
utf8 UTF8
Topic category
  • Biota
N
S
E
W
thumbnail


Supplemental Information
It is recommended that careful attention be paid to the contents of any data, and that the author be contacted with any questions regarding appropriate use. If you find any errors or omissions, please report them to polardatacentre@bas.ac.uk.
Date (Publication)
2008-11-12
Publisher
  European Petroleum Survey Group
https://www.epsg-registry.org/
Unique resource identifier
urn:ogc:def:crs:EPSG::3031
Version
6.18.3

Distributor

Distributor
  NERC EDS UK Polar Data Centre
British Antarctic Survey, High Cross, Madingley Road , Cambridge , Cambridgeshire , CB3 0ET , United Kingdom
+44 (0)1223 221400
https://www.bas.ac.uk/team/business-teams/information-services/uk-polar-data-centre/
Name
application/vnd.ms-excel
Name
text/x-fasta
Name
text/plain
Units of distribution
bytes
Transfer size
445440
OnLine resource
Get Data ( WWW:LINK-1.0-http--link )

Download data

Units of distribution
bytes
Transfer size
445440
OnLine resource
Get Data ( WWW:LINK-1.0-http--link )

Download data

Units of distribution
bytes
Transfer size
445440
OnLine resource
Get Data ( WWW:LINK-1.0-http--link )

Download data

Hierarchy level
dataset Dataset
Statement

Methodology:

Data was gathered by mining metadata from Trembl database for various biomineralization specific keywords. Example keywords include: Molluscs, shell, bivalve, aragonite, calcite, prismatic, foliated, mantle, mantle edge, central mantle, pallial mantle, etc. Only SMPs uploaded to Uniprot were included in this database. The database only contains SMPs that have been experimentally validated to be present in molluscan shell matrices (based on the publication(s) attached to the UniProtID). Uniprot entries for only mantle specific sequences were not included, since all such sequences were determined to be biomineralization related based on sequence similarity results to the proteins identified in shell matrices, and were not experimentally validated.

Data collection:

Data was collected from Uniprot (https://www.uniprot.org/).

Data quality:

Database is based on uploaded Unioprot IDs entries until JULY 2018. No major publications have since been released that match the criteria to be included in the database. Missing values are only present in the domain columns - A missing value indicates that there were no functional domains detected in the sequence, based on InterproScan results from Interpro databases.

File identifier
c42314b9-089e-48e7-b08e-8f664f5dc71c XML
Metadata language
engEnglish
Character set
utf8 UTF8
Hierarchy level
dataset Dataset
Hierarchy level name
dataset
Date stamp
2019-01-24
Metadata standard name
ISO 19115 Geographic Information - Metadata
Metadata standard version
ISO 19115:2003(E)
Point of contact
  NERC EDS UK Polar Data Centre
British Antarctic Survey, High Cross, Madingley Road , Cambridge , Cambridgeshire , CB3 0ET , United Kingdom
+44 (0)1223 221400
https://www.bas.ac.uk/team/business-teams/information-services/uk-polar-data-centre/
 
 

Overviews

Spatial extent

N
S
E
W
thumbnail


Keywords

Biomineralization Molluscs SMPs Shell Matrix Proteins Shell formation
GEMET - INSPIRE themes, version 1.0
Habitats and biotopes
Global Change Master Directory (GCMD) Science Keywords
EARTH SCIENCE > Biosphere > Animal Taxonomy > Mollusks

Provided by

logo

Share on social sites

Access to the portal
Read here the full details and access to the data.

Associated resources

Not available


  •  
  •  
  •