2007/12/22 23:49:12 | Views: 470 | Replies: 0 | Recommendations: 0
GenBank. Today, the preeminent DNA sequence database in the world is GenBank, maintained by the National Center for Biotechnology Information (NCBI), part of the U.S. National Library of Medicine (NLM) at the National Institutes of Health. It was established in 1978 as a central repository for DNA sequence data. Since then it has expanded somewhat in scope to include expressed sequence tag data, protein sequence data, three-dimensional protein structure, taxonomy, and links to the biomedical literature (MEDLINE). As of August 2005, the database has exceeded 100 gigabases of sequence data representing both individual genes and partial and complete genomes of over 165,000 organisms.

Through international collaboration with the European Molecular Biology Laboratory (EMBL) in the U.K. and the DNA Data Bank of Japan (DDBJ), data are exchanged among the sites daily, so the newest data quickly become available to scientists around the world. While it is a complex, comprehensive database, the scope of its coverage is focused on sequences from human and other organisms and links to the literature. Other limited data sources (for example, three-dimensional structure and OMIM) have been added recently by reformatting the existing OMIM and PDB databases and redesigning the structure of the GenBank system to accommodate these new data sets.

The system is maintained as a combination of flat files, relational databases, and files containing Abstract Syntax Notation One (ASN.1), a syntax for defining data structures developed for the telecommunications industry.

Each GenBank entry is assigned a unique identifier by the NCBI. Updates are assigned a new identifier, with the identifier of the original entity remaining unchanged for archival purposes.
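The rule that an update gets a new identifier while the original stays untouched can be pictured with a small append-only store. This is only a minimal sketch to illustrate the idea: the class names `Entry` and `ArchiveStore`, the integer identifiers, and the `supersedes` field are all hypothetical and are not GenBank's actual schema or identifier format.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class Entry:
    """One immutable record; all field names here are illustrative assumptions."""
    identifier: int                    # hypothetical numeric identifier
    sequence: str                      # the sequence data held by this record
    supersedes: Optional[int] = None   # identifier of the entry this one updates


class ArchiveStore:
    """Append-only store: an update never overwrites an existing entry."""

    def __init__(self) -> None:
        self._entries: Dict[int, Entry] = {}
        self._next_id = 1

    def add(self, sequence: str, supersedes: Optional[int] = None) -> Entry:
        # Refuse to link to an identifier that was never issued.
        if supersedes is not None and supersedes not in self._entries:
            raise KeyError(f"unknown identifier: {supersedes}")
        entry = Entry(self._next_id, sequence, supersedes)
        self._entries[entry.identifier] = entry
        self._next_id += 1
        return entry

    def update(self, identifier: int, new_sequence: str) -> Entry:
        # An update is stored as a brand-new entry with its own identifier.
        return self.add(new_sequence, supersedes=identifier)

    def get(self, identifier: int) -> Entry:
        return self._entries[identifier]


store = ArchiveStore()
original = store.add("ATGGCC")
revised = store.update(original.identifier, "ATGGCCTAA")
```

After the update, `store.get(original.identifier)` still returns the original sequence, while `revised` carries a fresh identifier and records which entry it replaces.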
Older references to an entity thus do not inadvertently indicate a new and possibly inappropriate value. The most current concepts also receive a second set of unique identifiers (UIDs), which mark the most up-to-date form of a concept while allowing older versions to be accessed via their original identifiers.

The average user of the database is not able to access the structure of the data directly for querying or other functions, although complete snapshots of the database are available for export in a number of formats, including ASN.1. The query mechanism provided is via the Entrez application (or its Web version), which allows keyword, sequence, and GenBank UID searching through a static interface.

Original text excerpted from Fundamentals of Database Systems, Fifth Edition. I translated it myself as a rough amateur effort; if you spot mistakes, please point them out.











