Date: (20 years and 119 days ago)
Company:
PR: CFF Communications

Oracle Database 10g supports the world's largest DNA database

Sanger Institute holds the world's most complete record of genetic material

On Wednesday 18 January 2006, the DNA database archive of the Wellcome Trust Sanger Institute reached 1 billion records. The archive is so large that, printed out as a single line of text, it would wrap around the world 250 times. It holds all the data produced and published by scientists worldwide. The archive is 22 terabytes in size and doubles every 10 months.

The database runs on a single HP ES45 (a 4-CPU server with 16 GB of memory), with storage consisting of HSV EVA5000s and EVA8000s on a SAN. Data are processed into the database using a cluster of 4 ES45s. The database is Oracle Database 10g.

The full press release follows:

Around the World in 800 Billion Bases
Sanger Institute Genetic Records are World’s Biggest

On Wednesday 18 January 2006 the Wellcome Trust Sanger Institute's World Trace Archive database of DNA sequences hit one billion entries. The Trace Archive is a store of all the sequence data produced and published by the world scientific community, including the Sanger Institute’s own prodigious output as a world-leading genomics institution.

To grasp how much data is in the Archive, if it were printed out as a single line of text, it would stretch around the world more than 250 times. Printing it out on pages of A4 would produce a stack of paper two-and-a-half times as high as Mount Everest.

Each entry is a piece of genetic information averaging 864 characters long. Scientists can search these sequences and piece them together to build up the whole genetic information of organisms – mice, fish, flies, bacteria and, of course, humans.
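As a rough check on the headline figure, the total number of bases can be estimated from the two numbers quoted above: one billion entries at an average of 864 characters each. The short Python sketch below only multiplies those two figures; the function and constant names are illustrative and do not come from the release.

```python
# Figures quoted in the press release.
ENTRIES = 1_000_000_000   # one billion Trace Archive entries
AVG_ENTRY_LENGTH = 864    # average characters (bases) per entry


def total_bases(entries: int, avg_length: int) -> int:
    """Approximate total number of bases: entry count times average length."""
    return entries * avg_length


total = total_bases(ENTRIES, AVG_ENTRY_LENGTH)
print(f"{total:,} bases (about {total / 1e9:.0f} billion)")
# prints "864,000,000,000 bases (about 864 billion)"
```

The result, roughly 864 billion, is consistent with the "800 Billion Bases" of the headline.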

The Archive is 22 Terabytes in size and doubling every ten months – perhaps the largest single scientific database in Europe, if not the world.
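To illustrate what doubling every ten months means in practice, the sketch below projects the archive size forward from 22 TB. It assumes the doubling period stays constant, which the release does not claim; the function name and the chosen time points are illustrative only.

```python
CURRENT_SIZE_TB = 22.0   # archive size quoted in the release
DOUBLING_MONTHS = 10.0   # doubling period quoted in the release


def projected_size_tb(months_ahead: float) -> float:
    """Size after `months_ahead` months, assuming constant exponential growth."""
    return CURRENT_SIZE_TB * 2 ** (months_ahead / DOUBLING_MONTHS)


for months in (10, 20, 30):
    print(f"after {months} months: {projected_size_tb(months):.0f} TB")
```

Under that assumption the archive would reach about 44 TB after ten months, 88 TB after twenty and 176 TB after thirty.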

Martin Widlake, Database Services Manager at the Wellcome Trust Sanger Institute, said: “At 22 000 GB the Trace Archive is in the Top Ten UNIX databases in the world. That’s not bad for a research organisation of 850 employees in the countryside just outside Cambridge.

“It is possibly the biggest single (acknowledged) scientific RDBMS database in Europe, if not the world.”

All the data are freely available to the world scientific community (http://trace.ensembl.org/), as a resource to geneticists all over the globe. When a researcher is studying a disease or gene, they can download the genetic information known about the area they are studying.

The data are being actively used by biomedical researchers in academic and commercial organizations. The three internet domains that make most use of the Trace Archive are .com, .edu and .uk. Dotcoms are responsible for about 80% of downloads each week – mostly as big ‘customers’, taking vast chunks each visit. Next are US university researchers, followed by UK scientists.

Trace data are the raw results of genetic research. They allow researchers to identify and study genes, to reveal variations (mutations) in genes and to study similarity to genes in other organisms. These are vital starting points for studying and better understanding the biology of health and disease.

By any comparison, the billion records stand above many other familiar repositories. The British Library holds 13 million items; the US Library of Congress holds 115 million items. The Trace Archive holds one billion chunks of unique information.

“Accessing the data becomes a larger and larger problem as the dataset grows,” continued Martin Widlake. “At present it is simple and very quick to access a record if you know its unique identifier as issued by the Sanger Institute, the US National Center for Biotechnology Information (NCBI) database, or the ‘name’ of the trace as given by the organization that originally sequenced that piece of genetic information.

“Scanning the whole dataset for a single genetic sequence, which is a lot like searching for a single sentence in the contents of the British Library, is a massive task. However, the team at the Sanger Institute are working on new methods to make the data easier to search and access.”

The data are held in duplicate, with the NCBI also maintaining a copy: with two sites holding it, a single disaster cannot wipe out the only copy of this vital and heavily used database.

NOTE TO EDITORS

DNA traces
DNA sequencing technology tags each letter of genetic code (base) with a fluorescent chemical. The sequence is read by robots that visualize each letter as a peak of red, green, yellow or blue fluorescence. This image is the ‘trace’.

Each file of raw data is about 200 KB. The trace is interpreted by the robot software and the letters are identified (the bases are ‘called’ in the jargon). The text string of sequence then becomes searchable and faster programs are needed to manage the search of almost one trillion letters (one billion records of 864 bases on average, plus some older records of earlier versions).

trace.ensembl.org/index.html

The hardware and software
The Database is hosted on a single HP ES45 (a 4-CPU server with 16GB of memory) with the storage consisting of HSV EVA5000s and EVA8000s on a SAN. The data are processed into the database using a cluster of 4 ES45s. The database is Oracle Database 10g.

The Winter Corporation database survey 2005
The Winter Corporation database survey 2005 suggests the Trace Archive would rank fifth behind such giants as AT&T, Yahoo and other large international corporations.

Link 1  -- select ‘UNIX’ as the Platform option:
Link 2 (PDF)

Oracle
Oracle is the world's largest supplier of enterprise software. For more information: www.oracle.com

###

Trademarks
Oracle, JD Edwards, PeopleSoft and Retek are registered trademarks of Oracle Corporation and/or its affiliates. Other names may be trademarks of their respective owners.