Sponsored Links

Rabu, 27 Juni 2018

Sponsored Links

Data Curation: The Next Frontier - YouTube
src: i.ytimg.com

Data curation is a broad term used to denote processes and activities associated with the organization and integration of data collected from various sources, data annotations, and publications and presentation of data in such a way that the value of data is maintained over time, and data remains available for reuse and preservation. Data curation includes "all processes necessary for the creation, maintenance, and management of principled and controlled data, together with the capacity to add value to data". In science, data curation can show the process of extracting important information from scientific texts, such as research articles by experts, to be transformed into electronic formats, such as the inclusion of biological databases.

In the modern era large data curation data has become more prominent, especially for high volume processing software and complex data systems. The term is also used in the use of history and humanities, where the improvement of cultural and scientific data from digital humanities projects requires the expertise and analytical practice of data curation. In a broad sense, curation means the various activities and processes undertaken to create, manage, maintain, and validate components. Specifically, curation of data is an attempt to determine what information is worth saving and for how long.


Video Data curation



History and practices

Users, rather than the database itself, usually initiate data curation and maintain metadata. According to the University of Illinois' Graduate School of Library and Information Science, "Data curation is an active and sustainable data management through the life cycle of interest and usefulness for scholarships, science, and education, curation activities enable the discovery and retrieval of data, maintain quality, add value, and provide it for reuse from time to time. "The data curation workflow is different from data quality management, data protection, life cycle management and data movement.

Census data have been available in the form of punch cards tabulated since the beginning of the 20th century and have been electronic since the 1960s. The Inter-university Consortium's Site for Political and Social Research (ICPSR) marked the year 1962 as the date of their first Survey Data Archive.

An in-depth background on library data appears in the 1982 edition of the Illinois journal, Library Trends. For historical background on the archive data movement, see "Social Scientific Information Needs for Numerical Data: International Data Evolution Infrastructure Archive." The exact curation process undertaken in any organization depends on the volume of data, how much noise is conceived data and what to expect from future use of data against its deployment.

The crisis in space data led to the creation of the 1999 Overseas Information Systems (OAIS) model, escorted by the Consultative Committee for the Space Data System (CCSDS), established in 1982.

The term data curation is sometimes used in the context of biological databases, where the first specific biological information is obtained from various research articles and then stored in certain categories of databases. For example, information about anti-depressant drugs can be obtained from various sources and, after checking whether they are available as a database or not, they are stored in the category of anti-depressant drug database. Companies also utilize data curation in their operational and strategic processes to ensure data quality and accuracy.

Maps Data curation



Projects and studies

The Dissemination Information Package (DIPS) for the Information Reuse Project (DIPIR) is studying research data produced and used by quantitative social scientists, archaeologists, and zoologists. The intended audience is a researcher using secondary data and digital curators, digital repository managers, data center staff, and others who collect, manage, and store digital information.

The Protein Data Bank was founded in 1971 at Brookhaven National Laboratory, and has grown into a global project. A database for three-dimensional structural data from other large biological proteins and molecules, GDP contains over 120,000 structures, all standardized, validated against experimental data, and annotated.

FlyBase, the main repository of genetic and molecular data for the family of insects Drosophilidae, originated in 1992. FlyBase gives annotation of the whole genome of Drosophila melanogaster .

The Linguistic Data Consortium is the data storage for linguistic data, since 1992.

The Sloan Digital Sky Survey began surveying the night sky in 2000. Computer scientist Jim Gray, while working on the SDSS data architecture, championed the idea of ​​curation of data in science.

DataNet is a research program of the National Science Foundation Office of Cyberinfrastructure, funding a project of data management in science. DataONE (Earth Observation Network) is one of the projects funded through DataNet, helping the environmental science community preserve and share data.

Launching the Data Curation Network | continuum | University of ...
src: www.continuum.umn.edu


See also

  • Biocurator
  • Archeology data
  • Data degradation
  • Data format management
  • Data preservation
  • Data management
  • Data disputes
  • Digital curation, curate published documents, not raw data
  • Digital preservation
  • Information, an individual with extensive industry expertise, acute familiarity with organizational structure and processes, in-depth domain level information and technical information systems

The Role of Data Repositories in Reproducible Research ...
src: isps.yale.edu


References


Data curation takes the value of big data to a new level ...
src: tr1.cbsistatic.com


External links

  • Ecological and environmental data curation: DataONE
  • The data management tools and services cover a wide range of disciplines: DataConservancy

Source of the article : Wikipedia

Comments
0 Comments