SUMMARY ON DATA COLLECTION AND REPOSITORIES

 

SUMMARY ON DATA COLLECTION AND REPOSITORIES

Data collection and repositories are central components of effective research data management (RDM), ensuring that data is not only generated systematically but also preserved and made accessible for future use.

Data collection refers to the structured process of gathering information through methods such as surveys, interviews, observations, or document analysis. In RDM, emphasis is placed on producing high-quality, reliable, and well-documented data. This includes the use of standardised procedures and tools, as well as the creation of metadata that describes the content, methodology, and structure of the data. Proper documentation enhances data integrity and enables reuse, which aligns with the FAIR principles ensuring that data is findable, accessible, interoperable, and reusable (Wilkison et...al., 2016).

The Digital Curation Centre (DCC) Lifecycle Model highlights data collection within the “create or receive” stage, emphasizing that data should be captured in formats and structures that support long-term preservation and usability (Higgins, 2008). Poor data collection practices such as of documentation or inconsistent format can compromise the entire data lifecycle, limiting the value of research outputs. Therefore, researchers are encountered to adopt best practices at the point of data creation, including ethical considerations, data quality checks, and adherence to institutional or disciplinary standards.

Data repositories compliment data collection by providing secure platforms for storing, preserving, and sharing research data. These repositories may be institutional, disciplinary, or general-purpose, and they play a critical role in ensuring long-term access and visibility of research outputs. Modern repositories support functionalities such as persistent identifiers for example, DOIs, metadata standards, version control, and access management. According to Tenopir, et...al., (2020), the use of repositories has increased globally, driven by funder requirements and the growing emphasis on open science, although challenges such as lack of skills and infrastructure persist, particularly in developing regions.

Repositories also align with the preservation and access stages of the DCC lifecycle, ensuring that data remains usable over time despite technological changes. However, in many African institutions, including those in Malawi, the adoption of repositories is constrained by limited infrastructure, insufficient policies, and low awareness among researchers (Chiware & Mathe, 2016). Addressing these challenges requires institutional investment in repository systems, staff training, and the development of clear RDM policies.

In a nut shell, effective data collection ensures the generation of high-quality, well documented data, while repositories ensure that data is preserved, accessible, and reusable over time. Both components are interdependent and central to robust research data management systems.  

References     

Chiware, E.R.T., & Mathe, Z. (2016). Academic libraries’ role in research data management

 services: A South African perspective. South Africa journal of Libraries and information

 Science. 82(2),1-9. https://doi.org/10.7553/82-2-1561

Higgins, S. (2009). The DCC Curation Lifecycle Model. International journal of Digital

  Curation, 3(1), 134-#140. https://doi.org/10.2218/ijdc.v3i1.48

Tenopir, C., Allard, S., Douglass, K., Aydinoglu, A. U., Wu, L., Read, E., Manoff, M., & Frame,

 M. (2011). Data sharing by scientists: Practices and perceptions. PLoS ONE, 6(6),

 e21101.

 https://doi.org/10.1371/journal.pone.0021101

Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A.,

 Blomberg, N., Boiten, J. W., da Silva Santos, L. B., Bourne, P. E., Bouwman, J.,

 Brookes, A. J., Clark, T., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C. T.,

 Finkers, R., … Mons, B. (2016). The FAIR guiding principles for scientific data

management and stewardship. Scientific Data, 3, 160018.

 https://doi.org/10.1038/sdata.2016.18





 



Comments

Post a Comment

Popular posts from this blog

A SUMMARY ON SELECTION AND APPRAISAL OF DATA

LECTURE-SUMMARY OF INFORMATION LITERACY MODELS AND THE BIG SIX INFORMATION SKILLS MODEL-BY Edna Kaliza MLIS0725