Data Storage

 This week we are looking at how we can store data. The world now has enormous amounts of data and it is imperative that we devise convenient means on how we can store this data.

Wilkinson et al., (2016) and Cox et al., (2019) explain that data curation in data storage refers to the structured and systematic management of digital information to ensure that it is securely preserved, well-organized and consistently accessible for future use.This is an importatnt aspect of the data curation life cycle because the data does not only need to be collected and analyzed but must also be stored in a manner that supports long-term preservation, retrieval, sharing and reuse. In this regard effective storage underscores the reliability and sustainability of digital data assets in research and institutional environments ensuring that information remains usable beyond its initial purpose, (Wilkinson et al., 2016; Cox et al., 2019).




It is importatnt to understand that data storage in data curation is not only about saving files, no. Rather it involves deliberate organization practices that enhance data management and usability. Johnston et al., (2018) and  Kim et al., (2021) tell us that these practices include the use of metadata schemas, standardized file naming systems, indexing and version control mechanisms. Metadata, in particular, provides structured descriptive information that improves understanding, discoverability and context of datasets. On the other  hand version control helps track changes made to datasets over time, ensuring transparency and preventing the loss of earlier data versions. Research across different disciplines has shown that weak organisational practices often lead to data fragmentation, duplication and retrieval challenges which in the long term affect research reproducibility and efficiency,(Johnston et al., 2018; Kim et al., 2021).

Data storage systems are supported by a variety of infrastructures that differ in capacity, cost and accessibility. These infrastructures include local storage devices, institutional servers, cloud computing platforms and digital repositories which are maintained by universities or research organizations. Cloud-based systems are increasingly favored due to their flexibility, scalability and ability to support remote access and collaboration among distributed users. Similarly, institutional repositories play a key role in preserving scholarly outputs and ensuring that research data remains available for long-term academic use and verification, (Yoon & Kim, 2020; Johnston et al., 2018).

In order for data storage to be effective, it is importatnt that we have secure security considerations in place at our various institutions. Stored data must be protected against risks such as unauthorized access, corruption, accidental deletion and cyber-attacks. In order to to address these risks, institutions must implement layered security mechanisms including encryption, access controls, authentication procedures and monitoring systems. Kim et al., (2021) emphasizes that backup strategies form a critical part of data protection ensuring that copies of data are available in the event of eventualities such as system failure or data loss. Research around the world has shown that reliable backup and recovery systems significantly strengthen data integrity and institutional trust in digital records, records, (Kim et al., 2021; Cox et al., 2019).

One important dimension of data storage is its role in supporting data accessibility and sharing. Having properly curated storage systems in place ensures that authorized users can easily locate, retrieve, and reuse datasets. This allows to support collaborative research and improves transparency in scientific communication. The FAIR principles (findability, accessibility, interoperability and reuse) provides for a widely accepted framework for guiding such practices.When data is stored following these principles, it becomes more valuable not only for its original purpose but also for future research and innovation, (Wilkinson et al., 2016; Yoon & Kim, 2020). 


Alemneh & Hastings (2020) remind us that despite having its benefits data storage does have its challenges. Rapid growth of data generation places pressure on storage capacity and infrastructure and this leads to increase in operational costs. They go on to tell us that many institutions struggle with inadequate technological infrastructure , unreliable power supply and insufficient funding. Equipment for data storage is not cheap hence most organisations cannot afford to buy it. Additionally cyber security threats and rapid technological changes complicate long-term preservation efforts and require continuous system upgrades and skilled personnel to manage evolving storage environments effectively.

In conclusion data storage is a foundational part of data curation that ensures that digital information remains secure, structured, accessible, and reusable throughout its life cycle. Through the integration of organized storage practices, secure infrastructure, preservation strategies, and adherence to international standards such as FAIR, organizations are able to safeguard the long-term value of data. This ultimately enhances research quality, promotes collaboration, and supports sustainable knowledge management across disciplines


REFERENCES 

Alemneh, D. G., & Hastings, S. K. (2020). Developing the data curation profiles toolkit. International Journal of Digital Curation, 5(1), 93–98.

Cox, A. M., Kennan, M. A., Lyon, L., & Pinfield, S. (2019). Developments in research data management in academic libraries: Towards an understanding of research dataservice maturity. Journal of the Association for Information Science and Technology, 70(9), 1–14.

Johnston, L. R., Carlson, J., Hswe, P., Hudson-Vitale, C., Imker, H., Kozlowski, W., Olendorf, R., & Stewart, C. (2018). Data curation network: How do we compare? A snapshot of six academic library institutions’ data repository and curation services. Journal of eScienc Librarianship, 7(1), 1–13.

Kim, Y., Warga, E., & Moen, W. E. (2021). Competencies required for digital curation: An analysisof job advertisements. International Journal of Digital Curation, 8(1), 66–83.

Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., Blomberg,N., & Mons, B. (2016). The FAIR guiding principles for scientific data management and stewardship. Scientific Data, 3(1), 160018.

Yakel, E., Faniel, I. M., Kriesberg, A., & Yoon, A. (2019). Trust in digital repositories. International          Journal of Digital Curation, 8(1), 143–156.

Yoon, A., & Kim, Y. (2020). Understanding requirements for research data preserva International Journal of Information Management, 49, 292–303.

 

Comments

Post a Comment

Popular posts from this blog

Data collection and repositories

DATA CURATION PRESERVATION ISSUES (THREATS TO DIGITAL MATERIALS )