Data Storage
This week we are looking at how we can store data. The world now has enormous amounts of data and it is imperative that we devise convenient means on how we can store this data.
Wilkinson et al., (2016) and Cox et al., (2019) explain that data curation in data storage refers to the structured and systematic management of digital information to ensure that it is securely preserved, well-organized and consistently accessible for future use.This is an importatnt aspect of the data curation life cycle because the data does not only need to be collected and analyzed but must also be stored in a manner that supports long-term preservation, retrieval, sharing and reuse. In this regard effective storage underscores the reliability and sustainability of digital data assets in research and institutional environments ensuring that information remains usable beyond its initial purpose, (Wilkinson et al., 2016; Cox et al., 2019).
It is importatnt to understand that data storage in data curation is not only about saving files, no. Rather it involves deliberate organization practices that enhance data management and usability. Johnston et al., (2018) and Kim et al., (2021) tell us that these practices include the use of metadata schemas, standardized file naming systems, indexing and version control mechanisms. Metadata, in particular, provides structured descriptive information that improves understanding, discoverability and context of datasets. On the other hand version control helps track changes made to datasets over time, ensuring transparency and preventing the loss of earlier data versions. Research across different disciplines has shown that weak organisational practices often lead to data fragmentation, duplication and retrieval challenges which in the long term affect research reproducibility and efficiency,(Johnston et al., 2018; Kim et al., 2021).
Data storage systems are
supported by a variety of infrastructures that differ in capacity, cost and
accessibility. These infrastructures include local storage devices, institutional servers,
cloud computing platforms and digital repositories which are maintained by universities
or research organizations. Cloud-based systems are increasingly favored due to
their flexibility, scalability and ability to support remote access and
collaboration among distributed users. Similarly, institutional repositories
play a key role in preserving scholarly outputs and ensuring that research data
remains available for long-term academic use and verification, (Yoon & Kim,
2020; Johnston et al., 2018).
In order for data storage to be effective, it is importatnt that we have secure security considerations in place at our various institutions. Stored data must be protected against risks such as unauthorized access, corruption, accidental deletion and cyber-attacks. In order to to address these risks, institutions must implement layered security mechanisms including encryption, access controls, authentication procedures and monitoring systems. Kim et al., (2021) emphasizes that backup strategies form a critical part of data protection ensuring that copies of data are available in the event of eventualities such as system failure or data loss. Research around the world has shown that reliable backup and recovery systems significantly strengthen data integrity and institutional trust in digital records, records, (Kim et al., 2021; Cox et al., 2019).
One important
dimension of data storage is its role in supporting data accessibility and
sharing. Having properly curated storage systems in place ensures that authorized users can
easily locate, retrieve, and reuse datasets. This allows to support collaborative
research and improves transparency in scientific communication. The FAIR
principles (findability, accessibility, interoperability and reuse) provides for a
widely accepted framework for guiding such practices.When data is stored following these principles, it becomes more valuable not only for its
original purpose but also for future research and innovation, (Wilkinson et al.,
2016; Yoon & Kim, 2020).
Alemneh & Hastings (2020) remind us that despite having its benefits data storage does have its challenges. Rapid growth of data generation places pressure on storage capacity and infrastructure and this leads to increase in operational costs. They go on to tell us that many institutions struggle with inadequate technological infrastructure , unreliable power supply and insufficient funding. Equipment for data storage is not cheap hence most organisations cannot afford to buy it. Additionally cyber security threats and rapid technological changes complicate long-term preservation efforts and require continuous system upgrades and skilled personnel to manage evolving storage environments effectively.
In conclusion data storage is a foundational part of data curation that ensures that digital information remains secure, structured, accessible, and reusable throughout its life cycle. Through the integration of organized storage practices, secure infrastructure, preservation strategies, and adherence to international standards such as FAIR, organizations are able to safeguard the long-term value of data. This ultimately enhances research quality, promotes collaboration, and supports sustainable knowledge management across disciplines
REFERENCES
Alemneh, D. G., & Hastings, S. K. (2020). Developing the data curation profiles toolkit. International Journal of Digital Curation, 5(1), 93–98.
Cox, A. M., Kennan, M. A., Lyon, L., & Pinfield, S. (2019). Developments in research data management in academic libraries: Towards an understanding of research dataservice maturity. Journal of the Association for Information Science and Technology, 70(9), 1–14.
Johnston, L. R., Carlson, J., Hswe, P., Hudson-Vitale, C., Imker, H., Kozlowski, W., Olendorf, R., & Stewart, C. (2018). Data curation network: How do we compare? A snapshot of six academic library institutions’ data repository and curation services. Journal of eScienc Librarianship, 7(1), 1–13.
Kim, Y., Warga, E., & Moen, W. E. (2021). Competencies required for digital curation: An analysisof job advertisements. International Journal of Digital Curation, 8(1), 66–83.
Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., Blomberg,N., & Mons, B. (2016). The FAIR guiding principles for scientific data management and stewardship. Scientific Data, 3(1), 160018.
Yakel, E., Faniel, I. M., Kriesberg, A., & Yoon, A. (2019). Trust in digital repositories. International Journal of Digital Curation, 8(1), 143–156.
Yoon, A., & Kim, Y. (2020). Understanding requirements for research data preserva International Journal of Information Management, 49, 292–303.


Beautiful summary Chisi
ReplyDeleteStoring data without offering user accessibility is meaningless.The FAIR concept shared in your blog post is so crucial to data storage.Your write-up is so precise.
ReplyDeletewonderful
ReplyDeleteWell done, Mlonyeni
ReplyDeleteWonderful
ReplyDeleteThis comment has been removed by the author.
ReplyDeleteThis a very good write up keep it up.
ReplyDeleteNice
ReplyDeleteThis is well articulated
ReplyDeleteGood job
ReplyDeleteNice one bro
ReplyDeleteNice one
ReplyDeleteData explosion is quite a challenge as rightly pinned by too much of everything is poisonous adage.
ReplyDelete