What Difference Does Quantity Make? On the Epistemology of Big Data in Biology
- PMID: 25729586
- PMCID: PMC4340542
- DOI: 10.1177/2053951714534395
Abstract
Is big data science a whole new way of doing research? And what difference does data quantity make to knowledge production strategies and their outputs? I argue that the novelty of big data science does not lie in the sheer quantity of data involved, but rather in (1) the prominence and status acquired by data as commodity and recognised output, both within and outside of the scientific community; and (2) the methods, infrastructures, technologies, skills and knowledge developed to handle data. These developments generate the impression that data-intensive research is a new mode of doing science, with its own epistemology and norms. To assess this claim, one needs to consider the ways in which data are actually disseminated and used to generate knowledge. Accordingly, this paper reviews the development of sophisticated ways to disseminate, integrate and re-use data acquired on model organisms over the last three decades of work in experimental biology. I focus on online databases as prominent infrastructures set up to organise and interpret such data; and examine the wealth and diversity of expertise, resources and conceptual scaffolding that such databases draw upon. This illuminates some of the conditions under which big data need to be curated to support processes of discovery across biological subfields, which in turn highlights the difficulties caused by the lack of adequate curation for the vast majority of data in the life sciences. In closing, I reflect on the difference that data quantity is making to contemporary biology, the methodological and epistemic challenges of identifying and analysing data given these developments, and the opportunities and worries associated with big data discourse and methods.
Keywords: big data epistemology; biology; data curation; data infrastructures; data-intensive science; databases; model organisms.