OLCreate: ORION_MOOC: Week 3 Forum

The standard view of this forum does not always work well with assistive technology. We also provide a simpler view, which still contains all features. Switch to simple view.

Collapse all posts

Sabine Barthold Post 1

• 27 August 2019, 1:17 PM • Edited by the author on 3 November 2019, 2:11 PM

Topic 1: Challenges and benefits of sharing research data

Data is most commonly thought of as the numeric measurements and results that emerge from experiments. However, data can refer to a large range of information that you might use or produce during research. If you think or your own research, what kinds of data do you produce and what are some of the challenges and opportunities with these types of data? What ethical issues might such data raise? How could other researchers benefit from it if it was accessible? Share your expereince and discuss with your peers in the forum!

Permalink to post 1

Sam Groves Post 2 in reply to 1

• 4 November 2019, 2:01 PM

I think it is interesting to consider the opportunities data sharing could have with qualitative datasets. Considering the length of time qualitative data collection can take in comparison to a lot of numerical methods, qualitative data sets are often small, limiting their usefulness. Furthermore, these datasets can be used often benefit from reanalysis by other researchers, be that with the same or different methods, considering the nature of most qualitative data analysis. I think data sharing would massively facilitate topic-specific qualitative dataset size whilst also increasing how much can be learnt from each piece of data originally, which is very exciting!

Jump to: Parent to post 1
Permalink to post 2

Joanne Bakker Post 5 in reply to 2

• 6 November 2019, 9:52 AM

I agree, it would be nice to be able to combine data from different groups, perhaps those studying the same disease/process/cell/molecule. I guess this will only be acceptable for most PIs if this would happen after the publication.

But maybe it would also help with the problem that sometimes happens: sitting on a dataset that is not really enough for a publication, and for some reason it can't be expanded by the lab/research group itself (e.g. no time/money/equipment/...) . Now it sometimes disappears in a shelf (if there is no-one that can be found to collaborate with to finish up the project), and it would be nice to upload this somewhere so interested people can pick it up and use the data for something else.

Jump to: Parent to post 2
Permalink to post 5

Ciara Lynch Post 3 in reply to 1

• 5 November 2019, 7:34 PM

I work with a lot of genetic data, and the genomes I get from collaborators is often of much higher quality than the ones freely available on PubMed or GenBank; what I mean to say is that you need to be very careful about how much you trust free data online. Just because it was published in GenBank does not mean the reads are of great quality or don't need any FastQC quality analysis. In fact, often times they are the worst quality reads in my dataset. This is partly because they are older and therefore from sequencing platforms that may not be as up-to-date. It's still a fantastic resource and I wish more people would upload the genomes they collect online. Just be sure to "clean" your downloaded genomes!

Jump to: Parent to post 1
Permalink to post 3

Rolando Trejos Saucedo Post 4 in reply to 1

• 5 November 2019, 10:57 PM

Sharing research data could be dangerous in some context. In a university, in which you work as a graduate research assistant, but also are involved as co-author in many research works, how you manage your data represents how trustworthy you are in front of your supervisors eyes.

Jump to: Parent to post 1
Permalink to post 4

Veronica Phillips Post 6 in reply to 1

• 6 November 2019, 11:17 AM

One of the challenges I've encountered around the sharing of research data is a lack of understanding among researchers as to what constitutes data. Generally, STEM researchers (and some in the social sciences) have a clear understanding of what constitutes the data underpinning their research, and are well served by institutional and funder policies about open sharing of data, research data management, and so on. However, current policies are, in my experience, unhelpful or unclear when it comes to research data for humanities researchers. It's often a real struggle for those of us working in research support to reach humanities researchers -- sometimes they don't perceive their research as involving 'data', or current policies are unhelpful when it comes to management and sharing of humanities research data (which can often just be primary and secondary literature, and their own notes).

Jump to: Parent to post 1
Permalink to post 6

Gunnar De Winter Post 7 in reply to 6

• 7 November 2019, 8:39 AM

This is a great point and it touches upon the issue of defining 'data'. Do we want/need this to be applicable to all disciplines indiscriminately, or can each (group of) discipline(s) have its own understanding of (meta)data? Also, is data necessarily quantitative (even qualitative data is often quantitatively 'encoded')? Anyway, just some question to ponder :)...

Jump to: Parent to post 6
Permalink to post 7

Niamh Arthurs Post 8 in reply to 1

• 7 November 2019, 1:52 PM

Data from the mobile health app involved in my research can be used to generate a heat map of dominant areas in a particular location that food marketing is having an influence on young people. The current data from this project is limited but further input from participants and in reaching areas further afield could strengthen the use of this data. This data could then be used by other researchers to link certain food ads with particular behaviours and lifestyles of young people in Ireland. It could also be used for ongoing research to support campaigns or public health initiatives that aim to reduce such influences and promote healthier behaviours and lifestyles in an attempt to halt the obesity epidemic.

Jump to: Parent to post 1
Permalink to post 8

Jose Hermes Lopez Prato Post 9 in reply to 1

• 9 November 2019, 5:24 PM • Edited by the author on 9 November 2019, 5:25 PM

I am currently involved in the development of a phd and, as usual, we have been generating products in different aspects. In our case, programming code, methodologies to integrate information obtained from different portals on the internet, such as uniprot, hgnc, among others, have produced knowledge bases that integrate all this information to describe a genetic regulatory network. In this case all the objects of the network, that have been considered, have associated knowledge trees that describe their identity, their biological processes, molecular functions, and at the same time, knowledge bases that model protein-protein interactions. Now, I can see how each one of these different products can be shared immediately, without having to wait for the doctorate to end. This opens a possibility that sincerely I had not matured enough and I am very happy about this philosophy in which all the different products of a research can be considered as shareable. This opens doors to those of us who are approaching for first time this Open Science initiative.

Jump to: Parent to post 1
Permalink to post 9

Mary Anderson-Glenna Post 10 in reply to 1

• 11 November 2019, 8:23 AM

I am a Research administrator and funding advisor, when means that I support researchers when writing proposals. The Research Data Management team, lead by the Library. The advice is that a DMP should be prepared in the proposal phase even if not required (e.g. in the Research Council of Norway the DMP is required in contract preparation phase). There is resistance to use time to prepare a DMP during the proposal preparation phase as there are so many other Things to prepare/Write. I Wonder how to approach this at the proposal phase and what it the normal practice in other Universities.

Jump to: Parent to post 1
Permalink to post 10

Olivia Tort Post 11 in reply to 1

• 12 November 2019, 3:37 PM

I collect plasma parameters and proteomic data from animals right now (and I relate both). A problem I have is to confirm that each "batch" of expeiments are comparable to other batches. to minimize this I include internal controls. Another issue that I often have are missing values, but this is not frequently reported in papers, while it happens in reallity.

Jump to: Parent to post 1
Permalink to post 11

Deirdre Winrow Post 12 in reply to 1

• 30 December 2019, 12:27 PM

We have had to think about this recently, regarding our research. After performing the test we have developed, we have a lot of information in the form of numeric CT values that indicate the level of methylation on certain genes, which then indicates the likelyhood that a particular patient has cancer. the clinicians who will use this data have indicated that this format is overly complicated for them and does not help them to form a diagnosis. We are exploring different ways of presenting the data so that all people who use it can understand it.

Jump to: Parent to post 1
Permalink to post 12

Week 3 Forum

My OpenLearn Create Profile

About this course

Course rewards

ORION MOOC for Open Science in the Life Sciences