Examples of using Big data sources in English and their translations into Thai
{-}
-
Colloquial
-
Ecclesiastic
-
Ecclesiastic
-
Computer
Big data sources tend to have ten characteristics;
Surveys linked to big data sources section 3.6.
Big data sources can be loaded with junk and spam.
Measurement is much less likely to change behavior in big data sources.
Big data sources tend to have a number of characteristics in common;
Now, I will turn to the seven properties of big data sources that are bad for research.
Big data sources do not mean the end of survey research.
Two approaches that especially benefit from big data sources are natural experiments and matching.
Third era Non-probability sampling Computer-administered Surveys linked to big data sources.
Measurement in big data sources is much less likely to change behavior.
Population drift, usage drift, and system drift make it hard to use big data sources to study long-term trends.
Rather than thinking of big data sources as observing people in a natural setting, a more apt metaphor is observing people in a casino.
For examples of researchers expressing concern about non-representative nature of big data sources, see boyd and Crawford(2012).
Another way in which researchers can use big data sources in survey research is as a sampling frame for people with specific characteristics.
Rosenbaum(2015) and Hernán and Robins(2016) offer other advice for discovering useful comparisons within big data sources.
One of the great advantages of many big data sources are that they collect data over time.
Most big data sources are incomplete, in the sense that they don't have the information that you will want for your research.
One of the great advantages of many big data sources is that they collect data over time.
Big data sources and surveys are complements not substitutes so as the amount of big data increases, I expect that the value of surveys will increases as well.
As I'm describing these characteristics you will notice that they often arise because big data sources were not created for the purpose of research.
Rather than thinking of big data sources as observing people in a natural setting, a more apt metaphor is observing people in a casino.
For more on construct validity, see Westen and Rosenthal(2003), and for more on construct validity in big data sources, Lazer(2015) and Chapter 2 of this book.
Linking surveys to big data sources enables you to produce estimates that would be impossible with either data source individually.
Social scientists call this match construct validity and it is a major challenge with using big data sources for social research Lazer 2015.
To conclude, many big data sources are drifting because of changes in who is using them, in how they are being used, and in how the systems work.
These sources of change are sometimes interesting research questions, but these changes complicate the ability of big data sources to track long-term changes over time.
Given these ten characteristics of big data sources and the inherent limitations of even perfectly observed data, what kind of research strategies are useful?
Matching in massive data might be better than a small number of field experiments when: 1 heterogeneity in effects is important and 2 there are good observables for matching. Table 2.4 provides some other examples of how matching can be used with big data sources.
These four examples all show that a powerful strategy in the future will be to enrich big data sources, which are not created for research, with additional information that makes them more suitable for research Groves 2011.
These four examples all show that a powerful strategy in the future will be to enrich big data sources, which are not collected for research, with additional information that makes them more suitable for research Groves 2011.