• Nie Znaleziono Wyników

Publication bias in the neighbourhood effects literature

N/A
N/A
Protected

Academic year: 2021

Share "Publication bias in the neighbourhood effects literature"

Copied!
8
0
0

Pełen tekst

(1)

This is the author's Post-print version (final draft post-refereeing as accepted for publication by the journal). The definitive, peer-reviewed and edited version of this article is published as: Nieuwenhuis, Jaap (2016) Publication bias in the neighbourhood effects literature. Geoforum 70, p 89-92.

DOI: http://dx.doi.org/10.1016/j.geoforum.2016.02.017

Publication bias in the neighbourhood effects

literature

Jaap Nieuwenhuis

OTB – Research for the Built Environment, Faculty of Architecture and the Built Environment, Delft University of Technology, P.O. Box 5043, 2600GA Delft, The Netherlands

Abstract

There is a strong belief amongst policy makers regarding the existence of neighbourhood effects, i.e., that above and beyond other influence, the neighbourhood in which they live influences individuals’ life chances. The evidence from scientific reviews leans towards the idea that it is likely that neighbourhoods indeed affect life chances of their residents. Yet, amongst scientists there is no consensus about the neighbourhood effects hypothesis because of differences in methodological and theoretical approaches. I ask why, when there is no consensus, the evidence leans towards supporting the neighbourhood effects hypothesis within policy circles. To test whether the scientific evidence for the neighbourhood effects hypothesis is driven by publication bias, I employ a meta-analysis over all studies that test for a relation between neighbourhood characteristics and educational outcomes. The results show that around the arbitrary boundary for statistical significance (p-value is less than 0.05), there is an unlikely large share of studies that are significant compared to the share of studies that are insignificant. This finding strongly suggests that there is publication bias in the neighbourhood effects literature.

Introduction

Since the publication of Wilson’s (1987) The truly disadvantaged, there has been a continuing interest in neighbourhood effects. The neighbourhood effects literature revolves around the hypothesis that, independent of possible confounders, the neighbourhood exerts an influence on the life chances of individuals living in that neighbourhood. Reviews reveal that most evidence in this literature shows that neighbourhoods do have an influence on individual life chances, and that this effect is mostly in a positive direction (e.g., the richer the neighbourhood, the better residents’ individual outcomes and vice versa; see e.g., Dietz, 2002; Durlauf, 2004; Ellen & Turner, 1997; Galster, 2002; Nieuwenhuis & Hooimeijer, 2015). The idea that neighbourhood characteristics can exert a (positive) influence on neighbourhood residents’ lives not only exists within the scientific community, but also amongst policy makers, where the belief appears even stronger. Amongst policy makers, the

(2)

belief that mixing neighbourhoods based on income levels will benefit the neighbourhoods’ residents, has led to many changes in the housing stocks in mainly poor neighbourhoods (Manley et al., 2013; van Ham & Manley, 2012). Neighbourhood mix policies are based on the idea that, when people from different income groups live side by side, the poor will benefit from the rich, e.g., by expanding their social networks or by having more positive role models in the neighbourhood (Musterd & Andersson 2005). Such policies are based on the often contested idea that people from different socio-economic backgrounds will start interacting just by living in close proximity of one another (Atkinson & Kintrea, 2000; Blokland & van Eijk, 2010; Kleinhans, 2004). However, in the end there is no hard proof that people actually behave like that.

There is also a community of researchers who are more sceptical about neighbourhood effects, and argue that we should study why people live in certain neighbourhoods in the first place (see e.g., Slater, 2013). The oft-heard critique is that the neighbourhood effects literature is plagued by selection bias (e.g., Hedman & Galster, 2013; van Ham & Manley, 2012). What is argued is that neighbourhoods are not random selections of households or individuals, but that people select themselves into neighbourhoods based on their preferences and constraints. When this notion is not properly taken into account into statistical modelling, a found neighbourhood effect may be an artefact that is produced by the unmeasured variation that exists in people selecting into neighbourhoods, which is then correlated with the variation in neighbourhoods, but which is also correlated with the outcome that is studied. When not properly controlling on how people select into neighbourhoods, models that test for neighbourhood effects may find spurious effects that are produced by incorrect modelling strategies.

Even though there is no clear consensus amongst scientist about neighbourhood effects, still most evidence leans towards the idea that neighbourhood effects exist. This may be caused by widely spread problem of publication bias (Ioannidis, 2005). Publication bias is a bias in research results that is caused by systematically publishing significant results in favour of insignificant results. The reasoning behind publication bias is twofold. First journals might be more likely to publish significant results (Dwan et al., 2013), perhaps because it is more interesting to be able to publish a finding that validates a new idea, than to publish a finding that is not able to verify a new idea. And second, when researchers are faced with competitive academic cultures where they are evaluated based on their number of publications; in order to get published, they might be more likely to try many different model specifications in order to find significant results (Fanelli, 2010a). Obviously, both mechanisms are related to each other, and both cause an overrepresentation of significant versus insignificant results. Papers with insignificant results may never be published, or models may be tweaked long enough until interesting (read: significant) results are found. It is not difficult to see how these mechanisms lead to an abundance of significant results, and a non-representatively small pool of studies with insignificant results.

I studied the notion of publication bias in the neighbourhood effects literature, and to be more specific in the entire body of literature that studied neighbourhood effects on individual educational outcomes. I used a meta-analytical approach to study 88 studies to answer the question whether publication bias is apparent in the neighbourhood effects literature.

(3)

Method Data

I used the data from an existing meta-analysis by Nieuwenhuis and Hooimeijer (2015). The data are based on a search of Scopus on October 2011. The search was done over the title, abstract and keywords of the studies in the Scopus database, and encompassed two themes: ‘neighbourhoods’ and ‘education’. The ‘neighbourhood’ theme included: neighb*rhood or “community characteristic*” or “residen* characteristic*” or “environment* characteristic*” or “context* characteristic*”. And the theme ‘education’ included: education* or school or grade* or drop*out or “drop out” or academic*. The asterisk symbol is used to allow for every variant of a search term. Filters were used to limit the search results to social scientific studies from peer-reviewed journals. The search result was 5,516 papers.

The relevant studies were identified in a two-step process. First, after examining the titles and abstracts, 244 potential candidate studies remained. In the second step, I conducted a full-text review of the 244 remaining studies. The studies were included if they met the following criteria: 1) ‘educational achievement’ on an individual level was the dependent variable; 2) the independent variables contained at least one neighbourhood characteristic; 3) (non-experimental) multivariate analysis was used; 4) a ‘neighbourhood’ was defined as “the neighbourhood in which the respondent lives/lived” rather than as the area around the school that the respondent attends; 5) the sample used does not consist of pre-schoolers; 6) the sample was from a developed country; 7) the study used recent data, defined as data from 1960 to the present; and 8) the article provided information to obtain the coefficient and standard error. In the final meta-analysis, 88 studies were included that met all of the inclusion criteria. Because many studies contain analyses over subgroups (e.g., men/women or blacks/whites), the final N is larger.1

In the final set of studies, ‘education outcome’ could be defined as: high school graduation; high school dropout; grades/test scores; school performance (including teacher assessments); grade retention; years of education; highest achieved education; college attendance; and college graduation. The recorded neighbourhood characteristics were defined within four broad categories: neighbourhood poverty (e.g., share of low income families); the educational climate (e.g., the share of high school dropouts); proportion of migrant/ethnic groups; and social disorganisation (e.g., crime rates or low social cohesion).

For this study on publication bias, I recorded three characteristics from the included studies: the unstandardised coefficients, standard errors, and p-values of the independent neighbourhood variables. Where no exact p-values were provided, I calculated these.

Analyses

To study publication bias in the neighbourhood effects literature, I used a series of caliper tests (Gerber & Malhotra, 2008a). The idea behind the caliper test is that the 5% significance standard (or z = 1.96) used in scientific research is an arbitrary boundary between significance and insignificance. Under the assumption that the published articles are a

1For more details about the data (e.g., a list of all included studies), I would like to direct the

(4)

random draw from a sampling distribution; when looking at equal-sized intervals just above and below the significance threshold, the chance for a result to be just above or below is equal. So when the number of studies just above the threshold is much larger than just below the threshold, this is an indication for publication bias.

Results

To examine the current consensus in research of neighbourhood effect on education outcomes, I start by looking at how many positive, negative, and insignificant neighbourhood effects are found in the literature and how big the average standardised point estimates are for these categories (Table 1).

Table 1: Average standardised point estimates of the results of neighbourhood disadvantage on individual educational outcomes.

Freq. Perc. Weighted point est.a Negative effect 130 42.76% -3.9539 No (significant) effect 139 45.72% -.2514 Positive effect 35 11.51% .7792 Total results 304 100% -1.7160

a Point estimates are standardised using the inverse of the standard error (1/s.e.)

In Table 1 there is a similar frequency of negative neighbourhood effects and results that find no effect. In order to examine whether the number of studies that find a negative effect is caused by publication bias, I look at the frequencies of studies just above and below the arbitrary significance threshold of 5% (or z = 1.96). There is no statistical reason to expect a substantially larger amount of studies just above the threshold than just below. So when I do find substantially more studies above than below the threshold, it would mean that the 5% significance level is affecting what is being published (Gerber & Malhotra, 2008a). When looking at the histogram plotting the frequency distribution of z-values (Figure 1), the frequency of z-values just under the critical value of 1.96 (equivalent to 5% significance) is about twice as small as the frequency just over 1.96. To test the likelihood of this happening, a caliper test (Gerber & Malhotra, 2008a) with bandwidths of 3, 5, 10, 15, and 20% over and under caliper is employed (Table 2).

(5)

Figure 1: Frequency distribution of z-values.

Note: The width of the bars is .2. Table 2: Caliper tests.

Over caliper Under caliper p-valuea 3% caliper 10 2 .019 5% caliper 13 5 .048 10% caliper 19 9 .044 15% caliper 27 13 .019 20% caliper 28 15 .033

Note: Over caliper includes the results that are between 0 and X% greater than the 1.96 z-value. Under caliper results are between 0 and X% less than z=1.96.

a Based on a probability distribution function.

The results of the caliper tests (Table 2) show that, independent of the chosen bandwidth, the distributions of result just under and just over caliper are unlikely to happen. The caliper tests show a high likelihood for publication bias in the neighbourhood effects literature due to the significance threshold.

Discussion and conclusion

The results indicate clear evidence for publication bias due to arbitrary significance levels in the neighbourhood effects literature. The reasons behind the discrepancy between the number of studies just above and just below the significance threshold could lie in two sources: the journals and the researchers. Journals may have a tendency to be more favourable towards studies with significant results (Dwan et al., 2013), perhaps because significant results are

0 5 10 15 20 F re q u e n cy .16 1.06 1.96 2.86 3.76 4.66 5.56 6.46 7.36 8.26 z-value (2-tailed)

(6)

more interesting than studies that fail to support their hypothesis. Furthermore, due to the competitive environment in academics (“publish or perish”), researcher are faced with a pressure to publish, and may therefore have an incentive to produce significant results (Fanelli, 2010a). This can be achieved for example by additional data sampling when a result is almost significant. Or by trying additional model specifications until a significant result is found. Studies that in the end fail to achieve statistically significant findings, may end up in the file-drawer and never be published.

The fact that the effects from the caliper test are not very large may first be caused by the small sample size. However secondly, it may also be related to the lack of consensus within the neighbourhood effects literature. It has been argued that in scientific fields that are younger or deal with highly complex phenomena, such as the field of neighbourhood effects, theoretical and methodological consensus is more difficultly reached. When there is no consensus on how to collect, analyse, and interpret data, it was shown that the publication of significant results is more likely, compared to older scientific fields that have a stronger theoretical and methodological consensus (Fanelli, 2010b).

In this study I focussed on a sample of studies that had related neighbourhoods to educational outcomes. However, it is very likely that the findings also hold up for studies of neighbourhood effects on any other outcome variable. The mechanisms behind the results are in no way related the studied dependent and independent variables. As was shown by Gerber and Malhotra, the results hold for very diverse studies in social science and political science (Gerber & Malhotra, 2008a, 2008b). Therefore, the findings could be interesting for all researchers of neighbourhood effects.

For researchers of neighbourhood effects and policy makers concerned with housing, the results should be considered as a critical note to the existing hypothesis that neighbourhoods influence the chances in life of individual residents. Due to publication bias, the evidence for this hypothesis may be based on thin grounds. This result shows that the neighbourhood effects hypothesis should be studied with even more caution than is already done. Moreover, the field would benefit from larger sample sizes and more sophisticated modelling strategies to come to more reliable outcomes. Finally, p-values were never intended to have a significance/insignificance threshold, but were intended to show the strength of the evidence (Sterne & Smith, 2001). If the arbitrary significance threshold of 5% would be abandoned and results would be presented with exact p-values, there would be no incentive for journals or researcher to favour results just over the threshold, over results just under the threshold. Under that scenario, publication bias due to significance thresholds would not be an issue, and “insignificant” results could be judged on their scientific value instead of their p-value.

Acknowledgements

I would like to thank Jarno Hoekman for his significant contribution to the idea behind this paper and his helpful comments on various draft versions. The research leading to these results has received funding from the European Research Council under the European Union's Seventh Framework Programme (FP/2007-2013) / ERC Grant Agreement n. 615159 (ERC Consolidator Grant DEPRIVEDHOODS, Socio-spatial inequality, deprived neighbourhoods, and neighbourhood effects).

(7)

References

Atkinson, R. & Kintrea, K. (2000). Owner-occupation, social mix and neighbourhood impacts. Policy and Politics, 28(1), 93-108.

Blokland, T. & van Eijk, G. (2010). Do people who like diversity practice diversity in neighbourhood life? Journal of Ethnic and Migration Studies, 36(2), 313-332. doi: 10.1080/13691830903387436

Dietz, R. D. (2002). The estimation of neighborhood effects in the social sciences: An interdisciplinary approach. Social Science Research, 31, 539-575.

Durlauf, S. N. (2004). Neighbourhood effects. In Handbook of regional and urban economics

Volume 4, Cities and Geography, edited by Henderson, J.V. & Thisse, J.F.

Amsterdam: Elsevier.

Dwan, K, Gamble, C, Williamson, P.R., Kirkham, J.J., for the Reporting Bias Group (2013). Systematic review of the empirical evidence of study publication bias and outcome reporting bias – An updated review. PLoS ONE, 8(7), e66844. doi:10.1371/journal.pone.0066844

Ellen, I.G. & Turner, M.A. (1997). Does neighbourhood matter? Assessing recent evidence.

Housing Policy Debate, 8(4), 833–866.

Fanelli, D. (2010a). Do pressures to publish increase scientists’ bias? An empirical support from US states data. PLoS One, 5(4), e10271. doi:10.1371/journal.pone.0010271 Fanelli, D. (2010b). “Positive” results increase down the hierarchy of the sciences. PLoS One,

5(4), e10068. doi:10.1371/journal.pone.0010068.

Galster, G. (2002). An economic efficiency analysis of deconcentrating poverty populations.

Journal of Housing Economics, 11(4), 303-329.

Gerber, A. S. & Malhotra, N. (2008a). Publication bias in empirical sociological research: Do arbitrary significance levels distort published results? Sociological Methods &

Research, 37(1), 3-30.

Gerber, A. & Malhotra, N. (2008b). Do statistical reporting standards affect what is published? Publication bias in two leading political science journals. Quarterly

Journal of Political Science, 3, 313-326.

Hedman, L. & Galster, G. (2013). Neighbourhood income sorting and the effects of neighbourhood income mix on income: A holistic empirical exploration. Urban

Studies, 50(1), 107-127.

Ioannidis, J. P. A. (2005). Why most published research findings are false. PLoS Medicine, 2(8), e124. doi:10.1371/journal.pmed.0020124

Kleinhans, R. (2004). Social implications of housing diversification in urban renewal: a review of recent literature. Journal of Housing and the Built Environment, 19(4), 367-90.

Manley, D., van Ham, M., Bailey, N., Simpson, L. & Maclennan, D. (Eds). Neighbourhood

effects or neighbourhood based problems? A policy brief. Dordrecht: Springer.

Musterd, S. & Andersson, R. (2005). Housing mix, social mix and social opportunities.

(8)

Nieuwenhuis, J. & Hooimeijer, P. (2015). The association between neighbourhoods and educational achievement, a systematic review and meta-analysis. Journal of Housing

and the Built Environment. Online First.

Slater. T. (2013) Capitalist urbanisation affects your life chances: exorcising the ghosts of ‘neighbourhood effects’. In: Manley, D., van Ham, M., Bailey, N., Simpson, L. & Maclennan, D. (Eds). Neighbourhood effects or neighbourhood based problems? A

policy brief. Dordrecht: Springer, pp. 113-132.

Sterne, J. A. C. & Smith, G. D. (2001). Sifting the evidence–what’s wrong with significance tests? BMJ, 332, 226-231.

van Ham, M. & Manley, D. (2012). Neighbourhood effects research at a crossroads. Ten challenges for future research. Environment and Planning A, 44, 2787-2793.

Wilson, W. J. (1987). The truly disadvantaged. The inner city, the underclass, and public

Cytaty

Powiązane dokumenty

Policies and regulations regarding the applications of agricultural biotechnology play an important role in facilitating commercial release of agricultural crops with improved

The ship heave response spectrum may thus be obtained directly from this eqression, but a more subtle approach is required to obtain the water cOlumn oscillation response spectrum

Nic udało się więc sprowadzić formy sonetu do jej istoty, która pozostaje nieuchwytna, choć przecież został przywołany schemat łatwo rozpoznawalny w swym ogólnym zarysie

Mimo tak niepomyślnych losów, jakie stały się udziałem prac bibljograficznych Załuskiego, musimy przyznać im w dzie­ jach naszej bibljografji wielkie znaczenie:

moments and quantiles of the empirical distribution, so they are estimators of the corresponding theoretical values. sample mean = estimator of the expected

Due to the suspension of the publication of the journal Antiquitates Mathematicae in 2011, the materials from subsequent scientific confer- ences in history of mathematics

polski 13 nie odnotowuje żadnej formy dativu ani singularis, ani pluralis tego rzeczownika. W Indeksie do wielkopolskich rot sądowych z XIV i XV wieku 14 analizowany przykład jest

In addition, our estimates suggest that most of correlation that is observed between the UI and DI enrolment rates can be explained by substitution effects, and not by