Daniel Vartanian
University of São Paulo
2025-02-03
This presentation will provide an overview of the thesis objectives, main concepts, methods, and results. Together, we’ll uncover how environment factors can influence the cronotype.
We will cover the following topics:
This thesis focuses on the expression of sleep and circadian phenotypes (chronotypes), seeking to provide an answer to the following question related to human populations:
Is latitude associated with chronotype?
(Reproduced from Merrow & Roenneberg (2020) (Left) and Duffy et al. (2011, Figure 1) (Right))
(Reproduced from Flanagan et al. (2021, Figure 2))
(Adapted by the author from Roenneberg et al. (2003, Figure 7F))
Sample: \(12,884\) Brazilian participants
Method: HO Questionnaire (Horne & Östberg, 1976)
\(\Delta \ \text{Adjusted} \ \text{R}^{2} = 0.388\%\) (Cohen’s \(f^2 = 0.004143174\))
(Reproduced from Leocadio-Miguel et al. (2017, Figure 2) (Left), with a rescaled version created by the author (Right))
(Created by the author)
(Adapted by the author from Borbély (1982, Figure 4))
(Reproduced from Rede Globo (2017))
The dataset used for analysis is made up of \(65,824\) Brazilian individuals aged 18 or older, residing in the UTC-3 timezone, who completed the survey between October 15th and 21st, 2017.
(Created by the author)
The study tested whether latitude significantly improves model fit when predicting chronotype with nested models.
A restricted model (excluding latitude) with a full model (including latitude) was compared.
To ensure practical significance, a Minimum Effect Size (MES) criterion was applied, following the original Neyman-Pearson framework for hypothesis testing (Neyman & Pearson, 1928a, 1928b; Perezgonzalez, 2015).
The MES was set at Cohen’s threshold for small effects (\(f^2 = 0.02\), equivalent to \(\text{R}^2 = 0.01960784\)) (Cohen, 1988, p. 413). Thus, latitude was considered meaningful only if its inclusion accounted for at least \(1.960784\%\) of the variance in chronotype.
To balance the sample, a weighting procedure was applied to the data. The weights were calculated by cell weighting, using the sex, age group and Brazil’s state as reference.
The hypothesis can be outlined as follows:
\[ \begin{cases} \text{H}_{0}: \Delta \ \text{Adjusted} \ \text{R}^{2} \leq \text{MES} \quad \text{or} \quad \text{F-test is not significant} \ (\alpha \geq 0.05) \\ \text{H}_{a}: \Delta \ \text{Adjusted} \ \text{R}^{2} > \text{MES} \quad \text{and} \quad \text{F-test is significant} \ (\alpha < 0.05) \end{cases} \]
(Created by the author based on a data visualization from Roenneberg et al. (2019, Figure 1))
Restricted Model
(Based on Leocadio-Miguel et al. (2017))
(Adapted by the author from Pereira et al. (2017))
Full Model A
(Based on Leocadio-Miguel et al. (2017))
Full Model B
All coefficients were statistically different from zero (\(p\text{-value} < 0.05\)). Assumption checking and residual diagnostics primarily relied on visual inspection, as formal assumption tests (e.g., Anderson-Darling) are often not recommended for large samples (Shatz, 2024). All validity assumptions were met, and no serious multicollinearity was found among the predictor variables.
An ANOVA for nested models revealed a significant reduction in the residual sum of squares in both tests (A \(\text{F}(4, 65814) = 51.71\), \(p\text{-value} < 1e-05\)) (B \(\text{F}(1, 65817) = 37.325\), \(p\text{-value} < 1e-05\)).
However, similarly to Leocadio-Miguel et al. (2017), when estimating Cohen’s \(f^2\) effect size (\(f^2 = 0.004143174\)), the results were below the MES (i.e., negligible) (A \(f^2 = 0.0031428, 95\% \ \text{CI}[0, 0.012203]\)) (B \(f^2 = 0.0005671, 95\% \ \text{CI}[0, 0.0095426]\)).
(Created by the author)
(Created by the author. The color scale is bounded by the first and third quartiles.)
(Created by the author)
Large samples and sensitivity
Is a difference of \(0.00001\) valid?
Statistical ritual versus Statistical thinking
Confirmation bias
Comparison of a 95% of confidence level (\(\alpha = 0.05\)) and an n-dependent p-value curve. The parameter \(n_{\alpha}\) represents the minimum sample size to detect statistically significant differences among compared groups. The parameter \(n_{\gamma}\) represents the convergence point of the p-value curve. When the p-value curve expresses practical differences, the area under the red curve (\(A_{p(n)}\)) is smaller than the area under the constant function \(\alpha = 0.05\) (\(A_{\alpha = 0.05}\)) when it is evaluated between \(0\) and \(n_{\gamma}\).
(Chart reproduced from Gómez-de-Mariscal et al. (2021, Figure 3))
SMALL EFFECT SIZE: \(f^2 = .02\). Translated into \(\text{R}^{2}\) or partial \(\text{R}^{2}\) for Case 1, this gives \(.02 / (1 + .02) = .0196\). We thus define a small effect as one that accounts for 2% of the \(\text{Y}\) variance (in contrast with 1% for \(r\)), and translate to an \(\text{R} = \sqrt{0196} = .14\) (compared to .10 for \(r\)). This is a modest enough amount, just barely escaping triviality and (alas!) all too frequently in practice represents the true order of magnitude of the effect being tested. (p. 413)
[…] in many circumstances, all that is intended by “proving” the null hypothesis is that the ES [Effect Size] is not necessarily zero but small enough to be negligible […]. (p. 461)
(Quotes reproduced from Cohen (1988). Photo by an unknown author.)
(Artwork by Virpi Oinonen)
I suggest that it is the aim of science to find satisfactory explanations, of whatever strikes us as being in need of explanation.
This study, using what is arguably one of the largest datasets on chronotype, found no evidence supporting the latitude hypothesis in humans.
(Quote reproduced from Popper (1972/1979, p. 193). Photo by an unknown author.)
This thesis was presented to the School of Arts, Sciences and Humanities (EACH) at the University of São Paulo (USP) as a requirement for the degree of Master of Science by the Graduate Program in Complex Systems Modeling (PPGSCX).
I am deeply grateful to my advisor, Prof. Dr. Camilo Rodrigues Neto, for introducing me to complexity science in 2012, guiding my dissertation with patience, and demonstrating remarkable integrity in navigating a challenging supervisory transition.
The presentation was created using the Quarto publishing system. All analyses presented are fully reproducible and were conducted using the R programming language.
To explore the code and repository for this thesis, click here. The research compendium is also accessible via The Open Science Framework by clicking here.
Financial support was provided by the Coordination for the Improvement of Higher Education Personnel (CAPES) (Grant number: 88887.703720/2022-00).
In accordance with the American Psychological Association (APA) Style, 7th edition.
(Reproduced from Nobel Prize Outreach AB (n.d.))
(Adapted by the author from Kuhlman et al. (2018, Figure 2B))
Stable macroscopic patterns arising from local interaction of agents (Epstein, 1999).
When the aggregate exhibits properties not attained by summation (Holland, 2014).
(Artworks by Kurzgesagt – In a Nutshell, Quanta Magazine, and Journey to the Microcosmos)
(Reproduced from Lewin (1993, Figure 1))
An aggregate behavior emerges from the interactions of the parts (CAS) (Holland, 2012).
(Reproduced from Holland (2012, Figure 1.1))
(Artwork by an unknown author)
(Reproduced from The Worldwide Experimental Platform (n.d.))
(Created by the author)
Brazil
Europe
(Created by the author (Left) and reproduced from Roenneberg et al. (2019, Figure 1) (Right))
Brazil
Europe
(Created by the author (Left) and reproduced from Roenneberg et al. (2007, Figure 4) (Right))
(Created by the author, based on a data visualization from Roenneberg et al. (2007, Right Figure))
(Created by the author. The color scale is bounded by the first and third quartiles.)
(Created by the author)
(Reproduced from Cohen (1992))
WorldClim 2.1 in NetLogo (NetLogo/Scala-Java).
(Reproduced from Vartanian et al. (2025, Unpublished))
The activity can be represented by a general schema of problem- solving by the method of imaginative conjectures and criticism, or, as I have often called it, by the method of conjecture and refutation. The schema (in its simplest form) is this:
flowchart LR A(P1) --> B(TT) B --> C(EE) C --> D(P2)
flowchart LR A(P) --> B(TT) B --> C(EE) C --> A
(Reproduced from Popper (1972/1979, p. 164))
Here \(\text{P}_1\), is the problem from which we start, \(\text{TT}\) (the ‘tentative theory’) is the imaginative conjectural solution which we first reach, for example our first tentative interpretation. \(\text{EE}\) (‘error- elimination’) consists of a severe critical examination of our conjecture, our tentative interpretation: it consists, for example, of the critical use of documentary evidence and, if we have at this early stage more than one conjecture at our disposal, it will also consist of a critical discussion and comparative evaluation of the competing conjectures. \(\text{P}_2\) is the problem situation as it emerges from our first critical attempt to solve our problems. It leads up to our second attempt (and so on).
flowchart LR A(P1) --> B(TT) B --> C(EE) C --> D(P2)
(Reproduced from Popper (1972/1979, p. 164))
The history of ideas teaches us very clearly that ideas emerge in logical or, if the term is preferred, in dialectical contexts. My various schemata such as
flowchart LR A(P1) --> B(TT) B --> C(EE) C --> D(P2)
may indeed be looked upon as improvements and rationalizations of the Hegelian dialectical schema: they are rationalizations because they operate entirely within the classical logical organon of rational criticism, which is based upon the so-called law of contradiction; that is to say, upon the demand that contradictions, whenever we discover them, must be eliminated. Critical error-elimination on the scientific level proceeds by way of a conscious search for contradictions.
(Reproduced from Popper (1972/1979, p. 297))
Notre activité intellectuelle est suffisamment excitée par le pur espoir de découvrir les lois des phénomènes, par le simple désir de confirmer ou d’infirmer une théorie. […] la philosophie positive est le véritable état définitif de l’intelligence humaine […] (Comte, 1892, pp. 9–10).
Why post-positivist?
But I shall certainly admit a system as empirical or scientific only if it is capable of being tested by experience. These considerations suggest that not the verifiability but the falsifiability of a system is to be taken as a criterion of demarcation. In other words: I shall not require of a scientific system that it shall be capable of being singled out, once and for all, in a positive sense; but I shall require that its logical form shall be such that it can be singled out, by means of empirical tests, in a negative sense: it must be possible for an empirical scientific system to be refuted by experience.
(Reproduced from Popper (1934/2005))
Everybody knows nowadays that logical positivism is dead. But nobody seems to suspect that there may be a question to be asked here—the question “Who is responsible?” or, rather, the question “Who has done it?”. (Passmore’s excellent historical article does not raise this question.) I fear that I must admit responsibility.
(Reproduced from Popper (1974/2005))
(Reproduced from Popper (1963/2002))
(Reproduced from Popper (1963/2002))
(Reproduced from Popper (1963/2002))
(Reproduced from Popper (1963/2002))
One can sum up all this by saying that the criterion of the scientific status of a theory is its falsifiability, or refutability, or testability.
(Reproduced from Popper (1963/2002))