Evidence-Based Pre-clinical Medicine – Alexandra Bannach-Brown Malcolm Macleod, Gregers Wegener, Hanna Vesterinen, Jing Liao Title: Understanding in vivo modelling of depression Abstract: Depression is a debilitating mental illness affecting an estimated 400 million people worldwide. Despite many clinical and preclinical investigations, the mechanisms underlying the disease are still not well understood. A systematic review of the current literature on animal models of depression would be beneficial to gain a better overview and understanding strengths and limitations of current approaches and of the models and outcome measures used. This might provide insights into factors affecting the efficiency of model induction and the efficacy of intervention. Here we outline the protocol for a systematic review and meta-analysis of preclinical studies modelling depression-like behaviours in animals. Key words: depression, animal models, pre-clinical investigation, systematic review protocol Acknowledgements: This research is funded by the University of Edinburgh Principal’s International Office as part of the Excellence in European Doctoral Training Programme. We thank members from the SLIM (Systematic Living Information Machine) consortium for their collaboration. We thank Zsanett Bahor and Sarah McCann for their comments on earlier versions of the protocol. Stage of the Project at Time of Protocol Submission: Stage of Process Preliminary searches Piloting study selection Formal screening with final search criteria Data extraction from included papers Quality assessment Data analysis Manuscript writing

Started Yes Yes No No No No No

Completed Yes No No No No No No

Background: What is already known about this disease/ model/ intervention? Why is it important to do this review? Depression is a mental illness characterised by “low mood, loss of interest and pleasure or loss of energy” (DSM-IV & ICD-10 in NICE guidelines, 2009). Depression is the leading cause of disability in the world (Marcus et al., 2012) and is currently the brain disorder with the highest financial cost in Europe (Gustavsson et al., 2011). The number of people diagnosed with depression worldwide is estimated at 400 million (WHO, 2016). Depression places a huge burden on patients, and great cost to health care systems and governments. The rate of remission with antidepressant medication is at best 70%, and may only be achieved after several levels of intervention (Rush et al., 2006; Geddes et al, 2003). Despite decades of investigation into depression, little is known about the biological mechanisms underpinning the disease (Nestler, Gould & Manji, 2002; Slattery & Cryan, 2014). With better understanding of the mechanisms causing depression the development of novel and more reliable treatments might be possible. There is solid rationale the further investigation into the mechanisms and factors that contribute to the development of depression. This is a highly important area to tackle, both from a clinical, and a preclinical perspective.

Preclinical investigations contribute significantly to the understanding of the underlying mechanisms behind depression which can, in turn, inform treatment development and increase translational success of clinical research. One example of this contribution is the investigations into the CLOCK gene’s involvement in circadian rhythms and depression (Vitaterna et al., 1994; Bunney & Bunney, 2000). This research contributed to advances in successful treatments such as light therapy for depression and seasonal affective disorder (Eastman et al. 1998; Tuunainen et al., 2004). Preclinical experiments have the ability to model and dissect important mechanisms of depression and therefore provide insights into the neurobiology of the disorder (Krishnan & Nestler, 2008). Preclinical experiments can also investigate the safety and efficacy of proposed treatments prior to exposure in human cohorts (Kieburtz & Olanow, 2007). This knowledge can subsequently aid investigations into the best way to prevent the occurrence of depression, and the best and earliest interventions, which are top research priorities recently identified by an MQ: Transforming Mental Health report (MQ: Transforming Health, 2016). Due to the sheer volume of preclinical investigations of depression, it is difficult to achieve an overview of what is already known, and to assess the marginal contribution of new research (de Vries et al., 2011). In this context, a systematic review of the existing preclinical literature could provide an unbiased, collective, overview of existing knowledge and allow the marginal contribution of new research to be assessed. It could also provide better understanding of the laboratory methods used to induce the condition, the range of outcome measures used to assess depression phenotypes, and the variables that might impact on the efficacy of different treatments (de Vries et al., 2011). The findings from this systematic review and meta-analysis may also contribute to the refinement of methods used in animal investigations of depression, reducing the distress caused to animals by substitution with equally informative methods of lower severity; and contribute to the optimisation of the numbers of animals used in depression research by informing well founded power calculations.

Objectives of this Systematic Review Specify the disease / health problem of interest: We will investigate how depression is modelled in vivo. Depression is defined as specified in the DSM-IV-R under the clinical diagnoses of “Depression”, “Major Depressive Disorder (MDD)”, “Dysthymia (Persistent Depressive Disorder)”, “Other Specified Depressive Disorder” and/or “Unspecified Depressive Disorder”. In preclinical modelling of depression, some methods induce depressive-like behaviour as a single manifestation rather than modelling the full range of features associated with the clinical diagnoses. Therefore any model that attempts to mimic one or several major symptoms of depression in an animal model will be considered. Specify the population /species studied: All preclinical studies on any animal species at any stage of development will be included. Specify the intervention/exposure: This study will investigate any mode of inducing depressive behaviour or a model that seeks to mimic the human condition or symptoms of depression using; genetic, surgical lesions, pharmacological, developmental or a combination of interventions. We will include models induced acutely, chronically, genetically, or through a combination of these methods. We will also consider experiments where the efficacy of an intervention is tested in such models. Specify the control population: Studies will be included in this review if they include a suitable control, defined as a cohort animals that have not been exposed to the method of inducing depressive-like behaviour that was used to create the depressive model. The control cohort may have received an appropriate equivalent, for example sham surgery instead of lesion or placebo without the active ingredient. For treatment

intervention studies, a suitable control is defined as a cohort of animals that have had the same exposure to model the disorder as those that are given a treatment, but has not been exposed to the treatment tested. Specify the outcome measures: Primary outcome measure: Behavioural outcome measures of animal studies inducing depressive behaviour. Secondary outcomes: Anatomical outcomes, electrophysiological outcomes, neurochemical outcomes, prevalence of reporting of measures to reduce risk of bias. Tertiary outcomes: Drug efficacy, inter-rater agreement in the application of the inclusion criteria, sample size, genomic, proteomic and metabolomic outcomes.

State your research question (based on point 7-11): 1) How are animal models of depression induced? 2) What type of outcome measures are assessed in animal models of depression? 3) How precise and accurate are the outcome measures at assessing induced behaviours? 4) To what extend are the outcomes measured in animal models relevant to the endpoints investigated in human trials? 5) How efficacious are different drug interventions in reducing observed manifestations in in vivo animal models?

Methods Search & Study Identification Identify literature databases to search: Both PubMed and Embase will be searched. Define electronic search strategies: See attached for PubMed search terms (Appendix 1) & EMBASE search terms (Appendix 2). Identify other sources for study identification: Relevant recent reviews will be identified via an additional PubMed search and the reference list will be searched for any primary research articles that were not identified with the search.

Study selection procedure Define screening phases (e.g. pre-screening based on title/abstract, full text screening, both): Pubmed and Embase search results will be downloaded to EndNote or Reference Manager 12, duplicates removed and full text of articles retrieved where available using the inbuilt feature. Screening Phase 1: Title and abstracts retrieved from PubMed and Embase will be screened. Screening Phase 2: Full text papers, concurrently with data extraction. Specify number of observers per screening phase: Method of screening: Phase 1: A machine learning approach is proposed to assist with the screening phase for inclusion and exclusion criteria. We will pilot the most promising approaches in the context of an ongoing collaboration where we are developing machine learning tools for systematic review.

Firstly, a seed set of papers will be randomly selected and screened by two independent screeners and categorised as included or excluded. Any discrepancies will be resolved by a third screener. This provides the seed set that the machine learning algorithm will be trained on. Initially, a seed set of 2000 papers, is proposed. The machine learning algorithm with the best performance will be selected and used for the classification. Machine learning approaches will be assessed on ‘recall’ and Work Saved over Sampling (WSS@95% = (True Negative + False Negative)/ Number of samples – 0.05)(Cohen et al., 2006; Howard et al., in press). If machine learning approaches do not reach the required level of performance (95% recall and WSS@95% of over 50%) the seed set will be increased to 10% of the total papers and the machine learning approaches will then be tested again. If increasing the seed set does not improve the levels of recall and WSS, our reserve approach is to use human screeners, from a group of CAMARADES (Collaborative Approach to Meta-Analysis and Review of Animal Data from Experimental Studies) screeners, and manually screen all papers. Phase 2: Two independent screeners for full text analysis and data extraction, with the aid of machine learning and text mining assistance, for example risk of bias classification. A third independent screener will resolve any discrepancies.

Study selection criteria. Define all inclusion and exclusion criteria based on: Type of study design: Inclusion criteria: Any article providing primary data of an animal model of depression or depressive behaviour with an appropriate control group (specified above). Exclusion criteria: Review article, editorials, case reports, letters or comments, conference or seminar abstracts, studies providing primary data but not appropriate control group. Type of animals/ population (e.g. age, gender, disease model): Inclusion criteria: Animals of all ages, sexes and species, where depression-like behaviours intended to mimic the human condition have been induced. Exclusion criteria: Human studies, ex vivo, in vitro or in silico studies, animals with comorbidities or where depressive-like behaviour is studied as a side-effect of another condition or intervention, in vivo experiments where a treatment is investigated but no depressive behaviour induced beforehand (e.g. some pharmacological and toxicology studies). Studies will be excluded if authors state an intention to induce or investigate only anxiety or anxious behaviour. Type of intervention: Inclusion criteria: All studies that claim to model depression in animals. Studies where depressive behaviour is induced in animals as the primary objective. Studies that induce depressive behaviour or model of depression and which also test a treatment or intervention, with no exclusion criteria based on dosage, timing or frequency. Exclusion criteria: Studies that investigate treatments or interventions but no depressive behaviour or model of depression is induced (e.g. toxicity and side-effect studies). Outcome measures: Inclusion criteria: Studies measuring behavioural, anatomical and structural, electrophysiological, histological, and/or neurochemical outcomes. And where genomic, proteomic, or metabolomic

outcomes are measured in addition to either behavioural, anatomical, electrophysiological, histological or neurochemical outcomes. Exclusion criteria: Where metabolic outcomes measures are the primary outcome measure. If genomic, proteomic, metabolic or metabolomic outcomes are the sole outcome measures in a study. Language restrictions: Inclusion criteria: all languages Exclusion criteria: none Publication date restrictions: Inclusion criteria: all publication dates Exclusion criteria: none Other: Inclusion criteria: Studies must investigate methods or models that induce depressive behaviour in vivo, or authors must claim that they investigate a model of depression. Exclusion criteria: Studies claiming to induce only anxiety behaviour or a model of anxiety. In cases where both models of anxiety and depression are investigated, the study will be included and only the depression-related data will be extracted. In the case of data duplication (two or more papers reporting the same data), the paper reporting the smallest dataset or fewest outcomes will be excluded. Order of Priority Exclusion Criteria Per Screening Phase: Selection phase 1: screening based on title and abstract 1. Article must be primary research article (excluding reviews, comments or letters). 2. Exclude studies on humans. 3. Exclude ex vivo, in vitro or in silico investigations. 4. Exclude study if no depressive behaviour or model of depression has been induced. Selection phase 2: full text screening 1. The above criteria. 2. Exclude if no appropriate outcome measured. 3. Exclude if no appropriate control group. 4. Exclude if number of animals or measure of precision is not reported. 5. Where data cannot be extracted and contacted authors have not got back in touch. 6. Exclude the study with least information in the case of multiple publicatiosn describing the same work. (Bahor et al., 2014)

Study characteristics to be extracted: Study meta-data (e.g. authors, year): The first author, corresponding author, year, title, journal name, source of funding and DOI will be extracted. Study design characteristics (e.g. experimental groups, number of animals) The number of animals in the experimental and control groups will be extracted. If the number of animals is given as a range, the most conservative estimate will be extracted.

Animal model characteristics (e.g. species, gender) The species, strain, sex, age and/or weight of animal will be extracted. Intervention characteristics (e.g. intervention, timing, duration) The intervention or method used to induce depressive-like behaviour will be extracted as well as the duration of the induction. If applicable the following information will be extracted for the intervention; the dosage of intervention given, route of delivery, mode of delivery, how long the treatment was given for. The length of time between model induction and outcome measurement will be extracted as well as the length of time between the model induction, the intervention, and outcome measurement. For studies with several interventions or methods of induction given, data from all time points will be extracted and details of each method of induction. Outcome measures: 1. Method of outcome assessment/measure (mean, SD or SEM, and number of animals per group). 2. Details of the outcome measure (e.g. the sub-type or name of the outcome measure, and e.g. in the case of food restriction, the length of time the animal was restricted for). 3. The amount of times the outcome measure was assessed or followed up for. How many times was the animal tested on the same outcome measure? 4. The number of different outcome measures the animal was tested on. 5. The category of the symptom or biomarker the outcome measure is measuring (e.g. anhedonia, sleep or weight loss, markers of oxidative stress) 6. Any measures taken before the disease model induction will be extracted. The details of the before-and-after comparison will be extracted. 7. For studies inducing depression model and investigating effect of a subsequent drug intervention, has a suitable test been selected to measure this (e.g. a behavioural test that does not rely on behaviour that will likely be affected by side-effects of the drug intervention). Other (e.g. drop-outs) The number of excluded animals will be extracted and the reason for their exclusion if noted.

Risk of bias & Study quality Define criteria to assess the internal validity of included studies (e.g. selection, performance, detection and attrition bias): An adjusted CAMARADES checklist will be used to assess risk of bias. Including the following criteria: 1. Publication in a peer reviewed journal 2. Reporting of random allocation 3. Reporting of blinding of the conduct of the experiment 4. Reporting of blinded assessment of outcome 5. Use of comorbid animals 6. Reporting of a sample size calculation 7. Reporting of compliance with animal welfare regulations 8. Reporting of a potential conflict of interest 9. Reporting of exclusions of animals 10. Whether a study protocol is available dated before the experiments began (Zwetsloot et al., 2015) We will report the median number of study checklist items scored and the interquartile range.

Collection of outcome data Methods for data extraction/retrieval (e.g. extraction from graphs, contacting authors): 1. Numerical data will be extracted from full text of publication (mean, SD or SEM and group sample size). 2. In studies where data are presented only graphically, the software Universal Desktop Ruler, or similar tool, will be used to extract the data into numerical value. For certain pdf presentations it may be possible to use data mining approaches to extract these data. 3. If any data are missing the corresponding authors will be contacted. 4. In the absence of a response from authors (we will allow 2 months to reply with a follow-up email sent after the 1st month), data will be excluded from analysis. If the screeners or extractors consider that two sources may describe the same data we will contact the authors seeking clarification. If we receive no response we will include only the most recent data source.

Data analysis/synthesis Data gathering & Combination: All data will be gathered and inserted in the CAMARADES-SyRF database. We will provide a qualitative summary along with meta-analysis. Specify how the decision as to whether a meta-analysis is appropriate will be made: Based on previous systematic reviews in models of psychiatric disorders, we expect about 10-15% of the studies to be included in the analysis. We expect high heterogeneity between studies due to the differences in the study designs; therefore meta-analysis is proposed to investigate sources of this heterogeneity.

If a meta-analysis seems feasible/sensible: Specify the effect measure to be used (e.g. mean difference, standardized mean difference, risk ratio, odds ratio) Mean, SD or SEM and group sample size will be extracted for all outcome measures for both experimental and control groups. Where a single control group serves multiple intervention groups, the size of the control group used in the meta-analysis will be adjusted by dividing it by the number of intervention groups it serves. If numbers of animals is presented in a range, the most conservative estimate will be extracted (e.g. if presented as n = 6-12 we will consider that n=6). Categorical or qualitative information relating to the outcome measures, such as the behavioural measure or the symptom the model is trying to elucidate, will be extracted in text/comment field or from drop-down menu. A decision will be made once the data has been extracted, as to which effect size is the most appropriate to use. As most outcome measures are continuous variables and outcome measures are not likely to be measured on the same scale, Normalised Mean Difference (NMD) effect sizes will be calculated where possible. This effect size calculation will be used where an appropriate ‘sham’ or ‘control’ group is present (Vesterinen et al., 2014) or where it is possible to impute the outcome in a “normal” animal. If the data are unsuitable for calculating NMD, Standardised Mean Difference (SMD) will be used. NMD and SMD will be calculated using the equations outlined in Vesterinen and colleagues (2014). Specify statistical model of analysis (e.g. random or fixed effects model):

The data extracted will in all likelihood cover many different species, ages, and gender, as well as different study designs and models of induction. Therefore, the true effect size is likely to differ between studies, and a random-effects model will be used (Borenstein et al., 2009). Statistical analyses will be performed using Stata, Statistical Software (College Station, TX: StataCorp LP). Specify statistical methods to assess heterogeneity (e.g. I2, Q): Cochran’s Q will be used for partitioning of heterogeneity; Q is used to calculate the excess variance (Q-k, where k is the degrees of freedom). A p-value can be calculated for Q, giving an indication of whether all studies share a common effect size (p < 0.05) or not (p > 0.05). I2 will be used to report heterogeneity as this describes the proportion of observed variance that reflects true differences in effect size between studies. (Bahor et al., 2014) Specify which study characteristics will be examined as potential sources of heterogeneity (subgroup analysis): Meta-regression will be used to investigate the impact of different study characteristics on the outcome, where the effect estimate (NMD or SMD) is the outcome variable. Categorical variables will be transformed into dummy variables. Meta-regression will not be conducted unless there are sufficient studies per variable. At least 10 independent comparisons per covariate investigated are required (Borenstein et al., 2009). Where there are sufficient data, a multivariate meta-regression will be used for both model induction and drug models, otherwise a univariate model will be used with Bonferroni correction based on the number of variables. Model Induction Model: Sub-groups analyses: 1. Species of animals (mice vs rats vs etc. vs all) 2. Gender of animals (male vs female vs mixed) 3. Type of animal model 4. Method of model induction (i.e. developmental, genetic, pharmacological, lesion or combination) 5. The outcome measure(s) investigated (behavioural, electrophysiological, neurochemical, anatomical) 6. Amount of times the outcome assessment was measured (once vs several) 7. The time from model induction to time of outcome assessment 8. Randomisation (yes/no) 9. Blinding: a. Allocation concealment (yes/no) b. Assessment of outcome (yes/no) 10. Source of funding (public vs industry) A separate model will be used to investigate the effect of drug intervention on outcome. Drug Model: Sub-group analyses: 1. Drug Treatment or Intervention 2. Method of model induction (i.e. developmental, genetic, pharmacological, lesion or combination) 3. The outcome measure(s) investigated (behavioural, electrophysiological, neurochemical, anatomical)

4. Time the treatment is given in relation to model induction (investigated separately per treatment, pre or post model induction) 5. Time the treatment is given in relation to time of outcome assessment 6. Source of funding (public vs industry) Sensitivity analyses will be performed to assess how missing data from study characteristics and effect size might have affected the results. This will be presented in the form of a summary table. Correction for multiplicity of testing: Where there are more than two groups being compared in a univariate model we will use HolmBonferroni correction for multiplicity of testing. Specify the method for assessment of risk of publication bias: Risk of publication bias analyses will be assessed using funnel plot assessment, p-curve analysis, and Egger’s regression. Trim and fill analysis will be used to identify potentially missing studies. Analyses will be carried out using SigmaPlot and STATA software package using the insheet and metatrim commands (StataCorp LP; SYSTAT Software Inc). Conflict of Interest: The authors declare that there are no conflicts of interest.

References: Bahor, Z., Macleod, M., Thompson, L., Sena, S., Nunes-Fonseca, C., (2014). Improving our understanding of the in vivo modelling of psychotic disorders. http://www.dcn.ed.ac.uk/camarades/files/Updated%20Protocol.pdf Borenstein, M., Hedges, L.V., Higgins, J., Rothstein, H.R. (2009) Introduction to Meta-Analysis. John Wiley & Sons, Ltd: Chichester, UK Bunney WE, Bunney BG. Molecular clock genes in man and lower animals: possible implications for circadian abnormalities in depression. Neuropsychopharmacology. 2000;22:335–345. Cohen, A. M., Hersh, W. R., Peterson, K., & Yen, P.-Y. (2006). Reducing Workload in Systematic Review Preparation Using Automated Citation Classification. Journal of the American Medical Informatics Association, 13, 2, 206-219. de Vries, R. B., Hooijmans, C. R., Tillema, A., Leenaars, M., & Ritskes-Hoitinga, M. (2011). A search filter for increasing the retrieval of animal studies in Embase. Laboratory Animals, 45, 4, 268-70. de Vries, R. B., Wever, K. E., Avey, M. T., Stephens, M. L., Sena, E. S., & Leenaars, M. (2014). The usefulness of systematic reviews of animal experiments for the design of preclinical and clinical studies. ILAR Journal,55(3), 427-437. Eastman CI, Young MA, Fogg LF, Liu L, Meaden PM. Bright light treatment of winter depression: a placebo-controlled trial. Arch Gen Psychiatry. 1998;55:883–889 Geddes, J. R., Carney, S. M., Davies, C., Furukawa, T. A., Kupfer, D. J., Frank, E., & Goodwin, G. M. (2003). Relapse prevention with antidepressant drug treatment in depressive disorders: a systematic review. Lancet (England), 361,9358, 653-61. Gustavsson, A., Svensson, M., Jacobi, F., Allgulander, C., Alonso, J., Beghi, E., Dodel, R., ... Olesen, J. (2011). Cost of disorders of the brain in Europe 2010. European Neuropsychopharmacology, 21, 10, 718-779. Howard, B.E, Phillips, J., Miller, K., Holmgren, S., Pelch, K., Walker, V., Lunn, R., Rooney, A., Macloed, M.R., Shah, R.R., Thayer, K., (2016) SWIFT-Review: A Text-mining Workbench for Systematic Review. In press. Kieburtz, K., & Olanow, C. W. (2007). Translational experimental therapeutics: The translation of laboratory-based discovery into disease-related therapy. Mount Sinai Journal of Medicine: a Journal of Translational and Personalized Medicine, 74, 1, 7-14. Krishnan, V., & Nestler, E. J. (2008). The molecular neurobiology of depression. Nature, 455, 7215, 894-902. Marcus, M., Yasamy, M. T., van Ommeren, M., Chisholm, D., & Saxena, S. (2012). Depression: A global public health concern. World Health Organisation. Retrieved 5th November, 2015 MQ: Transforming Mental Health. (2016). Depression: Asking the Right Questions. Retrieved from: http://b.3cdn.net/joinmq/10fbab8ed26626f32e_a3m6bjx2e.pdf 22/01/2016

National Institute for Health and Clinical Excellence (Great Britain). (2009). Depression in Adults: Recognition and management. London: National Institute for Health and Clinical Excellence. nice.org.uk/guidance/cg90 Nestler, E. J., Gould, E., & Manji, H. (2002). Preclinical models: status of basic research in depression. Biological Psychiatry, 52, 6, 503-528. Rush, A. J., Trivedi, M. H., Wisniewski, S. R., Nierenberg, A. A., Stewart, J. W., Warden, D., Niederehe, G., ... Fava, M. (2006). Acute and longer-term outcomes in depressed outpatients requiring one or several treatment steps: a STAR*D report. The American Journal of Psychiatry, 163, 11, 1905-17. Slattery, D. A., & Cryan, J. F. (2014). The ups and downs of modelling mood disorders in rodents. Ilar Journal / National Research Council, Institute of Laboratory Animal Resources, 55, 2, 297-309. Tuunainen A, Kripke DF, Endo T. Light therapy for non-seasonal depression. Cochrane Database Syst Rev. 2004;2:CD004050. Vitaterna MH, King DP, Chang A-M, Kornhauser JM, Lowrey PL, McDonald JD, Dove WF, Pinto LH, Turek FW, Takahashi JS. Mutagenesis and mapping of a mouse gene, Clock, essential for circadian behavior. Science. 1994;264:719–725. World Health Organisation & The World Bank. Out of the Shadows: Making Mental Health a Global Development Priority. World Bank Group/IMF Spring Meeting, April 2016, Washington D.C. Zwetsloot, P. P., Jansen, . L. S. J., Végh, A. M. D., Hout, G. P. J., Currie, G. L., Goumans, M. J., Chamuleau, S. A. J., ... Sluijter, J. P. G. (2015). Cardiac stem cell treatment in myocardial infarction: protocol for a systematic review and meta-analysis of preclinical studies. Evidence-based Preclinical Medicine, 2, 1, 10-15.

ABB-depression-Protocol-09052016.pdf

There was a problem loading this page. Retrying... ABB-depression-Protocol-09052016.pdf. ABB-depression-Protocol-09052016.pdf. Open. Extract. Open with.

651KB Sizes 4 Downloads 149 Views

Recommend Documents

No documents