Statistical tests and identifiability conditions for pooling and analyzing multisite datasets

Hao Zhou(University of Wisconsin–Madison), Vikas Singh(University of Wisconsin–Madison), Sterling C. Johnson(University of Wisconsin–Madison), Grace Wahba(University of Wisconsin–Madison), the Alzheimer’s Disease Neuroimaging Initiative, Adam Fleisher, Adrian Preda, Aimee Pierce, Akiva Mintz, Alan J. Lerner, Alexander Norbash, Allan I. Levey, Allyson Rosen, Amanda Smith, Anaztasia Ulysse, Andrew E. Budson, Andrew J. Saykin, Andrew Kertesz, Angela Oliver, Ann Marie Hake, Anna Burke, Ansgar J. Furst, Antero Sarrael, Anton P. Porsteinsson, Arthur W. Toga, Ashley Lamb, Athena Lee, Balebail Ashok Raj, Barton Lane, Beatriz Yáñez‐Rivera, Beau M. Ances, Benita Mudge, Berkeley, Betty Lind, Bojana Stefanovic, Bonnie S. Goldstein, Bonnie S. Goldstein, Borna Bonakdarpour, Brandy R. Matthews, Bret Borowski, Brian R. Ott, Brigid Reynolds, Bruce L. Miller, Bryan M. Spann, Carl Sadowsky, Chad Ward, Charles Bernick, Charles D. Smith, Charles DeCArli, Chet Mathis, Chiadi Onyike, Chris Heyn, Chris Hosein, Christi Leach, Christine M. Belden, Christopher H. van Dyck, Christopher Clark, Chuang‐Kuo Wu, Clifford R. Jack, Colleen S. Albers, Connie Brand, Courtney Bodge, Craig Nelson, Curtis Tatsuoka, Cynthia M. Carlsson, Dana Mathews, Dana Nguyen, Daniel Catalinotto, Daniel D’Agostino, Daniel Silverman, Daniel Marson, Daniel Varon, Danielle Harvey, Dariella Fernandez, David A. Wolk, David Bachman, David Bickford, David Clark, David Geldmacher, David M. Hart, David M. Holtzman, David T. Jones, David S. Knopman, David C. Perry, David Winkfield, Debra Fleischman, Del D. Miller, Denise A. Reyes, Devon Gessert, Devon Gessert, Diana Kerwin, Dick Drost, Dino Massoglia, Donna M. Simpson, Donna Munic, Douglas W. Scharre, Dzintra Celmins, Earl A. Zimmerman, Edmond Teng, Edward Coleman, Edward Zamrini, Effie Mitsis, Elaine R. Peskind, Eli Lilly, Elise Ong, Elizabeth Finger, Elizabeth Oates, Elizabeth Shaffer, Elizabeth Sosa, Ellen Woo, Emily Rogalskı, Eric C. Petrie, Eric M. Reiman, Erin Drake, Erin Franklin, Erin Householder, Evan Fletcher, Francine Parfitt, Franz Hefti, Gaby Thai, Gad A. Marshall, Gad A. Marshall, Gail Li, Gary Conrad, Geoffrey Tremont, George Bartzokis, Ging‐Yuek Robin Hsiung, Gloria Chiang, Godfrey D. Pearlson, Gregory A. Jicha, Greg Sorensen, Gustavo Jiménez, Helen Vanderswag, Hillel Grossman, Horacio Capote, Howard Bergman, Howard Chertkow, Howard Feldman, Howard Fillit, Howard J. Rosen, Howard Rosen, Hristina Koleva, Hyungsub Shim, Irina Rachinsky, J. Jay Fruehling, Jacobo Mintzer, Jacqueline Hayes, Jaimie Ziolkowski, James Brewer, James J. Lah, Jamika Singleton-Garvin, Janet S. Cellar, Jared R. Brosch, Jared Tinklenberg, Jason H. Karlawish, Javier Villanueva‐Meyer, Jeff Gunter, Jeffrey Kaye, Jeffrey M. Burns, Jeffrey R. Petrella, Jennifer Salazar, Jerome A. Yesavage, Jerome A. Yesavage, Joanne Allard, Joanne Lord, Joel Hetelle, John Brockington, John C. Morris, John Hsiao, John C. Morris, John Olichney, John Q. Trojanowki, John Rogers, Jordan Grafman, Joseph Quinn, Joseph S. Kass, Joy L. Taylor, Judith L. Heidebrink, Karen Anderson, Karen Blank, Karen Crawford, Karen Ekstam Smith, Karen L. Bell, Karl E. Friedl, Kathleen Johnson, Kathleen Tingus, Kathryn DeMarco, Kaycee M. Sink, Keith A. Johnson, Kejal Kantarci, Kelley Faber, Kelley Faber, Kelly E. Behan, Kelly Harless, Kelly M. Makino, Kelly Nudelman, Kelly Scherer, Kenneth Spicer, Kewei Chen, Ki Won Nam, Kim Martin, Kim Poki-Walker, Kimberly S. Martin, Konstantinos Arfanakis, Kris Johnson, Kristin Fargher, Kristine Lipowski, Kwangsik Nho, Kyle Womack, Laura A. Flashman, Laurel Beckett, Lawrence S. Honig, Lean Thal, Leon J. Thal, Leslie M. Shaw, Lew Kuller, Li Shen, Liana G. Apostolova, Liberty Teodoro, Lindsey Hergesheimen, Lindsey Hergesheimer, Lisa C. Silbert, Lisa Ravdin, Lisa Taylor‐Reinwald, Lon S. Schneider, Lori A. Daiello, Marek-Marsel Mesulam, M. Saleem Ismail, Magdalena Korecka, Marc Raichle, Marc Seltzer, Marek-Marsel Mesulam, María C. Carrillo, Maria Carroll, Maria Kataki, Maria T. Greig, Maria T. Greig‐Custo, Marilyn Albert, Marissa Natelson Love, Mark A. Mintun, Martin R. Farlow, Martin Sadowski, Marwan N. Sabbagh, Mary L. Creech, Mary L. Hynes, Mary Quiceno, MaryAnn Oakley, Matthew L. Senjem, Matt A. Bernstein, Mauricio Becerra, Megan Witbracht, Melanie Keltz, Melissa Lamar, Meryl A. Butters, Mia Yang, Michael Borrie, Michael Donohue, Michael Z. Lin, Michael W. Weiner, Michael W. Weiner, Michal Figurski, Michele Assaly, Michelle Rainka, Michelle Zmuda, Mike Donohue, Mimi Dang, Mohammed O. Sheikh, Mrunalini Gaikwad, Munir Chowdhury, Nadira Trncic, Nancy Johnson, Nancy Kowalksi, Nathaniel Pacini, Neil Buckholtz, Neil W. Kowall, Neill R. Graff‐Radford, Nick C. Fox, Nigel J. Cairns, Norbert Schuff, Norm Foster, Norman Relkin, Ntekim E. Oyonumo, Nunzio Pomara, Olga Brawman‐Mintzer, Olga James, Olu Ogunlana, Oscar L. Lopez, Owen Carmichael, P. Murali Doraiswamy, Parianne Fatica, Patricia Lynn Johnson, Patricia Samuels, Paul Aisen, Paul Malloy, Paul M. Thompson, Paula Ogrocki, Pauline Maillard, Peter Davies, P. Hardy, Peter J. Snyder, Peter J. Snyder, Pierre N. Tariot, Po H. Lu, Pradeep Varma, Prashanthi Vemuri, Rachelle S. Doody, Raina Carter, Raj C. Shah, Randall Griffith, Randy Yeh, Ranjan Duara, Rawan Tarawneh, Raymond Scott Turner, Raymundo Hernando, Reisa A. Sperling, Rema Raman, Richard E. Carson, R.T. Frank, Riham El Khouli, Robert Bartha, Robert A. Koeppe, Robert B. Santulli, Robert C. Green, Ronald Killiany, Ronald Petersen, Rosemarie Rodriguez, Russell H. Swerdlow, Saba Wolday, Salvador Borges‐Neto, Samuel Stark, Sandra A. Jacobson, Sandra E. Black, Sandra Harding, Sandra Weıntraub, Sanjay Asthana, Sanjeev Vaishnavi, Sara Dolen, Sara S. Mason, Sarah Kremen, Sarah Walter, Scott Herring, Scott Mackin, Scott Neu, Shannon Finley, Sherye A. Sirrel, Smita Kittur, Sonia Pawluczyk, Stacy Schneider, Stephanie Kielb, Stephanie Reeder, Stephen Correia, Stephen Pasternack, Stephen Pasternak, Stephen Salloway, Sterling C. Johnson(University of Wisconsin–Madison), Sterling C. Johnson(University of Wisconsin–Madison), Steven Chao, Steven E. Arnold, Steven Potkin, Steven M. Paul, Steven Potkin, Sungeun Kim, Susan K. Schultz, Susan Landau, Susan Rountree, Tatiana Foroud, Terence Z. Wong, Teresa Villena, Thomas C. Neylan, Thomas O. Obisesan, Tom Montine, T‐Y Lee, Valory Pavlik, Vernice Bates, Veronika Logovinsky, Vesna Sossi, Victoria Shibley, Virginia M.‐Y. Lee, Walter Martínez, William J. Jagust, William M. Brooks, William Pavlosky, William C. Potter, Yaakov Stern, Yiu Ho Au, Yuliana Cabrera, Zaven S. Khachaturian
Proceedings of the National Academy of Sciences
January 31, 2018
Cited by 38Open Access
Full Text

Abstract

When sample sizes are small, the ability to identify weak (but scientifically interesting) associations between a set of predictors and a response may be enhanced by pooling existing datasets. However, variations in acquisition methods and the distribution of participants or observations between datasets, especially due to the distributional shifts in some predictors, may obfuscate real effects when datasets are combined. We present a rigorous statistical treatment of this problem and identify conditions where we can correct the distributional shift. We also provide an algorithm for the situation where the correction is identifiable. We analyze various properties of the framework for testing model fit, constructing confidence intervals, and evaluating consistency characteristics. Our technical development is motivated by Alzheimer’s disease (AD) studies, and we present empirical results showing that our framework enables harmonizing of protein biomarkers, even when the assays across sites differ. Our contribution may, in part, mitigate a bottleneck that researchers face in clinical research when pooling smaller sized datasets and may offer benefits when the subjects of interest are difficult to recruit or when resources prohibit large single-site studies.


Related Papers

No related papers found

Powered by citation graph analysis