[Washington Statistical Society]
[WSS home] [WSS Newsletter] [WSS Information] [Seminars] [Short Courses] [Employment] [Feedback] [Join WSS!]

Washington Statistical Society Seminars

Current | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 | 1999 | 1998 | 1997 | 1996 | 1995 | Methodology

January 2001
5
Fri.
Multipoint Linkage Analysis Using Affected Sib Pairs: Incorporating Linkage Evidence from Unlinked Regions
9
Tues.
Hierarchical Bayesian Model with Nonresponse
16
Tues.
An Integrated Framework for Database Privacy Protection
17
Wed.
Meeting the Sampling Needs of Business
31
Wed.
Data on the Racial Profiling of Travelers
February 2001
5
Mon.
Federal Statistics and Statistical Ethics: The Role of the ASA's "Ethical Guidelines for Statistical Practice"
8
Thur.
Evaluation of the Brady Handgun Violence Prevention Act
13
Tues.
Statistical Aspects of Neural Networks
14
Wed.
WSS Seminar & Julius Shiskin Award Presentation
Bias in Aggregate Productivity Trends Revisited
22
Thur.
A Prototype Data Dissemination System for the 2002 Census of Agriculture
March 2001
8
Thur.
Small-Area Poverty Estimates and Public Policy: Looking to the Future
21
Wed.
The Role of Questionnaire Design in Medicaid Estimates: Results from an Experiment
22
Thurs.
Mass Imputation of Agricultural Economic Data Missing by Design-A Simulation Study of Two Regression Based Techniques
30
Fri.
Oatmeal Cookies, Weather Modification, and Organ Failure: The Art of Combining Results from Independent Statistical Studies
April 2001
4
Wed.
NIST Engineering Division Symposium: Gibbs, MCMC, and Importance Sampling
12
Thur.
Interviewer Refusal Aversion Training to Increase Survey Participation
12
Thur.
Effect of Using Priority Mail on Response Rates in a Panel Survey
12
Thur.
New Standards for a New Decade: The Standards for Defining Metropolitan and Micropolitan Statistical Areas
May 2001
2
Wed.
Modern Regression Methods: Differences and Similarities
24
Thur.
Producing Small Area Estimates from National Surveys: Methods for Minimizing Use of Indirect Estimators
30
Wed.
Data Mining in Classification and Cluster Analysis
June 2001
4
Mon.
The 2001 Roger Herriot Award For Innovation In Federal Statistics
6
Wed.
New GAO Report on Research Record Linkage
7
Thur.
Interagency Activities to Address Nonresponse in Household Surveys (part 2)
11
Wed.
WSS President's Day
Sam Greenhouse Memorial Symposium
The Funding Opportunity In Survey Research

25
Mon.
Spatial Modeling of Age, Period and Cohort Effects
27
Wed.
The Federal Reserve Board's 1998 Survey of Small Business Finances: Methodological Issues and Cost Considerations
July 2001
12
Thur.
Revising Statistical Standards: An Exercise in Quality Improvement
16
Mon.
WSS Seminar and Julius Shiskin Award Presentation
Time Series Decomposition and Seasonal Adjustment
17
Tues.
The Data Web Project: Confidentiality Issues
26
Thur.
The Statistical Power of National Data to Evaluate Welfare Reform
August 2001
1
Thur.
Converting Your Technical Paper into the Best Presentation of Your Life!
September 2001
13
Thur.
Evaluating Welfare Reform in an Era of Transition Report of the Committee on National Statistics Panel on Data and Methods for Measuring the Effects of Changes in Social Welfare Programs
13
Thur.
New Concepts in Test Equating and Linking
28
Fri.
A new alternative to Bayes factors: the resolution of Lindley's paradox through the posterior distribution of the likelihood ratio
October 2001
17
Wed.
Informing America's Policy on Illegal Drugs: What We Don't Know Keeps Hurting Us. Report of the Committee on Data and Research for Policy on Illegal Drugs
23
Thur.
Likelihood Analysis Of Neural Network Models
November 2001
1
Thur.
University of Maryland
Statistics Program, Department of Mathematics Seminar
The Fisher Information on a Location Parameter Under Additive and Multiplicative Perturbations
2
Fri.
The George Washington University
Department of Statistics Seminar
A New Alternative To Bayes Factors: The Resolution Of Lindley's Paradox Through The Posterior Distribution Of The Likelihood Ratio
7
Wed.
Why Mathematics Is Needed to Understand Disease-Gene Associations
7
Wed.
The Empirical Role of Young Versus Old Gestational Ages in the Abortion Debate
8
Thur.
University of Maryland
Statistics Program, Department of Mathematics Seminar
Multidimensional Time Markov Processes
9
Fri.
The George Washington University
Department of Statistics Seminar
Modified Maximum Likelihood Estimators Based on Ranked Set Samples
13
Tues.
Evaluation of Score Functions to Aid in the 2002 Census of Agriculture Review Process
13
Tues.
THE MORRIS HANSEN LECTURE
Election Night Estimation
15
Thur.
University of Maryland
Statistics Program, Department of Mathematics Seminar
Clinical Outcome Prediction in Diffuse Large Bcell Lymphoma via Micro array
20
Tues.
U.S. Bureau Of Census
Statistical Research Division Seminar
From Single-Race Reporting to Multiple-Race Reporting: Using Imputation Methods to Bridge the Transition
27
Tues.
U.S. Bureau Of Census
Statistical Research Division Seminar
An Exploratory Data Analysis Retrospective
28
Wed.
U.S. Bureau Of Census
Statistical Research Division Seminar
Residential Mobility and Census Coverage
29
Thur.
The George Washington University
Department of Statistics Seminar
Urban Heat Island Effect in the Greater Washington Metropolitan Area
December 2001
19
Wed.
University of Maryland
Statistics Program, Department of Mathematics Seminar
Bioinformatics for HIV Genomics
19
Wed.
Outlier Selection for RegARIMA Models



WSS Home | Newsletter | WSS Info | Seminars | Courses | Employment | Feedback | Join!


Title: Multipoint Linkage Analysis Using Affected Sib Pairs: Incorporating Linkage Evidence from Unlinked Regions



Abstract:

Credentials that are outside the traditional postsecondary educational attainment We will start the talk with a brief discussion on some central questions in genetic epidemiological research. We will then focus on one of the main questions concerning localizing susceptibility genes. We present a multipoint linkage method to assess evidence of linkage to one region by incorporating linkage evidence from other regions. This method is especially useful for complex diseases such as asthma, diabetes and psychiatric disorders in which the effect of each gene is likely to be small or modest. Our approach uses affected sib pairs in which the number of alleles shared identical by descent is the primary statistic. The method proposed uses data from all available families to simultaneously test the hypothesis of statistical interaction between regions and to estimate the location of the susceptibility gene in the target region. As an illustration, we have applied this method to an asthma sib pair study (Wjst et al., 1999, Genomics) which earlier reported evidence of linkage to chromosome 6 but showed no evidence for chromosome 20. Our results yield strong evidence of linkage to chromosome 20 after incorporating linkage information from chromosome 6. In addition, it estimates with 95% certainty that the map location of the susceptibility gene is flanked by markers D20S186 and D20S101, which are approximately 16.3 CM apart.

Title: Hierarchical Bayesian Model with Nonresponse

Abstract:

We describe a hierarchical Bayesian model to analyze multinomial nonignorable nonresponse data from small areas. We use Dirichlet prior on the multinomial probabilities and beta prior on the response probabilities which permit a pooling of data from different areas. This pooling is needed because of the weak identifiability of the parameters in the model. Inference is sampling based and Markov chain and Monte Carlo methods are used to perform the computations. We apply our method to Body Mass Index (BMI) data from the Third National Health and Nutrition Examination Survey (NHANES III).

Title: An Integrated Framework for Database Privacy Protection

Abstract:

One of the central objectives of studying database privacy protection is to protect sensitive information held in a database from being inferred by a generic database user. We propose a framework to assist in the formal analysis of the database inference problem. In this work, the inference problem is dealt with from two perspectives which are characterized by different properties of database attributes. One perspective is about the probabilistic dependency relationship among attributes. The other perspective is about the identification of an individual data item. The proposed framework involves two-tier processing. First, a similarity analysis is used to examine attributes that can be used to effectively identify individual information. Attributes selected from this analysis are processed by aggregation. The second tier is to mitigate the inference induced by the probabilistic dependency relationship. In our present approach, a blocking strategy is adopted to reduce the amount of released information. The desired attribute values to be blocked are determined by the corresponding sample probability. Our framework is based on an association network which is composed of a similarity measure and a (Bayesian) probabilistic network model. This provides a unified framework for database inference analysis.

Topic: Meeting the Sampling Needs of Business

Abstract:

Sampling for business uses often requires efforts to keep the sample small. There are situations where there are major cost implications associated with an increase in sample size. Generally, in these circumstances, the increased precision of ratio or regression estimation is useful. For small samples, the design-based ratio estimate under simple random sampling could be seriously biased unless the sample is balanced with respect to the covariate. Stratification on the covariate can achieve the effect of balancing, but the sample size needs to be reasonably large. We propose a deep stratification method at the design stage such that only one unit is drawn from each of the equal sized strata. We then use the regular design-based ratio estimate and its variance estimate. This method makes a remarkable contribution towards the bias reduction and also gives good variance estimates and coverage rates. Simulation results are presented.

Title: Data on the Racial Profiling of Travelers

Abstract:

Law enforcement officers are prohibited from engaging in discriminatory behavior on the basis of individuals' race, ethnicity, or national origin. However, there have been numerous allegations of such behavior from minorities who travel the nation's roadways and those who transit through the nation's airports.

This presentation will discuss two reviews that GAO undertook at the request of Congress. One review examined the available quantitative research on racial profiling of motorists, as well as data that federal, state, and local law enforcement agencies collect on motorist stops. The other review examined the U.S. Customs Service's policies and procedures for conducting personal search and what controls Customs had in place to ensure that airline passengers were not inappropriately subjected to personal searches because of their race or sex.

With respect to racial profiling of motorists, we found five quantitative analyses as of March 2000. All contained significant limitations, the most prominent being the use of inappropriate benchmarks to assess whether minorities were stopped proportionately more often than whites. Most analyses examined the racial composition of motorists who were stopped, but not the racial composition of motorists at risk of being stopped. The best studies collected data on both the population of travelers as well as the traffic violators on specific roadways. However, even the well-designed studies had methodological limitations, so we could not conclusively determine whether and to what extent racial profiling of motorists may occur. There is a clear need for more and better data on the subject, which law enforcement agencies and additional researchers are attempting to collect.

With respect to personal search practices of the U.S. Customs Service, we obtained data on all Customs personal searches, the outcome of the search, and some passenger characteristics, such as race and sex. Using a series of logistic regression models, we examined which types of passengers were more or less likely to be searched and the results of the searches. We found that certain groups of passengers were selected for more intrusive searches at rates that were not consistent with the rates of finding contraband. While this finding may or may not be evidence of profiling, it indicates that there is room for improvement in the efficiency of the Customs Service's targeting criteria.

Title: Federal Statistics and Statistical Ethics: The Role of the ASA's "Ethical Guidelines for Statistical Practice"

Abstract:

In their daily work, government statisticians must reconcile a host different requirements and demands. Among these are: user needs, statistical and subject-matter understandings and constraints, agency policies and traditions, supervisory instructions, political priorities, budgetary imperatives, federal law, and personal beliefs and friendships. These and other categories of requirements and constraints often provide conflicting guidance. Moreover, many of the categories themselves do not offer monolithic or even consistent instruction.

Needless to say, the process of sorting out all this conflicting advice on what to do and what not to do is usually implicit. In these circumstances, mistakes, big and small, can and do happen. Ethics provide a tool to help examine and proceed through this maze "do's" and "dont's" and thereby reduce the likelihood of such mistakes. While ethics are helpful in avoiding all sorts of mistakes in coping with conflicting demands, they are particularly helpful in avoiding ethical mistakes. Three sets of ethical standards are currently available for use in connection with the federal statistical system: (1) the American Statistical Association's (ASA) "Ethical Guidelines for Statistical Practice," (2) the International Statistical Institute's "Declaration on Professional Ethics," and (3) the United Nations Statistical Commission's "Fundamental Principles of Official Statistics." Although the focus and language of each is slightly different, certainly with respect to official statistics, they provide consistent guidance. The ASA guidelines, the subject of today's talk, consists of an executive summary, a preamble, and nine subsections that constitute the body of the guidelines. The ASA guidelines provide guidance on normal work issues as well as on the extraordinary challenges that one may sometimes face. The ASA guidelines, moreover, can be an important starting point for fostering ethical awareness in a statistical agency and ethical activism among agency leadership and staff. Indeed, recent research has demonstrated that, without such awareness and activism, the single-minded pursuit of other goals have led to serious lapses.

Title: Evaluation of the Brady Handgun Violence Prevention Act

Abstract:

The Brady Act established a nationwide requirement that licensed firearms dealers observe a waiting period and initiate a background check for handgun sales. As it turned out, 18 states already met these requirements at the time of its implementation in February 1994. They serve as a control group in analyzing the effect of the Act on homicide and suicide rates. An evaluation written by Jens Ludwig and Phillip Cook that reported largely negative findings was published in the Journal of the American Medical Association of August 2, 2000, and has been attacked from both sides since then. The pro-control folks argue that the "control" states in fact benefitted from Brady, while the anti-control folks argue that if we had done the evaluation right we would have found that Brady increased the homicide rate.

In this talk, the speaker will review the methods and findings, and review the bidding on the critics. He will also discuss the gaping loopholes in the Act that most likely account for its ineffectiveness.

Title: Statistical Aspects of Neural Networks

Abstract:

An Artificial Neural Network (ANN) is an information-processing paradigm inspired by the way the brain processes information. ANNs have been vigorously promoted in the computer science literature for tackling a wide variety of scientific problems. Recently, statisticians have started to investigate whether they are useful for tackling various statistical problems. Most of the interest in ANNs is motivated by their use as a universal function approximator. However, comparative studies with traditional statistical methods have given mixed results as to the added benefit they might provide.

This talk will explain what neural networks are and how they relate to statistical models. Model selection, estimation, and validation will be discussed. Advantages and disadvantages of their use and successful applications will be presented.

WSS Seminar & Julius Shiskin Award Presentation

Topic: Bias in Aggregate Productivity Trends Revisited

Abstract:

This paper develops measures of U.S. multifactor productivity (MFP) growth for aggregate sectors and for industries. MFP is designed to measure the joint influences on economic growth of factors such as technological change, efficiency improvements, returns to scale, and reallocation of resources. The paper updates results from earlier work. We continue to find a number of industries outside of manufacturing with negative MFP trends during the 1980s and 1990s. These include insurance, construction, banking, and health care. The paper considers the possibility that these negative trends reflect problems in measuring outputs and inputs. Attention is focused on the fact that service output sector trends are low relative to input trends. This may reflect the fact that we are better able to measure quality change for goods, especially high tech goods, than for services.

Following the seminar presentation, Marilyn Manser, Associate Commissioner for Productivity and Technology, BLS, will make remarks granting the Julius Shiskin Award of 2000 to Edwin R. Dean, formerly of the Bureau of Labor Statistics, for his important contributions to the improvement of and understanding of productivity measures, and also to programs on international comparisons of labor statistics and international technical cooperation. The Julius Shiskin Award was intended to honor original and important contributions in the development of economic statistics and in their use in interpreting economic events. It is jointly sponsored by the Washington Statistical Society and the National Association of Business Economists. Edwin Dean's expertise and innovation has placed increased emphasis on the Bureau of Labor Statistics' international technical cooperation program. These steps have helped foster the reputation of the United States as a leader in the World's increasingly global economy.

Please join the Washington Statistical Society on February 14, 2001, at 12:30 p.m. to honor Edwin Dean as we present the award to him and celebrate in a reception following the award.

Title: A Prototype Data Dissemination System for the 2002 Census of Agriculture

Abstract:

Historically, the Census of Agriculture represents the leading source of local area statistics about U.S. agriculture. In 1997, responsibility for the Census of Agriculture was transferred from the Bureau of the Census to the National Agricultural Statistics Service (NASS). In large part, this responsibility involves collecting, analyzing, and publishing data regarding all places defined as farms.

This prototype system uses data from the 1997 Census of Agriculture to demonstrate methods of graphical display that could give NASS data customers a better understanding of patterns and structure in the data. These methods could also give data customers enhanced ability to view, analyze, and interact with summary data previously available only in tables.

Using concepts and methods developed and inspired by Tukey, Cleveland, Tufte, Carr, and others, the system demonstrates how NASS data can be more effectively displayed and disseminated using dots, lines, arrows, and maps. While this system uses data from the 1997 Census of Agriculture, the applicability to other sources of survey and census data will hopefully be apparent.

The prototype system presented uses a Web browser to display and disseminate data, and will ultimately feature the ability to dynamically generate and interact with various charts and maps.

Whether the customer's intent is to print a given display or view it on a screen, the system is structured to make NASS published data easier to access and understand.

Topic: Small-Area Poverty Estimates and Public Policy: Looking to the Future

Abstract:

More than $130 billion of federal funds are allocated each year to states and localities by means of formulas that include estimates of poverty or income. States also use small-area income and poverty estimates to allocate their own and federal funds to substate areas. The funds support a wide range of activities and services, including child care, community development, education, job training, nutrition, public health and others.

The newest source of estimates is the Census Bureau's Small Area Income and Poverty Estimates (SAIPE) Program, which was begun in the early 1990s to provide estimates that would be more timely than those from the decennial census. 1994 legislation specified the use of SAIPE estimates to allocate more than $7 billion each year of Title I Elementary and Secondary Education Act funds for disadvantaged children, pending a review by a Committee on National Statistics (CNSTAT) panel, and the SAIPE estimates are used for other federal programs as well.

The CNSTAT Panel on Estimates of Poverty for Small Geographic Areas evaluated the currently used SAIPE methods and estimates and outlined an agenda for future research and develpment. Graham Kalton, chair of the panel, and Connie Citro, the panel's study director, will review the panel's findings and recommendations, with a particular emphasis on future R&D and the role that new surveys, such as the 2000 census long form and the American Community Survey, and administrative records can play in improving the estimates for public policy use in the next decade and beyond.

Topic: The Role of Questionnaire Design in Medicaid Estimates: Results from an Experiment

Abstract:

Two Congressional acts of the 1990s -- the welfare reform law (Personal Responsibility and Work Opportunity Reconciliation Act of 1996) and the Children's Health Insurance Program (CHIP) in 1997 -- have profound implications for the Medicaid target population. While researchers and public policy experts have monitored Medicaid since its inception, these recent federal initiatives have heightened the demand for reliable statistics on the number and characteristics of people on Medicaid, both to detect unintended consequences of welfare reform and to track the success of the CHIP program. Figures on Medicaid, however, vary significantly depending on the survey used to generate the estimates.

This session will present findings from an experimental survey (the Census Bureau's 1999 Questionnaire Design Experimental Research Survey) which measured Medicaid participation under four different survey designs. The designs contain key features of several important health surveys, including the Current Population Survey (the survey used by the federal government to generate official statistics on health insurance coverage), the Survey of Income and Program Participation and the Robert Would Johnson Foundation's Community Tracking Survey. Results will focus on the levels and characteristics of Medicaid participants as measured under these four different designs. Attendants of the session can expect to recognize associations between survey design features and the estimates those surveys generate, and to analyze existing survey data and design new surveys in light of these associations.

Topic: Mass Imputation of Agricultural Economic Data Missing by Design-A Simulation Study of Two Regression Based Techniques

Abstract:

In making an effort to reduce respondent burden, an approach that one might take is to reduce significantly the amount of data being collected. It would be desirous to do this in a manner that facilitates the use of methodology that decreases the impact of this reduction in data collection on both point estimates and analyses obtained from the reduced data set.

Two imputation techniques are investigated, one based on a Markov Chain Monte Carlo algorithm (Schafer, 1997) under a conditional multivariate normal assumption, the other using a simple least squares regression with an added random empirical residual (RER). Computer simulation was used to study the statistical characteristics of agriculture economic data sets completed by imputing data using these methods. The performance of these methods when applied to situations where, by design, as much as 60 % of the records require an imputed value for a given variable is of particular interest.

Particularly problematic is the extreme skewness and semi-continuous nature of many of these data-the former a result of a relatively few large values, the latter caused by a preponderance of legitimate zero values. First, power transformations are used to create more normal-like marginal distributions for each variable. Logistic regression is then used to determine if a positive or zero is to be imputed for a given variable on a given record. Then, the afore mentioned modeling techniques are applied to obtain the necessary positive values to impute.

For estimates of means, moderate gains in precision were achieved for some variables, but for other variables, the imputations appeared to add nothing but noise and/or bias. The Random Empirical Error method did preserve correlations quite well whereas the Markov Chain Monte Carlo approach was less effective in this regard.

Title: Oatmeal Cookies, Weather Modification, and Organ Failure:
The Art of Combining Results from Independent Statistical Studies

Abstract:

The global information explosion in almost all areas of science, coupled with the movement of evidence-based medicine, has generated the need for the synthesis and assessment of evidence. There is now a huge body of studies that deal with specific problems. For example, there are over 750 experiments on the effect of cloud seeding, some of which may use different seeding agents, may seed in different months, and so on. In the health sciences there are many studies concerned with the effects of a particular drug or treatment. For example, is a combination of estrogen and progesterone effective in reducing osteoporosis in women, or is aspirin effective in diminishing heart attacks? In each case, different studies may use different populations, concentrations, or frequency may vary, etc. Statisticians are being challenged to define procedures for combining the results of such uncoordinated studies. The set of such procedures has been called meta-analysis, in contrast to primary or secondary analyses.

An area of current interest is how to model dependencies. The mechanisms for such modeling may arise from statistical concepts, physical structures, characterizations, mixtures, etc. Thus, we might expect models for the joint failure distributions of the engines on an airplane that differ from the models of failure distributions of organs in the body. This area is fascinating in that it permits one to study various physical phenomena statistically, and that it requires a combination of both physical and statistical insight.

Note: The talk is similar to Dr. Olkin's Fisher Lecture at the ASA meetings last August.

NIST Statistical Engineering Division Symposium

As part of the project at NIST to update the Handbook of Mathematical Functions by Abramovitz and Stegun and to make it available electronically as the Digital of Library of Mathematical Functions conferences are being held at NIST for some of the principal topics. The Mathematical and Computational Sciences Division and the Statistical Engineering Division are jointly sponsoring a symposium on Gibbs, MCMC and Importance Sampling, specific topics that are part of the expansion of coverage for the Digital Library. A demonstration of the NIST-SEMATECH handbook, also a web-based publication, will be included.

The symposium is open to the public; and WSS members in particular are invited; and there is no fee. Reservations are requested in order to guarantee space and to take NIST shuttle from Metro. Call Stephany Bailey at 301 975-2839 (Statistical Engineering Division) or via email at stephany.bailey@nist.gov for advance registration or for additional information.