Lifecycle & Comparative Studies
Metadata Needs
of the Future CESSDA RI
Uwe Jensen
GESIS – Leibniz Institute for the Social Science (Germany)
28.05.2009 E4 – CESSDA PPP
IASSIST / IFOD: Mobile Data and the Life Cycle – Tampere, Finland May 26-29, 2009
Comparative research - a major feature in social science activity
• Globalisation & European integration > international collaboration & cross disciplinary networking,
• the growth in potential funding sources (EC FP’s, ESF; national funding)
Comparative research - variety of forms and strategies
–
–
High quality & ambitious research affects dynamically methodology and concepts
Projects expand in no. of countries & increased complexity of the collected data of various types
Variety of Subjects
–
–
Political & social institutions & systems (like welfare, organisations, families …)
Attitudes, values, and behaviour pattern in manifold areas (health, consumer, election …)
Variety of design concepts & spatial & temporal dimension (examples)
–
–
–
–
–
Special topics / EU Candidate countries (Eurobarometer)
Modules for replications > cumulation (ISSP)
Ex-post times series (EB Trend file)
Longitudinal studies to observe the same items over long periods of time (ESS, EVS).
National studies conceptualized to allow comparison (Household panels; GSS, Age; Health)
Metadata needs of the future CESSDA RI regard
Conception > Production > Repurposing of complex data
Simplified Study Lifecycle
IASSIST / IFOD: Mobile Data and the Life Cycle – Tampere, Finland May 26-29, 2009
Conception >
> Production
> Repurposing
Data producer (Primary investigator) provide metadata (Reports on …)
• Project (proposal & initialisation)
> project mission & organisation & management & work principles
> funding & conditions > archiving > public-re-use (> Data sharing)
• Study concept and survey design
> Background and research questions for the study instance
• Questionnaire development
> basic questionnaire
> translations > country specific questionnaire / questions
documentation & monitoring procedures
• Sample selection
> Pilot / Pre-test of the questionnaire
Project considers or decide on
• additional or new metadata needs
• to develop and to document (at this phase or at a later time)
IASSIST / IFOD: Mobile Data and the Life Cycle – Tampere, Finland May 26-29, 2009
Conception >
> Production
> Repurposing
New metadata needs may occur as consequence of methodological findings, e.g. on:
o Mode of Data Collection (mixed mode > face-to-face, telephone, Internet and paper self-completion)
o Strategies on improving response rates,
o Functional equivalence of question
Needs for Contextual statistics to support analysis of variables
Development of attitudinal indicators
e.g. link between “life satisfaction” <> national economic indicator (ESS)
Demographic database for the country level
to provide additional personal or household characteristics (sex, income, educational level, ...)
Socio-economic macro statistics
> Example: “Contextual data for the European Social Survey”
IASSIST / IFOD: Mobile Data and the Life Cycle – Tampere, Finland May 26-29, 2009
Conception >
> Production
> Repurposing
Data collection
• Interviewer training
> Guideline & monitoring
• Country specific fieldwork
> monitoring and outcomes > Reports
Additional metadata needs
• Complementary documentation of Event Data
> Database to document context effects during fieldwork (like social, economic, natural events) (like ESS)
Depending on project organisation:
• Quality control of the fielded data and (initial) data documentation
– by country according to ex-ante proved standard (like ISSP; ESS, EVS) or
– by other project regulation (e.g. integrated dataset by field work institute > Eurobarometer )
Primary Data Analyses
Publication of results (by Primary investigator / Project)
Pass Study to a Data provider instance
IASSIST / IFOD: Mobile Data and the Life Cycle – Tampere, Finland May 26-29, 2009
Conception >
> Production
> Repurposing
Contracting / Ingest / Long-term preservation > special topic: Preservation metadata
Data Processing - Harmonisation - Integration & Metadata Documentation
Extension of metadata needs to support public retrieval and re-use of comparative data (selective!!)
Metadata capture & data processing – Standard procedures and documentation standards
• Question / Variable …. :
• Project information & Study design (funding, concepts; methodological design details)
Index / Classify by controlled vocabs > Study – Question – Variable
Referencing general subjects to other collections / studies (> ISSP; EVS, ESS, ….)
Substantial context information:
• Standards used in documentation (Employment; Regions, …)
• From other disciplines & trusted (web) resources (statistics, country fact sheet, Health, Political Parties …)
Knowledge references & products
• Linkage of publications on the data & (primary & secondary)
• project based on-line bibliography database to register publications referring to respective data
• Durable Citation of versioned metadata/data (PID) (study; the data (a single variable, group or trend)
Lon-term preservation of processed study product
Publication / dissemination
IASSIST / IFOD: Mobile Data and the Life Cycle – Tampere, Finland May 26-29, 2009
Conception >
Production >
Repurposing >
Publication of Data & Metadata : Store > Discover > Explore > Access (> CESSDA data portal
Metadata needs to support re-use of data (via local holdings > integrating CESSDA portal)
> Metadata on structural & substantial data context (study x question x variable for space x time dimensions)
Browsing – Presentation views on one Study module by topics; by year, …
• Module as: IDS, Country DS X Q/V groups – plus Quest., Reports, Tables, ref. Studies, Publications, DBs …
• Cumulation: trend view > category groups and selection by country & year (or groups of them)
Search by categories to discover comparative & (potentially) comparable Questions / Data from diff. studies
+ applying controlled vocabs for European languages (WP4)
Minimum requirement: universe (object) + concept (property) + variable content (representation) (ISO/IEC 11179 compliance)
+ further methodological aspects … > selective for XY countries / XY year(s)
Question / Variable browsing & retrieval > specialized platform for Harmonisation & Question DB (WP9)
• Trend presentations from different surveys > …
• Concepts / scale information – multilingual question wording in a study series - ….
Data explorations & analysis > Data access
Role of Gov. I - 1985
IDS C6/V141/R7350
Role of Gov. II - 1990
IDS C11/V142/R14897
Role of Gov. III - 1996
IDS C26/V141/R35313
Role of Gov. IV – 2006
IDS C34/V141/R48641
IASSIST / IFOD: Mobile Data and the Life Cycle – Tampere, Finland May 26-29, 2009
Role of Gov. I-IV (2008)
CUM
C 22 – V 114+ - R 83485
A short stop: How to manage complex collections of metadata & relationships at the Archive level?
Conception >
Production >
Repurposing >
> metadata …
> metadata …
> metadata …
Role of Gov. IV – 2006
IDS C34/V141/R48641
Role of Gov. III - 1996
IDS C26/V141/R35313
Role of Gov. II - 1990
IDS C11/V142/R14897
Role of Gov. I - 1985
IDS C6/V141/R7350
Role of Gov. I-IV (2008)
Cumulation
C22 - V114+… - R83485
Further Standards & Models - OAIS > Preservation > Statistical data …
DDI 3.1
Why?!
HowTo
…
CESSDA
Policies
DDI 3 Best practice
> on the way + …
Strategy plan
Business Plan
Projects to
develop tools
Comparison & grouping
Longitudinal data
Workflows on ex-ante/ex-post proc
Metadata re-use & Repositories
Keyword & concepts
International
cooperation
Metadata storage & preservation
Migration strategies
Metadata discovery
Re-Use of Data & Metadata in secondary analysis
IASSIST / IFOD: Mobile Data and the Life Cycle – Tampere, Finland May 26-29, 2009
Conception >
> Production
> Repurposing
• Re-Use of Data & Metadata
Role of Gov. I - 1985
IDS C6/V141/R7350
Role of Gov. II - 1990
IDS C11/V142/R14897
Role of Gov. III - 1996
IDS C26/V141/R35313
Role of Gov. IV – 2006
IDS C34/V141/R48641
Role of Gov. I-IV (2008)
CUM C22/V114+/R83485
Metadata needs of the future CESSDA RI
Metadata provision for re-use and replication of available data
• Expands present data & metadata > new knowledge products
Harmonisation platform with
• to inform on constructs and classifications (Constructs, Classifications, Conversions DB)
• to enhance comparability of data across surveys > expand present metadata by community-produced metadata
complementary Question database
• to inform comparative research on large scale on national or time-specific version of a question
> allows for immediate systematic comparison > connected to concepts & classifications
• to support the development of new questionnaire
new Conception > Production > Repurposing …..
IASSIST / IFOD: Mobile Data and the Life Cycle – Tampere, Finland May 26-29, 2009
Thank you!
Contacts & information:
http://www.cessda.org
Metadata needs & functional specifications for the CESSDA RI
WP4
WP8
WP9
WP11
WP12
WP5
Controlled vocabularies
Enhancement of data and metadata infrastructures
Harmonisation platform & question database
Investigating the potential of grid technologies
Technical support for the preparatory phase
One-stop-shop Portal >
IASSIST / IFOD: Mobile Data and the Life Cycle – Tampere, Finland May 26-29, 2009
Descargar

Folie 1 - Etusivu | Tietoarkisto