SOMAR Data Access Application Guide

SOMAR Data Access Application Guide

Welcome! Thank you for your interest in applying for access to SOMAR’s Data!

This guide will help you complete SOMAR's application forms and prepare the required documentation for upload. Please carefully review this guide, and if you haven’t already, we recommend viewing the publicly available resources listed in the sections below for specific datasets you are interested in accessing.

Please use the sidebar on the right side of this page for easier navigation.

When ready, please return to SOMAR’s Data Access Applications Portal (SOMAR-APPLY) and select the dataset you wish to access to begin.

IMPORTANT NOTE: Please complete the application form in one sitting. SOMAR is working to enhance the forms so progress can be saved prior to submission.

This guide is a living document and will be updated as needed for easier navigation and more clarity in the information provided.

Feel free to reach out to somar-help@umich.edu if you have any questions about the applications!


Table of Contents


SOMAR Data Access Methods and Requirements

Data at SOMAR that require applications typically are available via one of two access methods. Researchers interested in exploring and using SOMAR data are encouraged to review the access methods and requirements defined below to understand the requirements to access a data file. Please reach out to somar-help@umich.edu for any questions.

Virtual Data Enclave (VDE) - data access only available in a secure virtual environment

Some SOMAR datasets have highly sensitive or identifiable information or usage restrictions. These datasets are only available in our Virtual Data Enclave (VDE). VDE datasets are noted in the “Dataset required for research” section. One example of datasets available in the VDE is the U.S. 2020 Facebook and Instagram Election Study Collection.

  • Accounts needed

    • Jira account (sign-up required; it's free)

  • Application requirements

    • Full application - All required items must be completed and provided, including IRB or Ethics Committee Review documentation, Restricted Data Use Agreement (RDUA), CVs or resumes, etc.

Controlled Download - data download available via a secure link

Some SOMAR datasets may cover sensitive topics or have usage restrictions. These datasets require an application and agreement to follow ICPSR’s Terms of Use. However, these datasets can be used in a researcher’s own computing environment. After your application is approved, we will send you a link to download the dataset. Controlled Download datasets are noted in the “Dataset required for research” section. One example of datasets available via Controlled Download is State of Social Connections Study.

  • Accounts needed

    • Jira account (sign-up required; it's free)

  • Application requirements

    • Short application - the following items are not always required: RDUA, IRB or Ethics Committee Review documentation, confidential/sensitive data experience, ethical guidelines, coding/technical experience

  • Agreement to Terms of Use in the Application

    • Researchers applying to access Controlled Download data must agree to Standard Terms of Use from ICPSR


Resources for Data from Meta

If you haven't already, we recommend reviewing the Meta Transparency Center webpages and exploring detailed documentation to learn more about the features, search quality, and data available in Meta Content Library and Content Library API.

Meta has partnered with the Inter-university Consortium for Political and Social Research (ICPSR) at the University of Michigan to share public data from Meta’s platforms in a responsible, privacy-preserving way. This partnership is enabled through ICPSR’s industry-leading SOMAR initiative.

Meta Content Library and Content Library API

Meta Content Library User Interface (UI) is hosted by Meta Platforms. Content Library API is hosted within SOMAR’s secure Virtual Data Enclave (“VDE”) and the Meta Secure Research Environment. Lead researchers can select to access Content Library API on either secure computing platform in their Meta Content Library application for their team. All collaborators within a research team must use the same secure computing platform to access Meta Content Library API. Please refer to FAQs for more information, including the breakdown of both platform options to inform your decision during your application process. Content Library UI can be used separately or in conjunction with Content Library API.

You can access the application to Meta Content Library on Research Tools Manager, the Meta-hosted application portal. The application form is the same for applying to both Content Library UI and API access. If you apply for access to Meta Content Library API on the SOMAR Virtual Data Enclave, SOMAR will require additional information from you after approval.

Meta Content Library Update

You can now apply to Meta Content Library on Research Tools Manager. To streamline the application and onboarding experience, the Meta Content Library application-hosting and account management infrastructure has moved to Research Tools Manager, the Meta-hosted application portal. Please refer to the Meta documentation for updated information.

If you have any questions related to Research Tools Manager, Meta Content Library UI, and API in Meta SRE, you can open a ticket through Meta’s support channel.

If you have any questions related to using Meta Content Library API in SOMAR’s VDE, please reach out to us at somar-help@umich.edu.

Meta Ad Targeting Dataset Update

Applications for access to the Ad Targeting dataset paused on December 19, 2025. Continue to refer back to the Meta Get access page for the latest application information.


Application Forms

SOMAR’s Data Access Application portal enables us to collect information about your research project and your credentials, thus helping us better understand how we can support your research and data analysis.

There are two forms:

  • SOMAR Data Access Application

    • Lead Researchers or Applicants should use this form to submit requests for access to data from the Social Media Archive (SOMAR)

  • Collaborator Request

    • Lead Researchers, Applicants, or Collaborators should use this form to submit requests to add collaborators who require access to data from the Social Media Archive (SOMAR).

Note: Most fields are required, indicated with an asterisk *

SOMAR Data Access Application

Project Application Fields

The fields listed below capture information that helps SOMAR staff to record and track applications as well as easily identify a point person for each application.

Applicable Datasets

Fields/Questions

Notes for Applicants

Applicable Datasets

Fields/Questions

Notes for Applicants

All datasets

Dataset required for research*

This is a cascading drop-down menu question.

Please let us know the names of the dataset(s) needed for your research project.

U.S. 2020 Facebook and Instagram Election Study

Since you selected the U.S. 2020 Facebook and Instagram Election Study, please select the files you need for your research project.

This is a multi-select dropdown question. Please select all of the datasets you need for your research.

Researcher Contact Details

This application requires information that helps SOMAR staff to record and track applications as well as easily identify a point person for each application. The roles in this application form are the following: 

  1. Applicant: Individual who submits and manages the application as the primary point of contact. This role is administrative only, and applicants do not receive data access. If the applicant also needs access, they must be listed as a collaborator. The applicant may also be the Lead Researcher or another member of the research team.

  2. Lead Researcher: Individual leading the proposed research project or larger research agenda/initiative that requires access to the data for which they are applying. If the research project or research agenda is team-based, the Lead Researcher will be the Principal Investigator, Lead Investigator, or Research Lead, or equivalent in your organization (e.g. Social Media Monitor). In some instances, the applicant and the Lead Researcher will be the same individual.

  3. Institutional signatory: Individuals at the Lead Researcher’s affiliated institution who holds the authority to enter into legal agreements and sign contracts on the institution’s behalf. Signatories typically work in a department such as the Contracts Office, Office of Sponsored Research, Research Contracts Management office, etc. The institutional signatory will have to sign the restricted data agreement(s) associated with this application.

  4. Collaborators: Anyone other than the Lead Researcher who has access to the data or handles the data in any capacity. If any computing staff, data librarian, research assistants (including students), or general staff will be handling the data, they must be included in this application as collaborators. Depending on institutional requirements, collaborators may be required to be officially affiliated with the Lead Researcher's institution. Please reach out to somar-help@umich.edu for any questions. NOTE: If you are applying for Meta Content Library API access in SOMAR’s VDE and you are a collaborator on a research team, follow the instructions in the Meta documentation for how to receive your invitation to submit your application.

 

Applicant Details

Applicable Datasets

Fields/Questions

Notes for Applicants

Applicable Datasets

Fields/Questions

Notes for Applicants

All Datasets

First name of Applicant*

 

Last name of Applicant*

 

Institutional email of Applicant*

The Applicant's email address must show their institutional affiliation.

 

Lead Researcher Details

The fields listed below gather high-level information about the Lead Researcher, their institution, and their experience. It is important that the information provided for these fields is as current and accurate as possible, as it helps to keep SOMAR’s application reviews efficient. 

Lead Researcher is the individual leading the proposed research project or larger research agenda/initiative that requires access to the dataset selected for research. If the research project or research agenda is team-based, the Lead Researcher will be the Principal Investigator, Lead Investigator, Research Lead, or equivalent in your organization (e.g. Social Media Monitor). In some instances, the applicant and the Lead Researcher will be the same individual.

Applicable Datasets

Fields/Questions

Notes for Applicants

Applicable Datasets

Fields/Questions

Notes for Applicants

All Datasets

First name of Lead Researcher*

 

All Datasets

Last name of Lead Researcher*

 

All Datasets

Institutional email of Lead Researcher*

The Lead Researcher's email address must show their institutional affiliation. NOTE: Personal email addresses, such as @gmail.com, are not permitted.

All Datasets

Primary discipline or professional area of expertise*

 

  • Beyond the Hashtags: #Ferguson, #Blacklivesmatter, and the Online Struggle for Offline Justice

  • ChatGPT in education: A discourse analysis of worries and concerns on social media

  • Politweets: Tweets of politicians, celebrities, news media, and influencers from India and the United States

  • U.S. 2020 Facebook and Instagram Election Study

Highest degree earned*

Please name the highest degree earned by the Lead Researcher.

NOTE: Lead Researchers must hold a terminal degree (e.g., PhD, MD, DrPh, or JD) to access data the VDE.

  • Replication data for "Emergent structures of attention on social media are driven by amplification and triad transitivity"

  • State of Social Connections Study (Controlled Download)

Highest degree earned*

Please name the highest degree earned by the Lead Researcher (e.g., PhD, MD, DrPh, JD, MA, MS, MPH, BA, etc.)

NOTE: this information is collected for demographic purposes; no particular degree is required to access the data.

All Datasets

Affiliational profile URL (organization, research, or academia)*

The Lead Researcher is required to provide a personal profile showcasing their current affiliation with their organization. Please include a link to any of the following: an active institutional profile or biography, organizational website with directory or staff list, ORCID or ResearchGate profile, or a webpage for their research group, lab, or department. If providing these links is not feasible, other documentation may be submitted at the end of the application to verify the researcher’s affiliation.

NOTE: LinkedIn profiles, GitHub Pages, or personal websites are not permitted.

All Datasets

Institution name*

The Lead Researcher must be currently affiliated with the institution provided in this field.

 

Lead Researchers must either be affiliated with an academic institution or other non-university organization, institute or society which operates as a not-for-profit entity and holds scientific or public interest research as a primary purpose or core activity. Researchers from different disciplinary and professional backgrounds are welcome to apply.

All Datasets

Type of Institution*

 

All Datasets

Country of institution (ISO Code)*

Please provide the 3-letter ISO country code for your institution (e.g., USA, CHN, AUS). To find the correct code, use the following link and search for your country’s 3-letter code: https://www.iso.org/obp/ui/#search/code/.

All Datasets

State/Province of institution*

Please provide the full name of the state or province of the affiliated institution.

All Datasets

City of institution*

Please provide the full name of the city of the affiliated institution.

All Datasets

Department name*

If your institution does not have a department, please add “N/A’

All Datasets

Institute, center, or lab name*

If you do not have anything to report, please add "N/A".

All Datasets

Role at institution*

Applicants will choose one answer from the following options:

  • Assistant professor

  • Associate professor

  • Professor

  • PhD student

  • Master’s student

  • Lecturer

  • Postdoctoral fellow

  • Independent researcher

  • Staff researcher

  • Other (describe)

All Datasets

Lead Researcher's resume or curriculum vitae*

Please upload the Lead Researcher's resume or curriculum vitae in Adobe PDF or Microsoft Word.

 

Collaborator Details

Applicable Datasets

Fields/Questions

Notes for Applicants

Applicable Datasets

Fields/Questions

Notes for Applicants

All Datasets

Are there any collaborators that need access to the requested data for this research project?*

 

Please select 'Yes' only if your collaborators will perform any type of analysis or data management activities.

It is important to note that only individuals who need to perform any data analysis or directly interact with the data can be added as collaborators.

Individuals who can review output/results outside the Virtual Data Enclave or immediately before submission to a publisher or a conference do not need to be added as collaborators for your data access request.

Collaborators can be added at any time after the Lead Researcher’s application has been submitted.

A separate form must be completed for each collaborator. The following information is required:

  • Lead Researcher’s application ID (SOMARAPPLY-####)

  • Research project title

  • Lead Researcher’s last name and email address

To add collaborators, use the Collaborator Addition Request Form. The questions on this form can be found in this document. If you need to add a Collaborator for Meta Content Library API access in SOMAR’s VDE, follow the instructions in the Meta documentation.

Note: If the Project Application references a collaborator’s experience to meet eligibility or expertise requirements, that collaborator must complete the Collaborator Addition Request form for the application to proceed. Until the form is submitted, we cannot consider their qualifications in the review process of the Project Application, and the application will be placed on hold.

 

Additional Contacts

Applicable Datasets

Fields/Questions

Notes for Applicants

Applicable Datasets

Fields/Questions

Notes for Applicants

All Datasets

Additional Email Addresses for Notifications

Provide the email addresses of anyone who should receive updates about this application. These individuals will receive the same notifications as the Applicant or Lead Researcher. Use commas to separate multiple addresses (e.g., john.doe@university.edu, jane.smith@university.edu).

 

Individuals such as Lead Researcher’s institutional signatory can be included in the application notifications if needed.

 

Research Project Information

These items collect information about the Lead Researcher’s research project or agenda, including the summary and why the data are needed. We encourage applicants to provide as much relevant information as needed about their research.  If necessary, applicants are welcome to upload additional supporting documentation about the research project or the researcher’s experience later in the application form. 

Applicable Datasets

Fields/Questions

Notes for Applicants

All Datasets

Research Project Title*

Please provide your project title or a brief, 1-2 sentence overview of your research agenda.

*Examples could include “investigate public health discourse on social media,” “study social media content related to upcoming presidential elections,” etc.

All Datasets

Research project summary*

In 250 words or less (up to 1750 characters), describe your research agenda or plans for a non-specialist audience. Your research agenda can include longer term or ongoing research that you and your team are conducting. Please be sure to address: your general area of focus, guiding research questions, methodologies, as well as any endpoints, data types, or data fields of interest. If you haven't already, please consult the publicly available documentation about the data before you begin your application.

All Datasets

Keywords associated with research*

List the keywords associated with the research project. Please separate the keywords with commas.

All Datasets

Funding source(s)*

List the funding sources that will support the Lead Researcher's data analysis. Please separate the funding items with commas.

You may enter "N/A" if there are none.

All Datasets

Justification for why requested data are required for your research activities.*

(250 words or less)

Please briefly explain why the data you are applying to obtain access are required for your research activities.

All Datasets

Research outcomes*

Please select the intended outcome(s) of your research project.

 

One or more answers from the following options may be selected:

  • Peer-reviewed article

  • Conference presentation

  • White paper

  • Article (popular press)

  • Study reproduction (for journal peer reviewers)

  • Study reproduction (for researchers/data users)

  • Article review

  • Report

  • Other

 

Additional requirements for data access in the VDE

These items collect additional information about the Lead Researcher and research project to be reviewed for access to the data in the Virtual Data Enclave (e.g., Meta Content Library API). If necessary, applicants are welcome to upload additional supporting documentation about the research project or the researcher’s experience later in the application form.

Applicable Datasets

Fields/Questions

Notes for Applicants

  • Beyond the Hashtags: #Ferguson, #Blacklivesmatter, and the Online Struggle for Offline Justice

  • ChatGPT in education: A discourse analysis of worries and concerns on social media

  • Meta Content Library and Content Library API (for API Access)

  • Politweets: Tweets of politicians, celebrities, news media, and influencers from India and the United States

  • U.S. 2020 Facebook and Instagram Election Study

Including any education, the total number of years of coding experience*

 

Information about the Lead Researcher's coding experience is necessary to inform SOMAR how best to assist researchers for VDE access. Please provide details in these next few questions.

Coding experience can come from using Python, R, Java, C++, and so on.

 

Applicants will choose one answer from the following options:

  • No experience

  • Less than 1 year

  • 1 to 4 years

  • 5 to 9 years

  • 10-14 years

  • 15+ years

Preferred programming languages* 

Please select your preferred programming languages.

Preferred statistical software for quantitative data analysis*

Please select your preferred statistical software for quantitative data analysis.

Evidence of technical skills of Lead Researcher and/or collaborators*

Experience with Python, R, SQL, or another coding or querying language is recommended for VDE access. Explain your experience with a coding or querying language here, or applicants may provide evidence via a link to a GitHub repository or other location where code examples have been shared. Access to the VDE is not dependent on experience with coding. This field is to help SOMAR know how to best support researchers.

Note: If the Project Application references a collaborator’s experience to meet eligibility or expertise requirements, that collaborator must complete the Collaborator Addition Request form for the application to proceed. Until the form is submitted, we cannot consider their qualifications in the review process of the Project Application, and the application will be placed on hold.

*The character limit is 5000.

Evidence of responsible experience or use of sensitive or restricted data*

 

Evidence of responsible experience or understanding of sensitive or restricted-use datasets is required. Please provide up to 3 citations or examples of your research that demonstrate your experience or understanding of using sensitive data.

 

*The character limit is 5000.

Ethical guidelines to be used to guide research agenda*

Please 1) indicate the ethical guidelines you will use to guide your research project, and 2) describe how you plan to apply specific principles in practice. For example, you may refer to the AOIR Ethics, NeurIPS Code of Ethics, or Williams, Burlap, and Sloan (2017), but you must also describe how you adhere to the guidelines you selected.

I understand that these data are to be used solely for statistical analysis and reporting of aggregated information and not for the investigation of specific individuals or organizations.* 

This field is required.

 

Please select “True” or “False”

I understand that I and any collaborator I include on my team must have the skill sets to use the data if granted access.*

This field is required.

 

Please select “True” or “False”

 

Required Questions for Virtual Enclave Onboarding

Each person using the VDE must complete the following ICPSR VDE training as part of the application and onboarding process. Please take the following steps:

  1. Watch the ICPSR VDE Training video (~7 minutes)

    • Please note that SOMAR’s VDE environment has some differences, but the VDE security information remains the same.

  2. Complete the VDE Training Quiz (~1-2 minutes)

    • Although instructions in the confirmation of submission message ask researchers to email ICPSR that they completed the quiz, SOMAR does not need these messages. We will onboard researchers to the VDE after their applications have been approved and all requirements for data access in the VDE are met.

  3. If you answered any quiz questions incorrectly, please rewatch the video.

  4. Complete the questions directly below.

Applicable Datasets

Fields/Questions

Notes for Applicants

Applicable Datasets

Fields/Questions

Notes for Applicants

  • Beyond the Hashtags: #Ferguson, #Blacklivesmatter, and the Online Struggle for Offline Justice

  • ChatGPT in education: A discourse analysis of worries and concerns on social media

  • Meta Content Library and Content Library API (for API Access)

  • Politweets: Tweets of politicians, celebrities, news media, and influencers from India and the United States

  • U.S. 2020 Facebook and Instagram Election Study

Do you currently have active credentials (example: uniqname) to access the SOMAR VDE?*

Please select “Yes” or “No”.

Active credentials mean you can access the SOMAR VDE.

Date of Birth (month/day/year)*

This field is required if you select “No” to the active VDE credentials question.

Phone Number (home/mobile)*

This field is required if you select “No” to the active VDE credentials question.

Please enter your phone number using numbers only (no symbols, spaces, or letters). Include your country and area codes. For example: 441234567890 (UK) or 12125551234 (US).

As outlined in Steps 1 and 2 above, I have watched the VDE training video and passed the VDE training quiz.*

This field is required if you select “No” to the active VDE credentials question.

Please select “Yes” or “No”. You will not be able to submit the application form unless you complete these steps and select “Yes”.

Restricted Data Use Agreement with ICPSR

(also known as Data Use Agreement)

For access to the data in SOMAR's Virtual Data Enclave, the Restricted Data Agreement (RDUA) must be reviewed, signed, and dated by the Lead Researcher and their institutionʼs legal representative or signatory. The Restricted Data Use Agreement is an agreement between the University of Michigan and the Lead Researcherʼs institution, signed by both the Lead Researcher and an institutional signatory (i.e., legal representative) of the Lead Researcherʼs institution, which specifies the terms of use of the restricted data.

Please download the agreement document and upload the completed version in the form.

SOMAR is happy to collaborate with researchers whose institutions seek to modify the Restricted Data Use Agreement (RDUA). Please note that all proposed modifications—whether new or previously accepted in other agreements—must still undergo our standard review and approval process with the University of Michigan research contracts department.

If an institution needs to request modifications to the agreement, they must be submitted as a redlined Word document. Please note that requesting changes will significantly extend processing time and may delay data access by several weeks, as all revisions must be reviewed and approved by the University of Michigan legal team. We encourage you to carefully consider whether the proposed changes are essential before proceeding.

An institutional representative such as a contracts officer or legal signatory should email the redlined agreement to ICPSR-help@umich.edu and indicate that the request is related to SOMAR data. Modified agreements can take up to three months to process, although most are completed within six weeks. Timelines depend on the number of agreements currently in the queue.

For additional information and a detailed timeline, please see the ICPSR Agreement Modification Process:

Applicable Datasets

Fields/Questions

Notes for Applicants

Applicable Datasets

Fields/Questions

Notes for Applicants

  • Beyond the Hashtags: #Ferguson, #Blacklivesmatter, and the Online Struggle for Offline Justice

  • ChatGPT in education: A discourse analysis of worries and concerns on social media

  • Politweets: Tweets of politicians, celebrities, news media, and influencers from India and the United States

  • U.S. 2020 Facebook and Instagram Election Study

Upload Signed Restricted Data Agreement*

Please upload the signed agreement in PDF format. To access SOMAR's Virtual Data Enclave, the Restricted Data Agreement must be signed and dated by both the Lead Researcher and an Institutional Representative authorized to act on their institution’s behalf. This representative is typically responsible for research compliance and agreements, legal counsel, or an executive such as a President, Vice President, Dean (in certain non U.S. institutions), or department head (in certain non U.S. institutions).

Institutional signatory name*

Please provide the name of the legal authority at the affiliated institution that will sign the restricted data agreement associated with this research project.

Institutional signatory title*

Please provide the title of the legal authority identified for this application (e.g., Vice President, Contracts Specialist, Chair, Dean, etc.).

Institutional signatory university email address*

Please provide the email address for the legal authority identified for this application.

 

Ethics Committee or Institutional Review Board (IRB) documentation

*Note: This documentation is not required for Meta Content Library API data in SOMAR VDE or datasets available via Controlled Download.