SOMAR Data Access Application Guide
Welcome! Thank you for your interest in applying for access to SOMAR’s Data!
This guide will help you complete SOMAR's application forms and prepare the required documentation for upload. Please carefully review this guide, and if you haven’t already, we recommend viewing the publicly available resources listed in the sections below for specific datasets you are interested in accessing.
When ready, please return to SOMAR’s Data Access Applications Portal (SOMAR-APPLY) and select the dataset you wish to access to begin.
IMPORTANT NOTE: Please complete the application form in one sitting. SOMAR is working to enhance the forms so progress can be saved prior to submission.
This guide is a living document and will be updated as needed for easier navigation and more clarity in the information provided.
Feel free to reach out to somar-help@umich.edu if you have any questions about the applications!
Table of Contents
- 1 SOMAR Data Access Methods and Requirements
- 2 Resources for Data from Meta
- 3 Application Forms
- 3.1 SOMAR Data Access Application
- 3.1.1 Project Application Fields
- 3.1.2 Researcher Contact Details
- 3.1.2.1 Applicant Details
- 3.1.2.2 Lead Researcher Details
- 3.1.2.3 Collaborator Details
- 3.1.2.4 Additional Contacts
- 3.1.3 Research Project Information
- 3.1.4 Additional requirements for data access in the VDE
- 3.1.5 Required Questions for Virtual Enclave Onboarding
- 3.1.6 Contracts
- 3.1.7 Restricted Data Use Agreement with ICPSR
- 3.1.8 Ethics Committee or Institutional Review Board (IRB) documentation
- 3.1.9 Applicant Acknowledgement (Meta Content Library and Ad Targeting Dataset)
- 3.1.10 Applicant Acknowledgement (U.S. 2020 Facebook and Instagram Election Study)
- 3.1.11 ICPSR Terms of Use (Datasets available via Controlled Download)
- 3.1.12 Additional Documents
- 4 Application Review
SOMAR Data Access Methods and Requirements
SOMAR data that require an application are typically available through one of two access methods. Researchers interested in exploring and using SOMAR data are encouraged to review the access methods below to understand what is required to access a data file. Please reach out to somar-help@umich.edu with any questions.
Virtual Data Enclave (VDE) - data access only available in a secure virtual environment
Some SOMAR datasets have highly sensitive or identifiable information or usage restrictions. These datasets are only available in our Virtual Data Enclave (VDE). VDE datasets are noted in the “Dataset required for research” section. One example of datasets available in the VDE is the U.S. 2020 Facebook and Instagram Election Study Collection.
Accounts needed
Jira account (sign-up required; it's free)
Application requirements
Full application - All required items must be completed and provided, including IRB or Ethics Committee Review documentation, Restricted Data Use Agreement (RDUA), CVs or resumes, etc.
Controlled Download - data download available via a secure link
Some SOMAR datasets may cover sensitive topics or have usage restrictions. These datasets require an application and agreement to follow ICPSR’s Terms of Use. However, these datasets can be used in a researcher’s own computing environment. After your application is approved, we will send you a link to download the dataset. Controlled Download datasets are noted in the “Dataset required for research” section. One example of a dataset available via Controlled Download is the State of Social Connections Study.
Accounts needed
Jira account (sign-up required; it's free)
Application requirements
Short application - the following items are not always required: RDUA, IRB or Ethics Committee Review documentation, confidential/sensitive data experience, ethical guidelines, coding/technical experience
Agreement to Terms of Use in the Application
Researchers applying to access Controlled Download data must agree to the Standard Terms of Use from ICPSR.
Resources for Data from Meta
If you haven't already, we recommend reviewing the Meta Transparency Center webpages and exploring detailed documentation to learn more about the features, search quality, and data available in Meta Content Library and Content Library API.
Meta has partnered with the Inter-university Consortium for Political and Social Research (ICPSR) at the University of Michigan to share public data from Meta’s platforms in a responsible, privacy-preserving way. This partnership is enabled through ICPSR’s industry-leading SOMAR initiative.
Meta Content Library and Content Library API
Meta Content Library User Interface (UI) is hosted by Meta Platforms. Content Library API is hosted within SOMAR’s secure Virtual Data Enclave (“VDE”). Researchers will use the virtual data enclave to access and analyze public data from the API. Content Library UI can be used separately or in conjunction with Content Library API.
Content Library UI and Content Library API have different application requirements. If you are applying for access to the UI only, you will fill out the first part of the application. If you would also like to apply for access to the API, you will be prompted to answer another set of questions based on API application requirements. These additional application requirements for Content Library API include:
A brief technical skills questionnaire
Evidence of responsible experience or use of sensitive or restricted data
Preferred programming languages
Preferred statistical software for quantitative data analysis
Restricted Data Use Agreement between ICPSR and your institution, which must be executed before you are granted access to SOMAR's VDE. Navigate to the related section below to learn more about this item.
Please note that applications may be submitted without a Data Use Agreement, with the option to upload the documents later in the process. This will help expedite the application process for earlier UI access, while you and your team work on the additional documentation for API access.
Summary of changes to Meta Content Library API (MCL API) access
Meta Content Library API users (both current and new) now have an additional option for access to MCL API. Researchers can choose to access the API either through the SOMAR Virtual Data Enclave (VDE) or the Meta Secure Research Environment (SRE), formerly known as Researcher Platform.
Lead Researchers applying to use the MCL API will now select their preferred platform—either the SOMAR VDE or the Meta SRE—as part of the application process. Lead Researchers can only select one of the options. All collaborators within a research team must use the same secure computing platform to access Meta Content Library API.
Please refer to the FAQs for more information, including the breakdown of both platform options to inform your decision during your application process.
Meta Content Library Application Pause from November 20 to December 7
To streamline the application and onboarding experience, the Meta Content Library application-hosting and account management infrastructure will be transitioned from SOMAR’s Jira application portal to a new, Meta-hosted application portal that will launch on December 8, 2025.
To facilitate this transition, Meta Content Library applications will be paused from November 20 to December 7. If you and your research team would like to apply, submit your application before November 20. During the pause, SOMAR will continue to review and process applications submitted before November 20.
Application submissions will resume on December 8 on the Meta-hosted application system. Please refer back to the Meta documentation for updated information.
If you have any additional questions, please do not hesitate to reach out to us at somar-help@umich.edu. If you have questions related to the Meta SRE, you can open a ticket through their support channel.
Ad Targeting Dataset
Please carefully review this guide, and if you haven't already, we recommend reviewing the Meta Transparency Center and exploring the detailed documentation to learn more about the dataset.
The Ad Targeting dataset contains the ad targeting logic of all of the Social Issue, Electoral, and Political (SIEP) ads run beginning August 3, 2020, on the Facebook and Instagram platforms. The coverage includes all countries where we currently have our ad authorizations and disclaimer tools available. More information on the countries this includes is available in our Business Help Center.
Application Forms
SOMAR’s Data Access Application portal enables us to collect information about your research project and your credentials, thus helping us better understand how we can support your research and data analysis.
There are two forms:
SOMAR Data Access Application
Lead Researchers or Applicants should use this form to submit requests for access to data from the Social Media Archive (SOMAR).
Collaborator Request
Lead Researchers, Applicants, or Collaborators should use this form to submit requests to add collaborators that require access to data from the Social Media Archive (SOMAR).
Note: Most fields are required, indicated with an asterisk *
SOMAR Data Access Application
Project Application Fields
The fields listed below capture information that helps SOMAR staff to record and track applications as well as easily identify a point person for each application.
| Applicable Datasets | Fields/Questions | Notes for Applicants |
|---|---|---|
| All datasets | Dataset required for research* | This is a cascading drop-down menu question. |
| U.S. 2020 Facebook and Instagram Election Study | Since you selected the U.S. 2020 Facebook and Instagram Election Study, please select the files you need for your research project. | This is a multi-select dropdown question. Please select all of the datasets you need for your research. |
| Meta Content Library API (includes UI) | Since you selected "Meta Content Library API (includes UI)," please select the secure computing platform you need for your research project. | Lead Researchers can only select one of the options. All collaborators within a research team must use the same secure computing platform to access Meta Content Library API. |
Researcher Contact Details
This application requires information that helps SOMAR staff to record and track applications as well as easily identify a point person for each application. The roles in this application form are the following:
Applicant: Individual who submits and manages the application as the primary point of contact. This role is administrative only, and applicants do not receive data access. If the applicant also needs access, they must be listed as a collaborator. The applicant may also be the Lead Researcher or another member of the research team.
Lead Researcher: Individual leading the proposed research project or larger research agenda/initiative that requires access to Meta Content Library UI and/or API. If the research project or research agenda is team-based, the Lead Researcher will be the Principal Investigator, Lead Investigator, or Research Lead, or equivalent in your organization (e.g. Social Media Monitor). In some instances, the applicant and the Lead Researcher will be the same individual.
Institutional signatory: Individual at the Lead Researcher’s affiliated institution who holds the authority to enter into legal agreements and sign contracts on the institution’s behalf. Signatories typically work in a department such as the Contracts Office, Office of Sponsored Research, Research Contracts Management office, etc. The institutional signatory will have to sign the restricted data agreement(s) associated with this application. An institutional signatory is only required if you are applying for access to the API or if the individual clickthrough agreement for Content Library UI access is not appropriate for your institution.
Collaborators: Anyone other than the Lead Researcher who has access to the data or handles the data in any capacity. If any computing staff, data librarian, research assistants (including students), or general staff will be handling the data, they must be included in this application as collaborators. Depending on institutional requirements, collaborators may be required to be officially affiliated with the Lead Researcher's institution. Please reach out to somar-help@umich.edu for any questions.
Applicant Details
| Applicable Datasets | Fields/Questions | Notes for Applicants |
|---|---|---|
| All Datasets | First name of Applicant* | |
| All Datasets | Last name of Applicant* | |
| All Datasets | Institutional email of Applicant* | The Applicant's email address must show their institutional affiliation. |
Lead Researcher Details
The fields listed below gather high-level information about the Lead Researcher, their institution, and their experience. It is important that the information provided for these fields is as current and accurate as possible, as it helps to keep SOMAR’s application reviews efficient.
Lead Researcher is the individual leading the proposed research project or larger research agenda/initiative that requires access to the dataset selected for research. If the research project or research agenda is team-based, the Lead Researcher will be the Principal Investigator, Lead Investigator, Research Lead, or equivalent in your organization (e.g. Social Media Monitor). In some instances, the applicant and the Lead Researcher will be the same individual.
| Applicable Datasets | Fields/Questions | Notes for Applicants |
|---|---|---|
| All Datasets | First name of Lead Researcher* | |
| All Datasets | Last name of Lead Researcher* | |
| All Datasets | Institutional email of Lead Researcher* | The Lead Researcher's email address must show their institutional affiliation. NOTE: Personal email addresses, such as @gmail.com, are not permitted. |
| All Datasets | Primary discipline or professional area of expertise* | |
| | Highest degree earned* | Please name the highest degree earned by the Lead Researcher. NOTE: Lead Researchers must hold a terminal degree (e.g., PhD, MD, DrPH, or JD) to access data in the VDE. |
| | Highest degree earned* | Please name the highest degree earned by the Lead Researcher (e.g., PhD, MD, DrPH, JD, MA, MS, MPH, BA, etc.). NOTE: This information is collected for demographic purposes; no particular degree is required to access Ad Targeting or Meta Content Library UI or API. |
| All Datasets | Affiliational profile URL (organization, research, or academia)* | The Lead Researcher is required to provide a personal profile showcasing their current affiliation with their organization. Please include a link to any of the following: an active institutional profile or biography, organizational website with directory or staff list, ORCID or ResearchGate profile, or a webpage for their research group, lab, or department. If providing these links is not feasible, other documentation may be submitted at the end of the application to verify the researcher’s affiliation. NOTE: LinkedIn profiles, GitHub Pages, or personal websites are not permitted. |
| | Facebook Profile URL* | Please provide the URL of the Lead Researcher's publicly accessible personal Facebook profile page. If they do not have a Facebook profile, they will need to create one to obtain access to the tools. Organizational profiles or business Pages are not allowed. |
| All Datasets | Institution name* | The Lead Researcher must be currently affiliated with the institution provided in this field. Lead Researchers must be affiliated with either an academic institution or another non-university organization, institute, or society that operates as a not-for-profit entity and holds scientific or public interest research as a primary purpose or core activity. Researchers from different disciplinary and professional backgrounds are welcome to apply. |
| All Datasets | Type of Institution* | |
| All Datasets | Country of institution (ISO Code)* | Please provide the 3-letter ISO country code for your institution (e.g., USA, CHN, AUS). To find the correct code, use the following link and search for your country’s 3-letter code: https://www.iso.org/obp/ui/#search/code/. |
| All Datasets | State/Province of institution* | Please provide the full name of the state or province of the affiliated institution. |
| All Datasets | City of institution* | Please provide the full name of the city of the affiliated institution. |
| All Datasets | Department name* | If your institution does not have a department, please add "N/A". |
| All Datasets | Institute, center, or lab name* | If you do not have anything to report, please add "N/A". |
| All Datasets | Role at institution* | Applicants will choose one answer from the options provided. |
| All Datasets | Lead Researcher's resume or curriculum vitae* | Please upload the Lead Researcher's resume or curriculum vitae in Adobe PDF or Microsoft Word format. |
Collaborator Details
| Applicable Datasets | Fields/Questions | Notes for Applicants |
|---|---|---|
| All Datasets | Are there any collaborators that need access to the requested data for this research project?* | Please select 'Yes' only if your collaborators will perform any type of analysis or data management activities. Only individuals who need to perform data analysis or directly interact with the data can be added as collaborators. |
Collaborators can be added at any time after the Lead Researcher’s application has been submitted.
A separate form must be completed for each collaborator. The following information is required:
Lead Researcher’s application ID (SOMARAPPLY-####)
Research project title
Lead Researcher’s last name and email address
To add collaborators, use the Collaborator Addition Request Form. The questions on this form can be found in this document.
Note: If the Project Application references a collaborator’s experience to meet eligibility or expertise requirements, that collaborator must complete the Collaborator Addition Request form for the application to proceed. Until the form is submitted, we cannot consider their qualifications in the review process of the Project Application, and the application will be placed on hold.
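For teams preparing several collaborator requests, a quick local check can catch a mistyped application ID before the form is submitted. The following is an illustrative sketch only. It assumes the ID is the literal prefix `SOMARAPPLY-` followed by one or more digits, as the `SOMARAPPLY-####` placeholder suggests; the exact number of digits is not stated in this guide. The helper name `looks_like_application_id` is hypothetical and is not part of any SOMAR tool.

```python
import re

# Assumption: an application ID is "SOMARAPPLY-" followed by digits.
# The guide only shows the placeholder "SOMARAPPLY-####".
_APPLICATION_ID = re.compile(r"SOMARAPPLY-\d+")


def looks_like_application_id(value: str) -> bool:
    """Return True if the value matches the assumed SOMARAPPLY-#### pattern."""
    return _APPLICATION_ID.fullmatch(value.strip()) is not None
```

A check like this only confirms the format; it cannot confirm that the ID belongs to an existing application.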
Additional Contacts
| Applicable Datasets | Fields/Questions | Notes for Applicants |
|---|---|---|
| All Datasets | Additional Email Addresses for Notifications | Provide the email addresses of anyone who should receive updates about this application. These individuals will receive the same notifications as the Applicant or Lead Researcher. Use commas to separate multiple addresses (e.g., john.doe@university.edu, jane.smith@university.edu). Individuals such as the Lead Researcher’s institutional signatory can be included in the application notifications if needed. |
Research Project Information
These items collect information about the Lead Researcher’s research project or agenda, including the summary and why the data are needed. We encourage applicants to provide as much relevant information as needed about their research. If necessary, applicants are welcome to upload additional supporting documentation about the research project or the researcher’s experience later in the application form.
| Applicable Datasets | Fields/Questions | Notes for Applicants |
|---|---|---|
| All Datasets | Research Project Title* | Please provide your project title or a brief, 1-2 sentence overview of your research agenda. Examples could include “investigate public health discourse on social media,” “study social media content related to upcoming presidential elections,” etc. |
| | Will your research contribute to the detection, identification, and understanding of systemic risks in the European Union?* | Please select “Yes” or “No”. |
| All Datasets | Research project summary* | In 250 words or fewer (up to 1750 characters), describe your research agenda or plans for a non-specialist audience. Your research agenda can include longer-term or ongoing research that you and your team are conducting. Please be sure to address your general area of focus, guiding research questions, and methodologies, as well as any endpoints, data types, or data fields of interest. If you haven't already, please consult the publicly available documentation about the data before you begin your application. |
| All Datasets | Keywords associated with research* | List the keywords associated with the research project. Please separate the keywords with commas. |
| All Datasets | Funding source(s)* | List the funding sources that will support the Lead Researcher's data analysis. Please separate the funding items with commas. You may enter "N/A" if there are none. |
| All Datasets | Justification for why requested data are required for your research activities.* | (250 words or fewer) Please briefly explain why the data you are applying to access are required for your research activities. |
| All Datasets | Research outcomes* | Please select the intended outcome(s) of your research project. One or more answers may be selected from the options provided. |
| | Research region of interest* | Please select the region of interest from the drop-down menu as applicable to the research project. |
| | Primary research country (or countries) of interest* | Please select the country of interest from the drop-down menu as applicable to the research project. |
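The summary and justification fields above carry length limits (250 words, with the summary also noting up to 1750 characters). Because the form should be completed in one sitting, it can help to check drafted text locally before pasting it in. The sketch below is one possible approach, not the form's own validation: it assumes words are separated by whitespace and that characters are counted including spaces, which may differ from how the form counts. The function name `within_limits` is hypothetical.

```python
def within_limits(text: str, max_words: int = 250, max_chars: int = 1750) -> bool:
    """Check drafted text against the assumed word and character limits.

    Assumptions: words are whitespace-separated tokens, and the character
    count includes spaces and punctuation.
    """
    word_count = len(text.split())
    return word_count <= max_words and len(text) <= max_chars
```

For the justification field, where only a word limit is stated, the character limit can be relaxed by passing a larger `max_chars` value.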
Additional requirements for data access in the VDE
These items collect additional information about the Lead Researcher and research project to be reviewed for access to the data in the Virtual Data Enclave (e.g., Meta Content Library API). If necessary, applicants are welcome to upload additional supporting documentation about the research project or the researcher’s experience later in the application form.
| Applicable Datasets | Fields/Questions | Notes for Applicants |
|---|---|---|
| | Including any education, the total number of years of coding experience* | Information about the Lead Researcher's coding experience is necessary to inform SOMAR how best to assist researchers with VDE access. Please provide details in these next few questions. Coding experience can come from using Python, R, Java, C++, and so on. Applicants will choose one answer from the options provided. |
| | Preferred programming languages* | Please select your preferred programming languages. |
| | Preferred statistical software for quantitative data analysis* | Please select your preferred statistical software for quantitative data analysis. |
| | Evidence of technical skills of Lead Researcher and/or collaborators* | Experience with Python, R, SQL, or another coding or querying language is recommended for VDE access. Explain your experience with a coding or querying language here, or provide evidence via a link to a GitHub repository or other location where code examples have been shared. Access to the VDE is not dependent on experience with coding. This field is to help SOMAR know how to best support researchers. Note: If the Project Application references a collaborator’s experience to meet eligibility or expertise requirements, that collaborator must complete the Collaborator Addition Request form for the application to proceed. Until the form is submitted, we cannot consider their qualifications in the review process of the Project Application, and the application will be placed on hold. The character limit is 5000. |
| | Evidence of responsible experience or use of sensitive or restricted data* | Evidence of responsible experience or understanding of sensitive or restricted-use datasets is required. Please provide up to 3 citations or examples of your research that demonstrate your experience or understanding of using sensitive data. The character limit is 5000. |
| | Ethical guidelines to be used to guide research agenda* | Please 1) indicate the ethical guidelines you will use to guide your research project, and 2) describe how you plan to apply specific principles in practice. For example, you may refer to the AOIR Ethics, NeurIPS Code of Ethics, or Williams, Burnap, and Sloan (2017), but you must also describe how you adhere to the guidelines you selected. |
| | I understand that these data are to be used solely for statistical analysis and reporting of aggregated information and not for the investigation of specific individuals or organizations.* | This field is required. Please select “True” or “False”. |
| | I understand that I and any collaborator I include on my team must have the skill sets to use the data if granted access.* | This field is required. Please select “True” or “False”. |
Required Questions for Virtual Enclave Onboarding
Each person using the VDE must complete the following ICPSR VDE training as part of the application and onboarding process. Please take the following steps:
1. Watch the ICPSR VDE Training video (~7 minutes).
   - Please note that SOMAR’s VDE environment has some differences, but the VDE security information remains the same.
2. Complete the VDE Training Quiz (~1-2 minutes).
   - Although instructions in the confirmation of submission message ask researchers to email ICPSR that they completed the quiz, SOMAR does not need these messages. We will onboard researchers to the VDE after their applications have been approved and all requirements for data access in the VDE are met.
   - If you answered any quiz questions incorrectly, please rewatch the video.
3. Complete the questions directly below.
Applicable Datasets | Fields/Questions | Notes for Applicants |
|---|---|---|
|