Erika Siregar

Data Scientist & R Developer

View My Work

About Me

Erika Siregar

Hello, my name is Erika Siregar. I am a data scientist from Indonesia, currently pursuing a PhD in Information Science at the University of Sheffield, UK. Prior to my PhD, I worked as a data scientist at BPS–Statistics Indonesia, the national statistics agency, where I focused on building data pipelines and visual analytics to support public sector decision-making.

In addition to my academic and professional work, I actively contribute to the R community through R-Ladies, open-source projects, and workshops, particularly with R-Ladies Jakarta, which I co-founded.

This portfolio showcases selected R-based projects I have developed or contributed to across academic research, open-source initiatives, and applied data analysis. My work includes web scraping, automation, data cleaning, statistical modelling, and interactive visualisation, often focused on social impact, public policy, and Indonesian contexts.

Featured R Projects

rPlaywright: Modern Web Automation for R

rPlaywright: An R wrapper for Playwright

An R wrapper for Playwright, allowing users to automate modern browser interactions (e.g., navigation, clicks, scraping) in Chromium, Firefox, and WebKit. Developed as part of my participation in the rOpenSci Champions Program 2024.

Tools: V8, jsonlite, processx, Node.js (Playwright), R6
  • Programmatic browser control from R
  • Scrapes JavaScript-heavy websites
  • Useful for UI testing, automation, and modern web scraping
learningtower PISA data visualization

learningtower: OECD PISA Datasets (2000-2022)

A user-friendly R package providing easy access to PISA data from OECD. Developed as part of rOpenSci OzUnconf 2019 and available on CRAN.

Tools: tidyverse, readxl, knitr, rmarkdown, ggplot2, forcats, scales
  • Cleaned and Sampled PISA Data (2000–2022)
  • User-Friendly Access to Structured Datasets
  • Ideal for Educational and Analytical Applications
COVID-19 Economic Vulnerability Tracker

Indonesia COVID-19 Economic Vulnerability Tracker

An interactive Shiny dashboard to track economically vulnerable provinces in Indonesia concerning COVID-19, utilizing both official statistics and big data proxies.

Tools: shiny, ggplot2, dplyr, readr, broom, knitr, rmarkdown
  • Tracks economic vulnerability at provincial level
  • Combines official data with social media proxies
  • Interactive dashboard for policy insights
Bioenergy Potential Dashboard

Bioenergy Potential from Agro-industrial Waste Dashboard

Interactive bioenergy potential dashboard with dynamic charts, maps, and distance calculations. Developed in collaboration with GIZ Indonesia.

Tools: shiny, leaflet, ggplot2, highcharter, geojson, MongoDB
  • Distance calculation between supply & demand points
  • Regional mapping and comparison
  • Built to support bioenergy planning
LiteRate Data Visualization Tool

LiteRate: Data Visualization for BPS-Statistics Indonesia

Interactive data exploration tool built with R Shiny for visualizing data from BPS-Statistics Indonesia's website, using advanced web scraping techniques.

Tools: shiny, rvest, ggplot2, DT, wordcloud
  • Handles lazy loading and complex HTML structures
  • Multiple plot types and filters
  • Exportable visual outputs
AnalevR Analysis Platform

AnalevR: Modular R-Based Analysis Platform

Online R-based analysis environment with notebook-like interface, allowing analysis and visualization without owning raw data. Presented at NTTS 2019.

Tools: shiny, ggplot2, dplyr, ReactJS, PHP, Redis, Webdis
  • Interactive survey data analysis via web interface
  • On-Demand R session handling with real-time visualization
  • Modular architecture for scalable multi-user support
Biomethane Rollout Strategy Dashboard

Biomethane Rollout Strategy Dashboard

Interactive real-time dashboard showing biomethane potential supply and demand in Indonesia. Developed in collaboration with GIZ Indonesia.

Tools: shiny, ggplot2, dplyr, leaflet, geojson, MongoDB, highcharter
  • Dynamic charts showing potential biomethane by region
  • Supply map showing potential demands within radius
  • Demand map showing potential supplies within radius
Food Price Monitoring Dashboard

National Strategic Food Price Data Dashboard

Interactive dashboard monitoring strategic food commodity prices in Indonesia using automated web scraping and dynamic visualizations.

Tools: shiny, highcharter, rvest, xml2, dplyr, tidyr, lubridate
  • Automated daily scraping of food-price tables
  • Interactive dashboard with dynamic charts and maps
  • Rich exploration tools with export capabilities
Web Scraping Hotel Data

Web Crawling for Hotel and Restaurant Information

Large-scale web scraping project collecting hotel and restaurant data from travel platforms, gathering 43,600 records with advanced scraping techniques.

Tools: rvest, jQuery (Node.js), MongoDB
  • Solved scraping challenges like lazy loading & virtual scroll
  • Supports tourism and location analytics use cases
  • Collected comprehensive location and business data

Get In Touch

I'm always interested in collaborating on data science projects, R package development, and research opportunities.