Chi-Fan Lo

Chi-Fan Lo

MSCS @ Carnegie Mellon University

About Me

πŸ‘¨β€πŸ’» Hi! I’m a master student studying Computer Science at Carnegie Mellon University (CMU), previously graduated from National Taiwan University (NTU) with B.S. Computer Science / B.A. in Economics. I am generally interested in fields in Artificial Intelligence and Machine Learning.

πŸ“‹ Previously I’ve conducted SWE internships at quantitative trading startup Kronos, Microsoft and NVIDIA.

✏️ I write tutorials about advanced data structure and algorithms here on a random basis. Most topics I wrote are competitive programming stuff. I enjoy problem-solving and won second/fourth place at ICPC 2020/2019 Asia Taipei-Hsinchu Regional as team CRyptoGRapheR.

πŸ“š In 2021, I coordinated a training camp, IOICamp, which helps students prepare for top-tier informatics competitions like IOI and ICPC. I also founded another entry-level coding camp in 2021 summer, APCSCamp, which teaches C++ and basic algorithm for all students from elementry school to college.

πŸ”‘ PGP Public Key

Currently I’m…
  • MS student at CMU, Pittsburg, US
Education
  • Carnegie Mellon University, 2022 - 2023 (exp.)

    M.S. Computer Science

    • Courses: 15-640 Distributed Systems (Go), 10-703 Deep RL, 11-711 Advanced NLP, 15-618 Parallel Computer Architecture, 16-824 Visual Learning and Recognition, 10-708 Probablistic Graphical Models
  • National Taiwan Unversity, 2018 - 2022, graduate rank 3/140 (2\%)

    B.S. Computer Science / Economics

    • Courses: Advanced Statistical Inference, Calculus, Linear Algebra, Algorithm Design and Analysis(TA), Operating System, System Programming, Micro/Macroeconomics, Financial Engineering

Work Experience

 
 
 
 
 
July 2021 - Aug. 2021
Taipei, Taiwan
System Software Engineer Intern
NVIDIA SW RM Team
  • Designed and built GC6 Latency Perf Test, a test that measures sleep-wake latency of engines in the GPU Resource Manager(RM). It will compare the result with historical data and raise error when regression i.e. a higher sleep-wake latency is detected, using C and Python scripts for data analysis.
  • Implemented a website to visualizes the GC6 Latency Perf Test. Designed a RESTful website with NodeJS+Express as backend and ReactJS as frontend to visualize historical sleep-wake latency data.
 
 
 
 
 
Sep. 2020 - June 2021
Taipei, Taiwan
Software Engineer Intern
Microsoft, Bing Maps & Geocoding Team
  • Speed up the Bing Maps Entity Ranking / Confidence Model. Reduced the latency of the (gradient boost decision tree) model by 300+% with only 0.5% accuracy drop via feature selection and model compression techniques.
  • Deployed the BingGC Debugger, a debugger that visualize the entity ranking model. Configured it via IIS(Internet Information Service) manager.
 
 
 
 
 
July 2020 – Sep. 2020
Taipei, Taiwan
Summer Analyst
  • Designed and built a interface for backtesting system. Designed, documented, and implemented the interface with Python, MySQL and CrobJobs which let traders to launch/compare/analyze multiple tests at one click. Reduced number of manual configuration and increased the productivity of traders.
  • Improved the accuracy of backtesting. Created a linear regression model that increased the accuracy of backtesting. Discovered and fixed lots of issues/bugs in the system while studying the data.
 
 
 
 
 
May 2021 - Aug. 2021
Taipei, Taiwan
Organizer
  • Organized a 10-day intro programming summer camp. Led a team of 50+ people in organizing a fully online Intro-to-C++/Algorithm Design programming summer camp for 200+ high school/college students from scratch.
  • Raised US$8000 from multiple firms. Secured sponsorships from LINE, Shopee, Conflux.
 
 
 
 
 
July 2020 - Jan. 2021
Taipei, Taiwan
Organizer
  • Organized a 5-day competitive programming winter camp. Led a team of 40+ people, organizing a winter camp that trained 100+ high school/university students for top-tier programming contests like the International Olympiad of Informatics (IOI) and International Collegiate Programming Contest (ICPC).
  • Raised US$10000 from multiple firms. Secured sponsorships from MixerBox, Kronos Research, SYSTEX.

Projects

Hierarchical Prompting Assists Large Language Model on Web Navigation
  • Proposed a new way for large language model prompting on sequential decision task. Achieved new SOTA on a simulated web shopping environment WebShop, where the task is to find a product that matches the given natural language instruction.
  • Submitted to The 61st Annual Meeting of the Association for Computational Linguistics (ACL’23) (Under Review)
Neural Derivative Calculator with Transformer
  • Desinged a neural derivative calculator with transformer, and achieve 99% validation accuracy. It support basic arithmetic operations, parenthesis and sin, cos, exp.
  • GitHub Repo
Does ”Per Bag Trash Collection Fee” Policy Reduce The Amount of Trash?
  • Confirmed the policy effect on trash reduction with causal inference method: We analyzed the effect of Taipei government’s “Per Bag Trash Collection Fee” policy on trash amount of Taipei. We utilized Regression discontinuity design in Time (RDiT) to prove the causal relationship between the policy and the reduction of trash.
  • paper link
Covid19 footprint tracker
  • Designed a system for covid19 footprint tracking. Lead a six people team that designed and implemented a cloud-based covid19 footprint tracker. It has a simple APP/Web frontend that enable people to check-in in less then a minute. The backend is deployed on AWS EKS as a serverless service, managed with Knative. Our backend resources can scale up/down(even to 0) automatically in line with the current amount of traffic.
Document Information Extraction with BERT
  • Designed a NER model for Japanese bidding documents. Lead a three people team that designed and built a Named Entity Recognition(NER) model with BERT, which extracts a predefined set of tags from Japanese bidding documents. Applied the Question Answering framework and overcomed the scarcity of data by transfer learning and data augmentation. Further improved the model by tree-based pre-processing, prediction post-processing (sanity check), class-balanced loss and ensemble technique.
  • Placed First out of 17 teams with f1-score 97.904% (Team 18: Kaggle link).
  • Tools & technologies: Python, Pytorch, HuggingFace, BERT, NumPy, Pandas, Latex, Matplotlib
Atari Game Bot
  • Built an Atari game bot with reinforcement learning with policy gradient and deep-Q learning. The resulting agents outperforms average human performance.
  • Tools & technologies: Python, Pytorch, OpenAI Gym, NumPy
ADA Judge
  • Improved an online judge system for course Algorithm Design and Analysis(ADA). Added new feature and implemented automatic remote backup on modern cloud service e.g. Google drive.
  • Tools & technologies: Python, Javascript, HTML/CSS, Semantic UI, NodeJS, Jade(Node Template Engine), MongoDB
Cβˆ’βˆ’ Compiler
  • Designed and implemented a RISC-V compiler for Cβˆ’βˆ’, which is a simplified C that supporting basic arithmetic operations, loops, functions and arrays.
  • Tools & technologies: RISC-V, Compiler, flex(lexer), bison(parser), C

Skills

Fast learner, able to pick up new stuffs fast

  • Programming: Python (NumPy, Pandas, Matplotlib, Pytorch) β€’ C/C++ β€’ JavaScript (NodeJS) β€’ HTML/CSS β€’ SQL β€’ MongoDB
  • Technologies & Tools: Git β€’ Latex β€’ Tmux β€’ Vim β€’ Linux β€’ Github Action β€’ Github Pages
  • Languages: English (Fluent) β€’ Chinese (Native)
  • Others: Algorithm Design & Analysis β€’ Teaching β€’ typeracer.com

Awards & Certificates

Awards
  • Placed Second out of 101 teams in 2020 ICPC Asia Taipei Regional Contest (Team CRyptoGRapheR) (scoreboard)
  • Placed Fourth out of 108 teams in 2019 ICPC Asia Taipei Regional Contest (Team CRyptoGRapheR)
  • Placed Fourth out of 25 teams in 2021 AIS3 EOF CTF (Team CRyptoGRapheR) (scoreboard)
  • Placed Third out of 18 teams in 2021 TTCPC (Team CRyptoGRapheR) (scoreboard)
Certificates