MockQuestions
Job interview preparation
What was the largest data set that you processed? How did you process it?
Advice
Your most prominent studies are probably already listed on your resume. Focus on the ones you are most proud of, and briefly walk the interviewer through each: how large the data set was, how you processed it, what you achieved, and what you learned.

User-Submitted Answers

1.
My PhD project was based on a cohort of more than 6,000 participants.
2.
A data set with more than 2,000 patients and more than 50 variables. I formatted and labeled it with SAS.
3.
The biggest data set I processed was 6000 × 874, and it needed to be classified.
4.
The CDC's National Health and Nutrition Examination Survey (NHANES) data set, with 10,000 subjects.
5.
10,000,000 observations. I processed them using SAS.
6.
I processed Census data in a research project by performing several analyses. I started with the well-known mean, mode, and median of different variables, and I also computed the frequencies of different variables. From there, I ran a variety of tests to see whether the data followed a normal distribution. Where it did not, I removed outliers, using quality control measures, to bring the data closer to normal.
7.
Repeated measures data using mixed models.
8.
NHANES data, SEER-Medicare data, and UnitedHealth data.
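
The workflow described in answer 6 (descriptive statistics, frequencies, a normality check, then outlier handling) can be sketched roughly as follows. This is only an illustration using Python's standard library, not the tooling any of the respondents used. The function names and the `household_sizes` values are hypothetical and invented for the example. Sample skewness stands in as a rough symmetry check, and Tukey's 1.5 × IQR rule is assumed as the outlier criterion, since the answer does not name a specific test or rule.

```python
from collections import Counter
import statistics


def summarize(values):
    """Return basic descriptive statistics and value frequencies."""
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "mode": statistics.mode(values),
        "frequencies": Counter(values),
    }


def skewness(values):
    """Population skewness; values near 0 suggest a symmetric distribution."""
    mean = statistics.mean(values)
    sd = statistics.pstdev(values)
    n = len(values)
    return sum(((x - mean) / sd) ** 3 for x in values) / n


def iqr_outliers(values, k=1.5):
    """Return values outside [Q1 - k*IQR, Q3 + k*IQR] (Tukey's rule)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [x for x in values if x < low or x > high]


# Hypothetical values, invented for illustration only.
household_sizes = [1, 2, 2, 3, 3, 3, 4, 4, 5, 21]

summary = summarize(household_sizes)
outliers = iqr_outliers(household_sizes)

# Keep the flagged values in a separate list so the removal is documented.
cleaned = [x for x in household_sizes if x not in outliers]

print(summary)
print("skewness before:", skewness(household_sizes))
print("flagged outliers:", outliers)
print("skewness after:", skewness(cleaned))
```

In an interview answer, it helps to say which normality test and which outlier rule you applied, and why the flagged records were judged to be errors rather than genuine extreme values.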