I am a data scientist with an education company in Irvine, CA; Ask Me Anything!

Kate Ta
Aug 28, 2017

This will be my second AMA about data science! Data science was a natural career fit for me given my background in statistics & business. I would love to answer questions about the impact of data on business, as well as how to maintain, if not qualify, data using data science skills! Please also ask me what data science is if you do not understand it; it is very important to me to let people know the benefits & responsibilities involved with data science.

Conversation

What is it that you love about data science?

What would you class as data exactly?

Yikes, everything that is reasonably relevant to my business question. I would not store information on Ikea chairs & their form factor if I were trying to answer a question about a business's expense sheet. I might if I were trying to predict the hype cycle of Ikea's products compared to my own products.

Thanks for your answer Kate

Aug 28, 12:14PM EDT

Have you got a link to a FB or Twitter page I could follow?

Aug 28, 7:15AM EDT

I have a Twitter: @ktbernoulli.

What is the difference between a data scientist & a data engineer?

A data engineer combines their knowledge of data storage with data maintenance. If a data warehouse is what you desire, then a data engineer is who you need to buy it / maintain it. Once you're happy with the data you've been able to collect & want to analyze it for information, come talk to the data scientist.

I get it thanks, so you basically analyze data in terms of specific trends, correct?

Is data science something used by supermarkets & retailers when figuring out what to stock based on their app/sales data?

Before data science, no. Grocers / big retailers sometimes have some sort of company tracking inventory that automatically tells them approximately when to buy next, or they have some kind of subcontract that allows them to buy only at certain times. After data science, the game changes: regardless of which of the two scenarios they are in, I'm sure they will know not only how much to buy, but also how much to sell, how long to sell it for, how often to restock shelves, etc.

Who uses your service mostly?

How can a business gather useful data about their target market?

The only problem with collecting data on your target market is that there will be instances where your actual market is not the supposed target market for your product. You can definitely start with who has historically been buying your product over the time you have been offering it, while also checking whether your current customers are the same as the previous ones. What's really interesting is someone completely outside of your supposed target market buying your product. That is the person you should really analyse to see why they did not fit your predictions & how you would be able to capture more customers like them.
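
As a rough sketch of that comparison, assuming a hypothetical purchase log (the customer names, segments & years below are all made up for illustration), a few lines of Python might look like this:

```python
# Hypothetical purchase records: (customer, segment, year).
# Every name and value here is an assumption for illustration only.
purchases = [
    ("ana", "students", 2015),
    ("ben", "students", 2016),
    ("ana", "students", 2017),
    ("cho", "retirees", 2017),
]
target_segments = {"students"}  # the supposed target market (assumed)
current_year = 2017

# Customers who bought before the current year vs. during it.
previous = {c for c, _, y in purchases if y < current_year}
current = {c for c, _, y in purchases if y == current_year}

returning = current & previous  # current customers who also bought before
new = current - previous        # current customers seen for the first time

# Buyers whose segment falls outside the supposed target market.
outside_target = {c for c, s, _ in purchases if s not in target_segments}

print("returning:", returning)
print("new:", new)
print("outside target:", outside_target)
```

The customers in `outside_target` are the ones worth a closer look, since they bought despite not fitting the expected profile.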

Ah ok. Do you think everything can be analyzed and how big is the margin of error?

How long has data science been around?

I've been told, inaccurately, that data science is just machine learning repackaged over the years, but that's pretty wrong in my opinion. Those who do not know the full scope of what a data scientist can do should respectfully sit down & consider whether they have ever been considered for a recommendation systems project or incorporated image data in their studies. So, quite frankly, five years.

Ok, so it is a fairly new job then. Are there any courses or how do you train to become a data scientist?

What should business owners be measuring exactly?

You should consider your customers the first thing you collect data on. It sounds like a privacy issue, but as long as you are getting extremely specific, in a 'how many times did Bob walk into my store this past week?' kind of situation, most customers would be delighted by the personalization of their experience with your product.
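
A minimal sketch of answering that 'Bob' question, assuming a hypothetical visit log (the names & dates below are invented for illustration), could look like this in Python:

```python
from collections import Counter
from datetime import date, timedelta

# Hypothetical visit log: (customer, visit date). All values are made up.
visits = [
    ("Bob", date(2017, 8, 22)),
    ("Bob", date(2017, 8, 25)),
    ("Bob", date(2017, 8, 10)),
    ("Alice", date(2017, 8, 27)),
]

def visits_in_past_week(log, today):
    """Count visits per customer in the 7 days ending on `today` (inclusive)."""
    start = today - timedelta(days=6)
    return Counter(name for name, day in log if start <= day <= today)

counts = visits_in_past_week(visits, date(2017, 8, 28))
print(counts["Bob"], counts["Alice"])
```

Visits older than the seven-day window are simply left out of the count.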

Ah ok. Does it not take up quite a lot of time, and for a small shop is it really worth it in your opinion?

What is involved in your job?

As a consultant, I am given either bits & pieces of a project that fall within my specialty or the whole project end-to-end. Both require the same amount of dedication because I personally hold a bare minimum of quality before I even let the work be considered viewable.

As an educator, I am always flexing the material to better suit the students to really help them both understand data science & realize that their industry needs it. 

I understand you gather a lot of facts which you subsequently analyze, but how accurate is the analysis when you are dealing with the unpredictability of human behavior?

What is data science research & what is it used for?

Data science research can be either research using data science or research on future techniques for data science. Either one needs a PhD & patience, because research itself is pure hard work & the data science techniques being discovered are very industry specific.

I understand, is marketing and psychology part of the subjects that are taught when you study to become a data scientist?

What is your favourite thing about your job?

Being able to educate those who do not have a background in data science about the uses of data science. Their eyes really open up about what they can now answer / do once they acquire the skills to do so. 

Great to hear that you have a fulfilling job that gives you job satisfaction!

What should every business be watching in regards to useful data?

Useful is a very debatable word. One moment, you do not care about the precise ratio of peas in your product's soup, & the next day a pea shortage is announced, so now you have to care. It might be as trivial as that or much more serious. So, in my experience so far, it is better to have too much data than to have too little data.

I hear you. 

Would you say data brings clarity?

In all seriousness, if a person from a division that's not your own were to come over & understand with crystal clarity what your data is / how you are collecting, storing & labeling it, then that is the clarity we all want to achieve. Unfortunately, there will be many instances of companies having little to no notes on why they label their data a certain way & expecting outsiders to know exactly what they mean.

I understand. You check all the facts available and then analyze them and find the 'mistakes' so you or the company can come to a solution on how to fix them. What is the most interesting case you have seen so far?

What is machine learning in data science?

Machine learning is a subdivision of data science; while the true work of data science lies in the data preparation, we only prepare the data for the machine learning algorithms we will use later. What sets data science apart from machine learning is that data science then goes beyond machine learning & serves a business purpose.

Is this predominantly used in governments and other large corporations? 

How should data be stored to be of most use?

If you are talking about database data, then databases. If you are talking about images, videos, Twitter feeds, Facebook notifications, etc., then by any means necessary.

Thanks. I guess everyone has their own system on how they handle their information as well. 

Have you personally made any breakthroughs in regards to reading data?

Reading data? Hm, there's the machine learning aspect that might guide the first steps of your analysis of the data, but physically 'reading' the data yourself? None, besides pure research on the data you are dealing with.

What is the role of data science in modern society?

Hm, from my perspective, data science has the role of linking the ideals of academia with the pureness of business today. A data scientist knows very well what their skills could offer the world of academia, but chooses to display them / apply them to their chosen industry's / company's issues.

Do you believe there is such a thing as over-analyzing data?

What is Data Engineering?

To a data scientist, a data engineer is who we come to when we feel that the data presented to us neither meets a minimum quality nor can be fixed by us. Data engineers are in charge of both maintaining a company's data & keeping its quality in check.

Ah ok, I get it. So it only really applies to big companies, correct?

How long have you worked in this field?

I have been in the data science field for about 2 years.

Do you work in a team or does that depend on the size/volume of data you need to work with?

What is data science?

Data science is a multidisciplinary field that includes programming, statistics, a sense of communication, & domain expertise. Some data scientists have scraped by without, say, domain expertise because their team/industry has domain experts for them to lean on.

For what do you need to know programming and what programming language is usually required? Also, what do you mean by domain expertise?

