Skip to main content

An Overview on Data Science

So before we get into what is data science let us first understand what is data actually, and how it is important for business, e-Commerce, for security, for identity of someone, even for scientific purpose or research and for even much more.

So data is nothing but a piece of information , the information that we are collecting could be anything it can be your date of birth, your body weight, your eyes or hair colour, your meal list, what you are searching in your mobile or computer, the places you visit, so we can say anything around you either connected to you or around you can be data. 

But if someone is novice he will ask, how all these things can be data ? Answer is Data is everywhere but what type of data is our need and which type of data is not our need makes the all difference.

Lets understand this clearly through an example- Suppose you want to do some shopping on Amazon, and you decided to buy a new mobile phone, you fixed the budget, then features that you want in the that you are looking for, also battery performance, camera quality, style but unfortunately you did not find a phone matching to your view point, so you decided to search another day, when you searched another day you find that Amazon recommending some phones based on your past search activities, and hopefully you find the type of phone that you are looking for. So how did Amazon find that the type of phone you are looking for ? Answer is when you searched for the phone and its specifications the amazon stored your search data and then it processed that data, and according to that it recommended you lots of phones of same specifications, so here data is the type of text that you have typed and what you are looking for.



What is Data Science ?

So basically data science is the science of data, in more details data science is a multidisciplinary subject which includes statistics, computer science, ML (machine Learning) and domain expertise to get knowledge  or insights of data. Though it is a multidisciplinary subject the end product of data science is to develop a data product.

Now if you are thinking what data product is, let me tell you by taking the same above example, so after when you tried to search for a phone of same specifications, according to your previous data amazon recommended you phones type you are looking for, so you can say that by taking your previous search data the e-commerce company changed that data into a data product which helped the user to get his product and by it company sold that product, so here we can that data product is nothing but a program used to solve problem.

If you are looking for definition then here it is :"Changing the data of a company into a product to solve a problem is called a data product"

since we have seen that data science is a combination of subjects, now lets take a close why these subjects are very important in the field of dat science and also what role they play,

Statistics

In data science statistics is very important because, you are dealing with a large size of data or Big Data, statistics helps to find mean, median, mode from the data and with these it helps the  data scientists to analyze and understand the data. Now there are three type statistics used in data science,

  • Descriptive Statistics : This type of statistics helps the data scientists to consolidate or summarize the data for further analysis.
  • Inferential Statistics : It helps to find the relationship between the samples of data collected.
  • Regression Analysis : This helps to find the relationship between multiple variables. 


Machine Learning
ML helps the data scientists to solve the complex calculations, an to develop algorithms and predict a variable.

Domain Expertise
Domain expertise is nothing but the knowledge of data set. Suppose if the data set is related to healthcare the healthcare is the domain expertise, if the data set is related to science the domain expertise is science, if the data set is related to business then domain expertise is business etc.

Data Visualization is also a part of data science, it has its own field called data analytics, in which you have to represent the data in graphical forms to get insights from the data.



Comments

Popular posts from this blog

How to Remove Dandruff: A Complete Guide

Dandruff can be an embarrassing and frustrating condition, but the good news is that it’s manageable. In this guide, we’ll explore what dandruff is, its causes, and the most effective ways to eliminate it. Whether you prefer home remedies or over-the-counter solutions, there’s something here for everyone. What is Dandruff? Dandruff is a common scalp condition characterized by flaking and itching. It occurs when the scalp sheds dead skin cells excessively, often due to dryness, sensitivity, or fungal infections. While it’s not harmful, it can be a nuisance and impact self-confidence. Causes of Dandruff Understanding the root causes of dandruff can help you choose the right treatment. Here are some common reasons: - Dry Skin: A dry scalp often leads to flaking, especially during winter months. - Sensitivity to Hair Products: Certain shampoos, conditioners, or styling products can irritate the scalp. - Fungal Infections: Malassezia, a type of yeast, thrives on oily scalps and can trigger ...

The Environmental Toll of Data Centers: Energy Consumption, Water Usage, and Carbon Emissions

Why Data Centers Are Danger To Environment ?     Data centers are critical for modern society because they serve as the backbone for modern infrastructure, to power modern business and technologies. They play crucial role to power modern internet, to host websites, applications and process customer data, storing huge volumes of data and powering e-commerce platforms. But with these great things there are some disadvantages are also related to data centers which makes them a threat to environment. Data centers helps in support cloud services, analytics, Storage, cloud computing, empowering streaming services like Amazon, Netflix, Facebook, You Tube, also AI and Machine learning rely on these data centers to process huge data to process business logics etc. But in order to do all these great tasks they need tremendous amount of energy and electricity to power networking, servers, storage equipment, cloud services and the infrastructure supporting these services. Data centers ae...

Revolutionizing Data Centers: Cutting-Edge Construction Techniques Reshaping the Digital Landscape

  In a world where a single data center can consume as much water in a day as a small city does in a year, the race to build more efficient digital fortresses is on. The humble data center, once a nondescript building humming with servers, has become a hotbed of architectural and engineering innovation. As these facilities evolve to meet the insatiable appetite for data processing and storage, they're reshaping the very foundations of construction technology. Modular Design: The Future of Data Center Architecture Gone are the days of painstakingly slow, brick-by-mortar builds. Today's data centers are rising from the ground at breakneck speeds, thanks to modular design. This isn't just a trend; it's a revolution, with up to 70% of facilities now being pieced together like high-tech Lego sets in factories before ever touching their final destination. The benefits are as stackable as the modules themselves: Speed demons: These prefab marvels sprint to completion 60% faste...