Data science

From Wikitia
Jump to navigation Jump to search

An interdisciplinary field, data science is concerned with the extraction of knowledge and insights from noisy, structured, or unstructured data and the application of this knowledge and actionable insights across a broad range of application domains. Scientific methodology, processes, algorithms, and systems are employed in data science to extract knowledge and insights from noisy, structured, or unstructured data. Data science is associated with data mining, machine learning, and large amounts of available information.

A data scientist is someone who uses data to "understand and evaluate real events." Data science is defined as a "concept that unifies statistics, data analysis, informatics, and their associated techniques." Mathematics, statistics, computer science, information science, and domain expertise are used in conjunction with methods and ideas from a variety of disciplines. In contrast to computer science and information science, data science is a specialised field. Data scientist and Turing Award winner Jim Gray envisioned data science as a "fourth paradigm" of science, joining the other three paradigms of empirical, theoretical, computational, and now data-driven science, and asserted that "everything about science is changing" as a result of the impact of information technology and the data deluge.

A data scientist is someone who develops computer code and then integrates it with statistical expertise in order to get insights from business data.