توضیحات
ABSTRACT
n this paper, we review recent progresses in the area of mining data from multiple data sources. The advancement of information communication technology has generated a large amount of data from different sources, which may be stored in different geological locations. Mining data from multiple data sources to extract useful information is considered to be a very challenging task in the field of data mining, especially in the current big data era. The methods of mining multiple data sources can be divided mainly into four groups: (i) pattern analysis, (ii) multiple data source classification, (iii) multiple data source clustering, and (iv) multiple data source fusion. The main purpose of this review is to systematically explore the ideas behind current multiple data source mining methods and to consolidate recent research results in this field.
INTRODUCTION
The advancement of information communication technology has generated a large amount of data from different sources, which may be stored in different geological locations. Each database may have its own structure to store data. Mining multiple data sources distributed at different geological locations to discover useful patterns are critical important for decision making. In particular, the Internet can be seen as a large, distributed data repository consisting of a variety of data sources and formats, which can provide abundant information and knowledge. Data from different sources may seem irrelevant to each other. Once information generated from different sources is integrated, new and useful knowledge may emerge. Here is an excellent example of how an organization to utilize mining data from different data sources to obtain profound information, which cannot obtain from an individual source. The Australian Taxation Office (ATO) mines data from different data sources such as social media posts, private school records and immigration data to detect tax cheats. Mining data from different data sources become a sophisticated tool to crackdown tax cheats .that yielded nearly $10 billion in 2016
Year: ۲۰۱۸
Publisher : ELSEVIER
By : Ruili Wang , Wanting Ji , Mingzhe Liu, Xun Wang , Jian Weng , Song Deng Suying Gao , Chang-an Yuan
File Information: English Language/ 9 Page / size: 412 KB
سال : ۱۳۹۶
ناشر : ELSEVIER
کاری از : Ruili Wang، Wanting Ji، Mingzhe لیو، Xun وانگ، جیان ونگ، آهنگ دنگ Suying گائو، چانگ یوان
اطلاعات فایل : زبان انگلیسی / 9 صفحه / حجم : KB 412
نقد و بررسیها
هنوز بررسیای ثبت نشده است.