
article

Data mining (DM), also known as Knowledge Discovery in Databases (KDD) or Knowledge Discovery and Data Mining (KDD), is the process of automatically searching large volumes of data for patterns. It is a fairly recent topic in computer science, but it applies many older computational techniques from statistics, information retrieval, machine learning and pattern recognition.

Definition


Data mining can be defined as "the nontrivial extraction of implicit, previously unknown, and potentially useful information from data" (W. Frawley, G. Piatetsky-Shapiro and C. Matheus, Knowledge Discovery in Databases: An Overview. AI Magazine, Fall 1992, pp. 213-228) and as "the science of extracting useful information from large data sets or databases" (D. Hand, H. Mannila and P. Smyth, Principles of Data Mining. MIT Press, Cambridge, MA, 2001. ISBN 0-262-08290-X). Although it is usually used in relation to the analysis of data, data mining, like artificial intelligence, is an umbrella term and is used with varied meanings in a wide range of contexts. It is usually associated with a business's or other organization's need to identify trends.

Data mining involves analysing data to reveal patterns or relationships, sorting through large amounts of data, and picking out relevant information or recurring patterns, e.g., extracting summary statistics from a data set.
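As a rough illustration of what "searching data for patterns" can mean in practice, the sketch below counts which pairs of items occur together most often in a set of records. It is only a minimal example under stated assumptions: the function name `frequent_pairs`, the `min_support` threshold and the shopping-basket data are all hypothetical and do not come from the sources cited above.

```python
from collections import Counter
from itertools import combinations


def frequent_pairs(transactions, min_support):
    """Return item pairs that occur together in at least min_support transactions."""
    counts = Counter()
    for items in transactions:
        # Deduplicate and sort so each unordered pair is counted once per transaction.
        for pair in combinations(sorted(set(items)), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}


# Hypothetical shopping baskets, invented purely for illustration.
baskets = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["butter", "jam"],
    ["bread", "milk", "jam"],
]

print(frequent_pairs(baskets, min_support=3))  # {('bread', 'milk'): 3}
```

Counting every pair like this is only practical for small data sets. Real data mining tools typically prune the search space rather than enumerate all combinations.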



directory of related topics

Machine Learning :: Artificial Intelligence
Data Warehousing :: Databases
Data Integrity and Cleansing Tools :: Data Warehousing
OLAP :: Databases
Log Analysis :: Site Management
Market Analysis :: Marketing
Knowledge Discovery :: Knowledge Management
Information Visualization :: Knowledge Discovery
Text Mining :: Knowledge Discovery
Statistics :: Math

 

directory of related sites

Magic: Using Data Mining Successfully - by Michael Meltzer
Meta Description: [ The leading customer management community for news and features on customer insight, customer management and CRM research. With weekly emails, case studies, a download library, and a forum for expert advice, CMC provides customer management news and tools to help maximise your CRM activities. ]

About.com on Data Mining - About.com presents a collection of original feature articles, net links, forum discussions and a chat room dedicated to data mining and data warehousing topics.
Meta Description: [ The Net's best collection of data mining and data warehousing links from your About.com guide. From data mining tutorials to data warehousing techniques, you'll find it all! ]

Bank Of Montreal Mines Knowledge From Data - Jan Mrazek says privacy and performance are key issues in business intelligence and data mining for the Bank of Montreal.

Data Mining and Knowledge Discovery - A peer-reviewed journal publishing articles on all aspects of Knowledge Discovery in Databases (KDD) and data mining methods for extracting high-level representations (patterns and models) from data. Accepts submissions of original research or technical survey articles of related fields and techniques.

Data Mining on the Web - Article by Dan Greening on data mining techniques applied to analyzing and making decisions from web data.
Meta Description: [ Internet Strategies for Technology Leaders, New Architect is a monthly publication serving highly qualified technology leaders who drive the purchase and integration of Internet and emerging technology solutions into their core business processes. Written by technical insiders, New Architect prov... ]

Data Mining Resources - A collection of data mining links maintained by Central Connecticut State University.

Digging Up $$$ with Data Mining - An Executive's Guide - Tim Graettinger. Data mining creates information assets that an organization can leverage to achieve these strategic objectives. In this article, we address some of the key questions executives have about data mining.

Distribution Analysis module for PostgreSQL - Graphical parameter distribution and function relations analysis software for PostgreSQL
Meta Description: [ distribution, analysis, functional relation, David Ciarniello, PostgreSQL ]

DSS Lab - Includes articles and guides from some of the top applied data miners including Erik Thomsen, George Spofford, and Michael Berry

Estimating Campaign Benefits and Modeling Lift (Overheads) - In assessing the potential of data mining based marketing campaigns one needs to estimate the payoff of applying modeling to the problem of predicting behavior of some target population. We present a methodology for initial cost/benefit analysis and present surprising empirical results, based on actual business data from several domains, on achievable model accuracy.

Joel Ratsaby - Describes personal, professional and research capabilities on Intelligent Systems, Data Mining, Bayesian Networks and Learning Theory. Publications available in pdf format.

Kurt Thearling: Data Mining and CRM - Information on data mining and CRM technology. Includes many articles and white papers

Market Mining Tools - A variety of modeling technologies to create response, retention, and valuation models and marketing analyses. These include Statistical Networks, Linear Regression, Logistic Regression, K-Nearest Neighbor and C4.5 Decision Trees.

500 Mi_Li_Wo Discussion Group - Emphasis on the applications of modern modeling methodology and techniques from Statistics, Data Mining, and Machine Learning.

The Data Mine - Launched in April 1994 to provide information about Data Mining (AKA Knowledge Discovery In Databases or KDD). A Twiki site full of guides, info, and links

Tyson Software - Flagship product, The Query Tool, a data mining application that performs data analysis upon any SQL database. Data can be sorted, filtered, printed and exported to a variety of formats.
Meta Description: [ The Query Tool is a powerful data mining application. It allows you to perform data analysis upon any SQL database. It has been developed predominately for the non technical user. No knowledge of SQL is required, all actions are data driven point and click. ]

Visual Basic Data Mining .NET - An online resource on data mining with applications developed using Visual Basic or the .NET framework. Features free data mining source code, applications, data mining algorithm documentation and data mining quick start guides. Includes Naive Bayes Classifiers, Decision Trees, One Rule (1Rule).
Meta Description: [ Data mining consultant offers free visual basic data mining .net source code, SQL Server support and data mining, data warehousing, relational databases, pareto analysis, software programming, business intelligence, project management ]

Web-Datamining - Web-datamining.net gathers information and discussion on Data Mining, Statistics and Knowledge Discovery, including publications, meetings and tools. In French and English.
Meta Description: [ French portal on Data Mining: the site is a place for exchange on Data Mining, statistics and the Data Warehouse, through its example studies, its forum and its newsletter. ]

Data_Mining related videos
Google Tech Talks July 20, 2007 ABSTRACT This is the Google campus version of Stats 202 which is being taught at Stanford ...

 
