Data
The CELEST TechLab demos and data repository is a web-accessible collection of software demos and data sets related to the software and articles in the CELEST TechLab articles and software repository. Users will be able to post and download demos and data, along with relevant software and literature.

Boston Remote Sensing Testbed Submitted: 05/12/2008
Description: The Boston remote sensing testbed describes a remotely sensed area (data from a Landsat 7 Thematic Mapper satellite), 360 pixels wide by 600 pixels high, or 5.4 km x 9 km in area.
Data Details: 41 dimensions, 8 classes (Beach, Ocean, Ice, River, Road, Park, Residential, Industrial). 216,000 samples total, of which 29,003 have been assigned one of the eight labels (i.e., represent the ground truth information). 41 layers of data are available for each pixel (lower resolution bands were upsampled to 15m): 6 Thematic Mapper (TM) bands at 30m resolution; two thermal bands at 60m resolution; one panchromatic band at 15m resolution; and 32 derived bands representing local contrast, color and texture.
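The figures quoted for this testbed are mutually consistent, which a few lines of arithmetic confirm. The sketch below uses only numbers stated in this entry; the variable names are illustrative and are not part of the data set or its files.

```python
# Consistency check on the figures quoted for the Boston testbed.
# All numbers come from the entry; the variable names are illustrative.
WIDTH_PX, HEIGHT_PX = 360, 600
PIXEL_SIZE_M = 15          # all bands were upsampled to 15 m
LABELED_SAMPLES = 29_003

# Layer counts by band group, as listed in the entry.
LAYERS = {
    "thematic_mapper": 6,
    "thermal": 2,
    "panchromatic": 1,
    "derived": 32,
}

total_samples = WIDTH_PX * HEIGHT_PX
extent_km = (WIDTH_PX * PIXEL_SIZE_M / 1000, HEIGHT_PX * PIXEL_SIZE_M / 1000)
total_layers = sum(LAYERS.values())
labeled_fraction = LABELED_SAMPLES / total_samples

print(total_samples)               # 216000
print(extent_km)                   # (5.4, 9.0)
print(total_layers)                # 41
print(f"{labeled_fraction:.1%}")   # 13.4%
```

Only about one pixel in seven carries a ground truth label, so any evaluation on this testbed is restricted to that labeled subset.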

Frey & Slate Letter data Submitted: 04/23/2008
Description: This data set describes statistical attributes of 20,000 digitized pictures of letters, and was used to study machine learning using Holland-style adaptive classifiers (Frey & Slate, 1991). Our copy was obtained from the UCI repository (http://www.ics.uci.edu/~mlearn/MLRepository.html).

Iris Data Set Submitted: 04/14/2008
Description: This is perhaps the best known database to be found in the pattern recognition literature. Fisher's paper is a classic in the field and is referenced frequently to this day. (See Duda & Hart, for example.) The data set contains 3 classes of 50 instances each, where each class refers to a type of iris plant. One class is linearly separable from the other 2; the latter are NOT linearly separable from each other.

Adult Data Submitted: 04/14/2008
Description: This data was extracted from the census bureau database found at http://www.census.gov/ftp/pub/DES/www/welcome.html

Abalone Data Set Submitted: 04/14/2008
Description: Predicting the age of abalone from physical measurements. The age of abalone is determined by cutting the shell through the cone, staining it, and counting the number of rings through a microscope -- a boring, time-consuming task. Other measurements, which are easier to obtain, are used to predict the age. Further information, such as weather and location (hence food availability), may be required to solve the problem.
From the original data, examples with missing values were removed (the majority having the predicted value missing), and the ranges of the continuous values have been scaled for use with an ANN (by dividing by 200).
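The two preprocessing steps described here (dropping examples with missing values, then dividing continuous values by 200) can be sketched as follows. This is a minimal illustration, not the procedure actually used to prepare the data set: the function name, the use of `None` to mark a missing value, and the sample rows are all assumptions made for the example.

```python
from typing import List, Optional, Sequence

SCALE = 200.0  # divisor quoted in the entry


def preprocess(rows: Sequence[Sequence[Optional[float]]]) -> List[List[float]]:
    """Drop rows containing a missing value, then divide each value by SCALE.

    Assumption: a missing value is represented as None, and every column
    passed in is a continuous measurement.
    """
    cleaned = []
    for row in rows:
        if any(value is None for value in row):
            continue  # discard incomplete examples
        cleaned.append([value / SCALE for value in row])
    return cleaned


# Hypothetical raw measurements, NOT taken from the abalone data set.
raw = [
    [91.0, 73.0, 19.0],
    [70.0, None, 18.0],   # incomplete row, will be removed
    [106.0, 84.0, 27.0],
]

scaled = preprocess(raw)
print(len(scaled))   # 2
print(scaled[0])     # [0.455, 0.365, 0.095]
```

Dividing by a fixed constant keeps the relative spacing of the measurements while bringing them into a small range, which is a common requirement for neural network inputs.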

Wine Data Set Submitted: 04/10/2008
Description: These data are the results of a chemical analysis of wines grown in the same region in Italy but derived from different cultivars. The analysis determined the quantities of 13 constituents found in each of the three types of wines.

UCI Knowledge Discovery in Databases Archive Submitted: 04/07/2008
Description: The UCI Knowledge Discovery in Databases Archive is an online repository of large data sets which encompasses a wide variety of data types, analysis tasks, and application areas. The primary role of this repository is to enable researchers in knowledge discovery and data mining to scale existing and future data analysis algorithms to very large and complex data sets. URL: http://kdd.ics.uci.edu/

Circle in Square Benchmark Submitted: 04/02/2008
Description: This is the circle in the square benchmark, consisting of three data sets: a 100-dimension text file set, a 1000-dimension text file set, and a Matlab set. This benchmark is extensively used in the publication: Carpenter, G.A., Grossberg, S., Markuzon, N., Reynolds, J.H., & Rosen, D.B. (1992) Fuzzy ARTMAP: A neural network architecture for incremental supervised learning of analog multidimensional maps. IEEE Transactions on Neural Networks, 3, 698-713.
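This entry does not spell out the task itself. In the cited Fuzzy ARTMAP paper, the circle-in-the-square problem asks a classifier to decide whether points of a square lie inside or outside a circle whose area is half that of the square. The sketch below generates labeled points of that kind. It does not reproduce the repository files, whose layout is not described here; the unit square, the centered circle, the function names, and the random seed are assumptions made for illustration.

```python
import math
import random

# Assumption: unit square [0, 1] x [0, 1] with a centered circle.
# The circle's area is half the square's area, so pi * r^2 = 0.5.
RADIUS = math.sqrt(0.5 / math.pi)
CENTER = (0.5, 0.5)


def in_circle(x: float, y: float) -> bool:
    """Return True if (x, y) lies inside or on the circle."""
    dx, dy = x - CENTER[0], y - CENTER[1]
    return dx * dx + dy * dy <= RADIUS * RADIUS


def make_samples(n: int, seed: int = 0):
    """Draw n uniform points in the unit square, labeled 1 (inside) or 0 (outside)."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        x, y = rng.random(), rng.random()
        samples.append((x, y, int(in_circle(x, y))))
    return samples


samples = make_samples(1000)
inside_fraction = sum(label for _, _, label in samples) / len(samples)
print(f"fraction inside the circle: {inside_fraction:.3f}")
```

Because the circle covers half of the square, the two classes are roughly balanced, so a classifier that always guesses one class scores near 50% and any gain above that reflects how well the circular boundary has been learned.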

Data Repository at the Machine Learning Research Group - University of Texas at Austin Submitted: 03/16/2008
Description: A few interesting datasets: natural language learning data, job postings data, Medline proteins and protein interactions data, and RIDDLE (Repository of Information on Duplicate Detection, Record Linkage, and Identity Uncertainty). URL: http://www.cs.utexas.edu/users/ml/welcome.html

Delve Datasets Submitted: 03/16/2008
Description: Datasets in this repository are categorized as primarily assessment, development or historical according to their recommended use. Within each category the authors have distinguished datasets as regression or classification according to how their prototasks have been created. URL: http://www.cs.toronto.edu/~delve/data/datasets.html.

The UCI Knowledge Discovery in Databases Archive Submitted: 03/16/2008
Description: This is an online repository of large data sets which encompasses a wide variety of data types, analysis tasks, and application areas. The primary role of this repository is to enable researchers in knowledge discovery and data mining to scale existing and future data analysis algorithms to very large and complex data sets. URL: http://kdd.ics.uci.edu/.

Boosting.org Submitted: 03/16/2008
Description: Boosting.org provides a forum for a balanced representation of the emerging field of Boosting and ensemble learning research. URL: http://www.boosting.org/.

UC Irvine Machine Learning Repository Submitted: 03/16/2008
Description: This website maintains 162 data sets as a service to the machine learning community.