Skip to content
/ LSH-IS Public

Instance selection of linear complexity for big data

License

Notifications You must be signed in to change notification settings

alvarag/LSH-IS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LSH-IS: Instance selection of linear complexity for big data

This is an open-source filter for Weka based on locality sensitive hashing. Two algorithms are available and both have linear complexity.

Cite this software as:

Á. Arnaiz-González, J-F. Díez Pastor, Juan J. Rodríguez, C. García Osorio. Instance selection of linear complexity for big data. Knowledge-Based Systems, 107, 83-95. doi: 10.1016/j.knosys.2016.05.056

@article{ArnaizGonzalez2016,   
  title = "Instance selection of linear complexity for big data",   
  journal = "Knowledge-Based Systems ",   
  volume = "107",   
  pages = "83 - 95",   
  year = "2016",   
  issn = "0950-7051",   
  doi = "10.1016/j.knosys.2016.05.056",   
  author = "\'{A}lvar Arnaiz-Gonz\'{a}lez and Jos\'{e} F. D\'{i}ez-Pastor and Juan J. Rodr\'{i}guez and C\'{e}sar Garc\'{i}a-Osorio"   
}

How to use

Download and build with ant

  • Download source code: It is host on GitHub. To get the sources and compile them we will need git instructions. The specifically command is: git clone https://github.com/alvarag/LSH-IS.git
  • Build jar file: ant dist_weka It generates the jar file under /dist/weka

How to run

Include the file instanceselection.jar into the path. Example:

java -cp instanceselection.jar:weka.jar weka.gui.GUIChooser

The new filter can be found in: weka/filters/supervised/instance.

About

Instance selection of linear complexity for big data

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages