Project Description

Sally is a tool for mapping a set of strings to a set of vectors. This mapping is referred to as embedding, and it allows machine learning and data mining techniques to be applied to the analysis of string data such as text documents, DNA sequences, or log files. Sally uses the vector space model, also known as the bag-of-words model: strings are characterized by a set of features, and each feature is associated with one dimension of the vector space. By default, the occurrences of each feature in a string are counted; alternatively, binary or TF-IDF values can be computed. Vectors can be output in plain text, LibSVM, or MATLAB format.
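
The embedding described above can be illustrated with a small Python sketch. This is not Sally's implementation or interface. The function names (`ngrams`, `embed`, `to_libsvm`), the choice of character n-grams as features, and the particular TF-IDF weighting (count times log of N over document frequency) are all assumptions made for illustration only.

```python
import math
from collections import Counter


def ngrams(s, n=3):
    """Return the overlapping character n-grams of a string (assumed feature type)."""
    return [s[i:i + n] for i in range(len(s) - n + 1)]


def embed(strings, n=3, mode="count"):
    """Map strings to sparse vectors, stored as dicts {dimension: value}.

    mode is "count", "binary", or "tfidf" (hypothetical names).
    """
    counts = [Counter(ngrams(s, n)) for s in strings]
    # Each distinct feature gets one dimension of the vector space.
    vocab = {f: i for i, f in enumerate(sorted(set().union(*counts)))}
    # Document frequency: number of strings in which a feature occurs.
    df = Counter(f for c in counts for f in c)
    vectors = []
    for c in counts:
        vec = {}
        for feature, k in c.items():
            if mode == "binary":
                value = 1.0
            elif mode == "tfidf":
                # One common TF-IDF variant; other weightings exist.
                value = k * math.log(len(strings) / df[feature])
            else:
                value = float(k)
            vec[vocab[feature]] = value
        vectors.append(vec)
    return vectors, vocab


def to_libsvm(vec, label=0):
    """Format one sparse vector as a LibSVM line with 1-based ascending indices."""
    pairs = [f"{i + 1}:{v:g}" for i, v in sorted(vec.items())]
    return " ".join([str(label)] + pairs)


if __name__ == "__main__":
    vectors, vocab = embed(["abab", "abc"], n=2)
    for vec in vectors:
        print(to_libsvm(vec))
```

With the two example strings and n=2, the features are "ab", "ba", and "bc", so each string becomes a sparse three-dimensional vector. Storing only the non-zero entries keeps the representation compact when the feature set is large.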

System Requirements

No system requirements are defined.
Information regarding Project Releases and Project Resources. Note that the information here is quoted from the Freecode.com page, and the downloads themselves may not be hosted on OSDN.

2011-08-22 20:10
0.6.3

A bug in the FASTA input module has been fixed.

2011-07-10 06:27
0.6.2

Support for the clustering software CLUTO has been added as a new output module. The documentation has been extended.

2011-04-01 18:06
0.6.1

A missing configuration file was added to the package.

2011-02-21 20:24
0.6.0

Support for a system-wide configuration has been added. Additionally, all configuration parameters can be specified on the command line. The manual page and documentation have been updated and extended.

2010-10-08 07:15
0.5.2

The MATLAB export has been improved. Feature vectors are now stored as a struct array along with labels and sources.

Project Resources