A easy-to-use toolkit designed from the get-go to provide users with the means necessary to map a set of strings to a set of vectors swimmingly. #String mapper #String to vector #Map string #Mapper #String #Vector
Sally is a simple, easy to use, small and open source tool for mapping a set of strings to a set of vectors.
This mapping is referred to as embedding and allows for applying techniques of machine learning and data mining for analysis of string data.
Also, Sally can applied to several types of string data, such as text documents, DNA sequences or log files, where it can handle common formats such as directories, archives and text files of string data.
Sally implements a standard technique for mapping strings to a vector space that is often referred to as vector space model or bag-of-words model.
The strings are characterized by a set of features, where each feature is associated with one dimension of the vector space.
The following types of features are supported by Sally: bytes, words, n-grams of bytes and n-grams of words.
NOTE: Detailed installation instructions can be accessed HERE.
System requirements
- libconfig 1.4 or later
- libarchive 2.7 or later
What's new in Sally 1.0.0:
- checks for new parameters
- changed naming of ngram_delim to token_delim. see harry.
- adapted naming to harry configuration
- adapted test cases to new default setup
Sally 1.0.0
add to watchlist add to download basket send us an update REPORT- runs on:
- Mac OS X (-)
- file size:
- 620 KB
- filename:
- sally-1.0.0.tar.gz
- main category:
- Developer Tools
- developer:
- visit homepage
4k Video Downloader
IrfanView
Microsoft Teams
ShareX
Bitdefender Antivirus Free
Zoom Client
calibre
7-Zip
paint.net
Windows Sandbox Launcher
- 7-Zip
- paint.net
- Windows Sandbox Launcher
- 4k Video Downloader
- IrfanView
- Microsoft Teams
- ShareX
- Bitdefender Antivirus Free
- Zoom Client
- calibre