This project is read-only.

Splitter EP

The Splitter EP allows developers interested in program identifier splitting to
replace Sando’s default splitting algorithm. This allows them to investigate how to split more
challenging identifiers such as FILEMANAGER and the effect of improved splitting on overall
search.

Usage

  1. Implement Sando.ExtensionContracts.SplitterContracts.IWordSplitter
  2. Add wordsplitter to Sando's configuration file (link to come)
  3. Compile wordsplitter and place dll in Sando's plugins directory (link to come)
  4. Startup Visual Studio

API

Class Sando.ExtensionContracts.SplitterContracts.IWordSplitter
Method string[] ExtractWords(string text) - Given a token split that token into words.

Default implementation

  • Splits on camel case
  • Splits upon encountering a #
  • Splits on '_'
For details see: WordSplitter.cs

Example Scenario

A user could implement this extension point such that it operates as the default implementation above except when the word is in all caps. In this case the splitter uses a dictionary of common english words and splits on any found substrings in the identifier.

References

Last edited Apr 25, 2012 at 2:18 AM by davidcshepherd, version 18

Comments

No comments yet.