Unigram_Train_Data_Formatter Class Reference

#include <Unigram_Train_Data_Formatter.h>

Inheritance diagram for Unigram_Train_Data_Formatter (image not reproduced; classes shown: Data_Formatter, Unigram_Model_Streamer, Unigram_Test_Data_Formatter)


Public Member Functions

 Unigram_Train_Data_Formatter ()
virtual ~Unigram_Train_Data_Formatter ()
void format ()
 Perform the actual formatting.
WordIndexDictionary & get_dictionary ()
 Return the dictionary being used by the formatter.
int get_num_docs ()
 The number of documents formatted.
int get_total_num_words ()
 The total number of words found.

Protected Member Functions

virtual int insert_word_to_dict (std::string word)
int read_from_inp (LDA::unigram_document &wdoc, std::istream &inp)

Protected Attributes

WordIndexDictionary _dict
int _num_docs
int _num_words_in_all_docs
boost::unordered_set< string > _stopWords
std::ifstream _in
DocumentWriter _doc_writer

Constructor & Destructor Documentation

Unigram_Train_Data_Formatter::Unigram_Train_Data_Formatter (  ) 
Unigram_Train_Data_Formatter::~Unigram_Train_Data_Formatter (  )  [virtual]

Member Function Documentation

void Unigram_Train_Data_Formatter::format (  )  [virtual]

Perform the actual formatting.

Implements Data_Formatter.

WordIndexDictionary & Unigram_Train_Data_Formatter::get_dictionary (  )  [virtual]

Return the dictionary being used by the formatter.

Implements Data_Formatter.

int Unigram_Train_Data_Formatter::get_num_docs (  )  [virtual]

The number of documents formatted.

Implements Data_Formatter.

int Unigram_Train_Data_Formatter::get_total_num_words (  )  [virtual]

The total number of words found.

Implements Data_Formatter.
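
The public interface above suggests a simple call sequence: run format() once, then query the two counters. The following is a minimal sketch of that sequence, not the Y!LDA implementation. Formatter_Sketch, Line_Formatter_Sketch and count_corpus are hypothetical names, get_dictionary() is left out, and the input layout (one document per line, whitespace-separated words) is an assumption made only for illustration.

```cpp
#include <cassert>
#include <istream>
#include <sstream>
#include <string>
#include <utility>

// Hypothetical stand-in for Data_Formatter: three of the four public
// members listed above (get_dictionary() is omitted in this sketch).
class Formatter_Sketch {
public:
    virtual ~Formatter_Sketch() {}
    virtual void format() = 0;
    virtual int get_num_docs() = 0;
    virtual int get_total_num_words() = 0;
};

// Toy implementation. Assumption: one document per line, words split on
// whitespace, blank lines ignored.
class Line_Formatter_Sketch : public Formatter_Sketch {
public:
    explicit Line_Formatter_Sketch(std::istream &in)
        : _in(in), _num_docs(0), _num_words_in_all_docs(0) {}

    void format() override {
        std::string line;
        while (std::getline(_in, line)) {
            std::istringstream words(line);
            std::string word;
            int words_in_doc = 0;
            while (words >> word) {
                ++words_in_doc;
            }
            if (words_in_doc == 0) continue;  // skip blank lines
            ++_num_docs;
            _num_words_in_all_docs += words_in_doc;
        }
    }

    int get_num_docs() override { return _num_docs; }
    int get_total_num_words() override { return _num_words_in_all_docs; }

private:
    std::istream &_in;
    int _num_docs;
    int _num_words_in_all_docs;
};

// Hypothetical helper: format a corpus held in a string and return
// (number of documents, total number of words).
std::pair<int, int> count_corpus(const std::string &text) {
    std::istringstream in(text);
    Line_Formatter_Sketch f(in);
    f.format();
    // Every counted document holds at least one word.
    assert(f.get_total_num_words() >= f.get_num_docs());
    return std::make_pair(f.get_num_docs(), f.get_total_num_words());
}
```

In this sketch the counters are only meaningful after format() has run, which is presumably also the intended order of calls on the real class.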

virtual int Unigram_Train_Data_Formatter::insert_word_to_dict ( std::string  word  )  [protected, virtual]
int Unigram_Train_Data_Formatter::read_from_inp ( LDA::unigram_document &  wdoc, std::istream &  inp )  [protected]
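
Neither protected member is documented here, so the meaning of the int returned by insert_word_to_dict is not stated. One plausible reading is that it returns the index assigned to the word. The sketch below illustrates that reading only. Dict_Sketch is a hypothetical stand-in built on std::unordered_map, not the WordIndexDictionary class, and the "next free index" policy is an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_map>

// Hypothetical stand-in for WordIndexDictionary: maps each distinct word
// to a dense integer index in order of first appearance.
class Dict_Sketch {
public:
    // Return the existing index of `word`, or assign the next free index.
    int insert_word(const std::string &word) {
        std::unordered_map<std::string, int>::const_iterator it = _index.find(word);
        if (it != _index.end()) {
            return it->second;  // already known: index is unchanged
        }
        const int id = static_cast<int>(_index.size());
        _index.emplace(word, id);
        // Indices stay dense: after inserting id, the map holds id + 1 words.
        assert(_index.size() == static_cast<std::size_t>(id) + 1);
        return id;
    }

    int size() const { return static_cast<int>(_index.size()); }

private:
    std::unordered_map<std::string, int> _index;
};
```

Because the member is virtual, a derived class could replace this policy, for example by refusing to add words that were not already present.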

Member Data Documentation

std::ifstream Unigram_Train_Data_Formatter::_in [protected]
boost::unordered_set<string> Unigram_Train_Data_Formatter::_stopWords [protected]
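
The _stopWords and _in attributes suggest that words are read from an input stream and checked against a stop-word set. The following sketch shows one way such filtering could work. It is an illustration under assumptions, not the behaviour of read_from_inp: read_doc_sketch is a hypothetical function, std::unordered_set is used in place of boost::unordered_set, a std::vector of strings replaces LDA::unigram_document, and the return convention (words kept, or -1 at end of input) is invented for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <istream>
#include <sstream>
#include <string>
#include <unordered_set>
#include <vector>

// Reads one line from `inp` as a document and appends its words to `doc`,
// skipping any word found in `stop_words`.
// Returns the number of words kept, or -1 when `inp` has no more lines.
int read_doc_sketch(std::istream &inp,
                    const std::unordered_set<std::string> &stop_words,
                    std::vector<std::string> &doc) {
    std::string line;
    if (!std::getline(inp, line)) {
        return -1;  // end of input
    }
    const std::size_t before = doc.size();
    std::istringstream tokens(line);
    std::string word;
    int kept = 0;
    while (tokens >> word) {
        if (stop_words.count(word) != 0) {
            continue;  // stop word: drop it
        }
        doc.push_back(word);
        ++kept;
    }
    // Exactly `kept` words were appended to the document.
    assert(doc.size() == before + static_cast<std::size_t>(kept));
    return kept;
}
```

A hash set keeps the membership test at constant average cost per word, which matters when every token of a large corpus is checked.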

The documentation for this class was generated from the following files:
Generated on Tue Jul 19 11:45:30 2011 for Y!LDA by doxygen 1.6.3