|
Joshua
open source statistical hierarchical phrase-based machine translation system
|
Public Member Functions | |
| Sentence (String sentence, int id) | |
| boolean | isEmpty () |
| int | id () |
| String | sentence () |
| int[] | intSentence () |
| List< ConstraintSpan > | constraints () |
| Lattice< Integer > | intLattice () |
Static Protected Attributes | |
| static final Pattern | SEG_START |
| static final Pattern | SEG_END = Pattern.compile("\\s*</seg\\s*>\\s*$") |
Private Attributes | |
| int | sequenceId = -1 |
| int | id = -1 |
| String | sentence |
| List< ConstraintSpan > | constraints |
Static Private Attributes | |
| static final Logger | logger = Logger.getLogger(Sentence.class.getName()) |
This class represents a basic input sentence. A sentence is a sequence of UTF-8 characters denoting a string of source language words. The sequence can optionally be wrapped in <seg id="N">...</seg> tags, which are then used to set the sentence number (a 0-indexed ID).
| joshua.decoder.segment_file.Sentence.Sentence | ( | String | sentence, |
| int | id | ||
| ) |
| Lattice<Integer> joshua.decoder.segment_file.Sentence.intLattice | ( | ) |
Reimplemented in joshua.decoder.segment_file.LatticeInput.
Reimplemented in joshua.decoder.segment_file.ParsedSentence.
| boolean joshua.decoder.segment_file.Sentence.isEmpty | ( | ) |
List<ConstraintSpan> joshua.decoder.segment_file.Sentence.constraints [private] |
int joshua.decoder.segment_file.Sentence.id = -1 [private] |
final Logger joshua.decoder.segment_file.Sentence.logger = Logger.getLogger(Sentence.class.getName()) [static, private] |
final Pattern joshua.decoder.segment_file.Sentence.SEG_END = Pattern.compile("\\s*</seg\\s*>\\s*$") [static, protected] |
final Pattern joshua.decoder.segment_file.Sentence.SEG_START [static, protected] |
Pattern
.compile("^\\s*<seg\\s+id=\"?(\\d+)\"?[^>]*>\\s*")
String joshua.decoder.segment_file.Sentence.sentence [private] |
int joshua.decoder.segment_file.Sentence.sequenceId = -1 [private] |