Joshua
open source statistical hierarchical phrase-based machine translation system
joshua.subsample.AlignedSubsampler Class Reference


Public Member Functions

 AlignedSubsampler (String[] testFiles, int maxN, int targetCount) throws IOException
void subsample (String filelist, float targetFtoERatio, String extf, String exte, String exta, String fpath, String epath, String apath, String output) throws IOException

Static Public Member Functions

static void main (String[] args)

Detailed Description

A subsampler that takes word alignments in addition to the F and E files. To avoid duplicating code, this class uses callbacks to "override" the superclass methods.
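
The callback approach described above can be illustrated with a minimal sketch. Every name in it (CallbackSketch, select, plain, aligned, the " ||| " separator) is hypothetical and chosen for illustration; it is not taken from Joshua's source. It only shows the general idea: the selection loop is written once, and the caller passes in a callback that decides what is emitted for each selected line, so the aligned variant reuses the loop while adding a third field.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.IntFunction;

// Hypothetical illustration only; not Joshua's actual implementation.
class CallbackSketch {

    // Shared driver: walks the line indices once and asks the callback
    // what to emit for each one, stopping after `limit` lines.
    static List<String> select(int lineCount, int limit, IntFunction<String> emit) {
        List<String> out = new ArrayList<>();
        for (int i = 0; i < lineCount && out.size() < limit; i++) {
            out.add(emit.apply(i));
        }
        return out;
    }

    // Plain behaviour: emit only the F and E sides.
    static List<String> plain(List<String> f, List<String> e, int limit) {
        return select(f.size(), limit, i -> f.get(i) + " ||| " + e.get(i));
    }

    // Aligned behaviour: same loop, but the callback also emits the alignment.
    static List<String> aligned(List<String> f, List<String> e, List<String> a, int limit) {
        return select(f.size(), limit,
                i -> f.get(i) + " ||| " + e.get(i) + " ||| " + a.get(i));
    }

    public static void main(String[] args) {
        List<String> f = Arrays.asList("el gato", "la casa");
        List<String> e = Arrays.asList("the cat", "the house");
        List<String> a = Arrays.asList("0-0 1-1", "0-0 1-1");
        for (String line : aligned(f, e, a, 2)) {
            System.out.println(line);
        }
    }
}
```

The design point is that only the callback differs between the two variants, so nothing in the driver loop has to be copied.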

See also:
joshua.subsample.Subsampler
Author:
wren ng thornton wren@users.sourceforge.net
Version:
$LastChangedDate$

Constructor & Destructor Documentation

joshua.subsample.AlignedSubsampler.AlignedSubsampler (String[] testFiles, int maxN, int targetCount) throws IOException


Member Function Documentation

static void joshua.subsample.AlignedSubsampler.main ( String[]  args) [static]

Reimplemented from joshua.subsample.Subsampler.

void joshua.subsample.AlignedSubsampler.subsample (String filelist, float targetFtoERatio, String extf, String exte, String exta, String fpath, String epath, String apath, String output) throws IOException
Parameters:
filelist          list of source files to subsample from
targetFtoERatio   goal for ratio of output F length to output E length
extf              extension of F files
exte              extension of E files
exta              extension of alignment files
fpath             path to source F files
epath             path to source E files
apath             path to source alignment files
output            basename for output files (will append extensions)
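
To make the targetFtoERatio parameter concrete, the following sketch computes an F-to-E length ratio for a set of parallel lines. It assumes that "length" means the number of whitespace-separated tokens; this page does not say how Joshua measures length, so treat that as an assumption. The class and method names (RatioSketch, tokenCount, fToERatio) are hypothetical.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; not Joshua's actual implementation.
class RatioSketch {

    // Counts whitespace-separated tokens (assumed definition of "length").
    static int tokenCount(String line) {
        String trimmed = line.trim();
        return trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
    }

    // Ratio of total F tokens to total E tokens over the given lines.
    static float fToERatio(List<String> fLines, List<String> eLines) {
        int fTokens = 0;
        int eTokens = 0;
        for (String line : fLines) {
            fTokens += tokenCount(line);
        }
        for (String line : eLines) {
            eTokens += tokenCount(line);
        }
        if (eTokens == 0) {
            throw new IllegalArgumentException("E side has no tokens");
        }
        return (float) fTokens / eTokens;
    }

    public static void main(String[] args) {
        List<String> f = Arrays.asList("el gato negro", "la casa");
        List<String> e = Arrays.asList("the black cat", "the house");
        System.out.println(fToERatio(f, e));
    }
}
```

Under this reading, a subsample whose measured ratio is close to targetFtoERatio would meet the stated goal.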