tensorfx icon indicating copy to clipboard operation
tensorfx copied to clipboard

Implement a schema field that can represent multi-valent value

Open nikhilk opened this issue 8 years ago • 0 comments

In tf.Example, this is intrinsically supported by a feature with N entries in a list field. Think about what is the best way to support this in csv.

Couple of options come to mind:

  1. A ':' delimited set of numbers in a column. (pros: easy to produce; cons: requires more expensive string splitting)
  2. A sequence of bytes encoded as base64 into a column. (pros: more painful to produce; still requires base64 decoding but cheaper)

Scenarios

  1. Text run through a word2vec transform during data preparation.
  2. Image run through a image2vec (aka use inception model to extract features from image)

nikhilk avatar Mar 12 '17 21:03 nikhilk