Build new Spark-Tensorflow-Connector for Scala 2.12

Open wuxianxingkong opened this issue 4 years ago • 6 comments

Spark 3.x has been released and is built with Scala 2.12, but the newest version of Spark-Tensorflow-Connector is built with Scala 2.11, which means it is not compatible with Spark 3.x. The artifact in the Maven repository is still built for Scala 2.11.

wuxianxingkong avatar Jan 07 '22 07:01 wuxianxingkong

You can build it for Scala 2.12 yourself.

yiwei00000 avatar Feb 11 '22 06:02 yiwei00000

Alternatively, you can use spark-tfrecord (https://github.com/linkedin/spark-tfrecord), which supports Scala 2.12.
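
For reference, a minimal PySpark sketch of writing a DataFrame with spark-tfrecord. It assumes the spark-tfrecord jar built for Scala 2.12 is already on the Spark classpath. The `tfrecord` format name and the `recordType` option are as I recall them from the spark-tfrecord README, so double-check against the version you install. The helper name `write_tfrecord` is made up for illustration.

```python
def write_tfrecord(df, path, record_type="Example"):
    """Write a Spark DataFrame as TFRecord files via spark-tfrecord.

    Hypothetical helper. Assumes the spark-tfrecord jar (Scala 2.12 build)
    is on the classpath so that the "tfrecord" data source resolves.
    """
    (
        df.write
        .mode("overwrite")                  # replace any existing output at path
        .format("tfrecord")                 # data source name used by spark-tfrecord
        .option("recordType", record_type)  # e.g. "Example" or "SequenceExample"
        .save(path)
    )
```

Note that spark-tensorflow-connector registers its data source as `tfrecords` (plural), so existing jobs would need at least that string changed when switching.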

yiwei00000 avatar Feb 11 '22 11:02 yiwei00000

It would be great to have an example of how to build this for Spark 3.x and TensorFlow 2.x.

I tried a number of combinations with no success. I can build the jar, but it writes an empty TFRecord file.

If others have successfully built the jar for Spark 3.x and TensorFlow 2.x, I'd love to hear the details.
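
To check whether an output file really is empty without pulling in TensorFlow, something like the sketch below can count records by walking the TFRecord framing (8-byte little-endian length, 4-byte length CRC, payload, 4-byte payload CRC). It assumes an uncompressed file and skips the CRCs rather than verifying them. The function name `count_tfrecords` is just an illustration.

```python
import struct


def count_tfrecords(path):
    """Count records in one uncompressed TFRecord file.

    Walks the record framing only; CRC fields are read and ignored.
    Raises ValueError if the file ends in the middle of a record.
    """
    count = 0
    with open(path, "rb") as f:
        while True:
            header = f.read(8)
            if not header:
                break  # clean end of file
            if len(header) < 8:
                raise ValueError("truncated length header")
            (length,) = struct.unpack("<Q", header)  # uint64, little-endian
            if len(f.read(4)) < 4:                   # CRC of the length field
                raise ValueError("truncated length CRC")
            if len(f.read(length)) < length:         # record payload
                raise ValueError("truncated payload")
            if len(f.read(4)) < 4:                   # CRC of the payload
                raise ValueError("truncated payload CRC")
            count += 1
    return count
```

Spark writes a directory of part files, so this would be run on each `part-*` file in the output directory. A count of 0 on every part file means no records were written at all.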

dennisobrien avatar Feb 20 '22 00:02 dennisobrien

@dennisobrien Hi, try this: https://github.com/wuxianxingkong/spark-tfrecord (built with Spark 3.1.1 and Scala 2.12)

wuxianxingkong avatar Mar 03 '22 10:03 wuxianxingkong

It would be super helpful if the TF team could release a spark-tensorflow-connector build for Scala 2.12 to Maven 🙌 Right now the best workaround seems to be spark-tfrecord, but switching would mean migration effort across the products that use the connector.

yundai424 avatar Jul 27 '22 00:07 yundai424