Skip to content

checklist for 2.10.x (+ request for comment, feature requests, etc.) #263

@ngam

Description

@ngam

Comment:

I am interested in the following (minor) changes to the build. None of these is important and if we don't manage to figure them out, they can wait.

  • ensuring gcp support (likely enabled) (update: I believe this is working as of 2.9.1)
  • ensuring hdfs support (likely disabled)
  • ensuring aws support (likely disabled)

An example of our current configs:

Configuration finished
+ bazel build --crosstool_top=//custom_toolchain:toolchain --logging=6 --verbose_failures --config=opt --define=PREFIX=/home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_plac --define=PROTOBUF_INCLUDE_PATH=/home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_plac/include --config=noaws --cpu=k8 --local_cpu_resources=2 //tensorflow/tools/pip_package:build_pip_package //tensorflow/tools/lib_package:libtensorflow //tensorflow:libtensorflow_cc.so
Starting local Bazel server and connecting to it...
INFO: Options provided by the client:
  Inherited 'common' options: --isatty=0 --terminal_columns=80
INFO: Reading rc options for 'build' from /home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/work/.bazelrc:
  Inherited 'common' options: --experimental_repo_remote_exec
INFO: Reading rc options for 'build' from /home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/work/.bazelrc:
  'build' options: --define framework_shared_object=true --define=use_fast_cpp_protos=true --define=allow_oversize_protos=true --spawn_strategy=standalone -c opt --announce_rc --define=grpc_no_ares=true --noincompatible_remove_legacy_whole_archive --enable_platform_specific_config --define=with_xla_support=true --config=short_logs --config=v2 --define=no_aws_support=true --define=no_hdfs_support=true --experimental_cc_shared_library --experimental_link_static_libraries_once=false
INFO: Reading rc options for 'build' from /home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/work/.tf_configure.bazelrc:
  'build' options: --action_env PYTHON_BIN_PATH=/home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_plac/bin/python --action_env PYTHON_LIB_PATH=/home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_plac/lib/python3.9/site-packages --python_path=/home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_plac/bin/python --define=with_xla_support=true --action_env TF_SYSTEM_LIBS=absl_py,astor_archive,astunparse_archive,boringssl,com_github_googlecloudplatform_google_cloud_cpp,com_github_grpc_grpc,com_google_protobuf,curl,cython,dill_archive,flatbuffers,gast_archive,gif,icu,libjpeg_turbo,org_sqlite,png,pybind11,snappy,zlib --define=PREFIX=/home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_plac --define=LIBDIR=/home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_plac/lib --define=INCLUDEDIR=/home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/_h_env_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_placehold_plac/include
INFO: Reading rc options for 'build' from /home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/work/.bazelrc:
  'build' options: --deleted_packages=tensorflow/compiler/mlir/tfrt,tensorflow/compiler/mlir/tfrt/benchmarks,tensorflow/compiler/mlir/tfrt/jit/python_binding,tensorflow/compiler/mlir/tfrt/jit/transforms,tensorflow/compiler/mlir/tfrt/python_tests,tensorflow/compiler/mlir/tfrt/tests,tensorflow/compiler/mlir/tfrt/tests/ir,tensorflow/compiler/mlir/tfrt/tests/analysis,tensorflow/compiler/mlir/tfrt/tests/jit,tensorflow/compiler/mlir/tfrt/tests/lhlo_to_tfrt,tensorflow/compiler/mlir/tfrt/tests/lhlo_to_jitrt,tensorflow/compiler/mlir/tfrt/tests/tf_to_corert,tensorflow/compiler/mlir/tfrt/tests/tf_to_tfrt_data,tensorflow/compiler/mlir/tfrt/tests/saved_model,tensorflow/compiler/mlir/tfrt/transforms/lhlo_gpu_to_tfrt_gpu,tensorflow/core/runtime_fallback,tensorflow/core/runtime_fallback/conversion,tensorflow/core/runtime_fallback/kernel,tensorflow/core/runtime_fallback/opdefs,tensorflow/core/runtime_fallback/runtime,tensorflow/core/runtime_fallback/util,tensorflow/core/tfrt/common,tensorflow/core/tfrt/eager,tensorflow/core/tfrt/eager/backends/cpu,tensorflow/core/tfrt/eager/backends/gpu,tensorflow/core/tfrt/eager/core_runtime,tensorflow/core/tfrt/eager/cpp_tests/core_runtime,tensorflow/core/tfrt/gpu,tensorflow/core/tfrt/run_handler_thread_pool,tensorflow/core/tfrt/runtime,tensorflow/core/tfrt/saved_model,tensorflow/core/tfrt/graph_executor,tensorflow/core/tfrt/saved_model/tests,tensorflow/core/tfrt/tpu,tensorflow/core/tfrt/utils
INFO: Found applicable config definition build:short_logs in file /home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/work/.bazelrc: --output_filter=DONT_MATCH_ANYTHING
INFO: Found applicable config definition build:v2 in file /home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/work/.bazelrc: --define=tf_api_version=2 --action_env=TF2_BEHAVIOR=1
INFO: Found applicable config definition build:opt in file /home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/work/.tf_configure.bazelrc: --copt=-O2 --host_copt=-O2
INFO: Found applicable config definition build:noaws in file /home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/work/.bazelrc: --define=no_aws_support=true
INFO: Found applicable config definition build:linux in file /home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/work/.bazelrc: --copt=-w --host_copt=-w --cxxopt=-std=c++17 --host_cxxopt=-std=c++17 --config=dynamic_kernels --distinct_host_configuration=false --experimental_guard_against_concurrent_changes
INFO: Found applicable config definition build:dynamic_kernels in file /home/conda/feedstock_root/build_artifacts/tensorflow-split_1659720782575/work/.bazelrc: --define=dynamic_loaded_kernels=true --copt=-DAUTOLOAD_DYNAMIC_KERNELS
Loading: 

See this on filesystems: tensorflow/tensorflow@3437098

CUDA specific:

  • drop all but 11.2 (or at least, I won't personally bother with debugging anything other than 11.2 going forward). The rationale for this is twofold: burden on maintainers and likely lack of support (e.g. JAX no longer supports anything but 11.2+)
  • investigate new problems with jax's cuda build, likely related to protobuf, abseil, and grpc and the interaction thereof. It is important to investigate these issues in tandem (i.e. between tensorflow and jax) because we can benefit from jax's completion in time, but also make decisions for the interest of the two packages, rather than one. Current jaxlib release (0.3.15) is blocked by elusive cuda issues; the error is clearly a protobuf one, but I suspect it is related to abseil and grpc underneath.

Copying @h-vetinari and @hmaarrfk, please feel free to add to (or take away from) this. None of these is important imminently. My goal is to get this to be as close to "default" (e.g. tensorflow's images) and "optimized" (e.g. ngc's images) as possible. Continuing from the XLA additions.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions