REINVENT4
Prefer a local REINVENT4 checkout over web summaries. Treat a directory as the
repo root when it contains install.py, pyproject.toml, reinvent/, and
configs/.
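The repo-root rule above can be sketched as a small check. This is a minimal illustration, not part of REINVENT4: the function names `is_reinvent_root` and `find_reinvent_root` are hypothetical, and the only facts taken from the text are the four marker names.

```python
from pathlib import Path

# Markers named in the text above; files and directories are checked separately.
REQUIRED_FILES = ("install.py", "pyproject.toml")
REQUIRED_DIRS = ("reinvent", "configs")


def is_reinvent_root(path):
    """Return True if `path` contains all four repo-root markers."""
    root = Path(path)
    has_files = all((root / name).is_file() for name in REQUIRED_FILES)
    has_dirs = all((root / name).is_dir() for name in REQUIRED_DIRS)
    return has_files and has_dirs


def find_reinvent_root(start="."):
    """Walk upward from `start`; return the first matching directory or None."""
    current = Path(start).resolve()
    for candidate in (current, *current.parents):
        if is_reinvent_root(candidate):
            return candidate
    return None
```

Calling `find_reinvent_root()` from anywhere inside a checkout should return the checkout directory; a `None` result suggests there is no local checkout to rely on.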
Workflow
1. Classify the request: install, run configuration, data preprocessing,
   notebook conversion, scoring plugin, or test/troubleshooting.
2. Read only the relevant reference file:
   - installation or CLI usage: references/install-and-run.md
   - TOML mode selection or parameter mapping: references/config-modes.md
   - plugins, notebooks, or tests: references/plugins-and-tests.md
3. Verify commands against the local checkout before proposing them. Prefer
   python install.py --help, python install.py <backend> --dry-run,
   reinvent --help, direct file inspection, and existing files under configs/.
4. Reuse the example configs in configs/ instead of inventing schemas from
   scratch.
5. Keep file paths explicit. Upstream example configs are templates and must be
   adjusted to local model, SMILES, output, and log paths before execution.
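Steps 1 and 2 amount to a lookup from request category to reference file. The sketch below is one way to write that down. The category keys are hypothetical labels chosen for illustration; only the three reference paths come from the list above. The text does not assign a reference file to data preprocessing, so that category is deliberately left unmapped.

```python
# Hypothetical category labels mapped to the reference files listed above.
REFERENCE_FILES = {
    "install": "references/install-and-run.md",
    "cli": "references/install-and-run.md",
    "config-mode": "references/config-modes.md",
    "parameter-mapping": "references/config-modes.md",
    "plugin": "references/plugins-and-tests.md",
    "notebook": "references/plugins-and-tests.md",
    "test": "references/plugins-and-tests.md",
}


def reference_for(category):
    """Return the single reference file to read for a request category."""
    try:
        return REFERENCE_FILES[category]
    except KeyError:
        known = ", ".join(sorted(REFERENCE_FILES))
        raise ValueError(f"unknown category {category!r}; expected one of: {known}")
```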
Installation Rules
- Use an isolated Python environment with Python 3.10 or newer.
- Map the processor/backend carefully:
  - macOS CPU: mac
  - Linux CPU: cpu
  - Linux NVIDIA CUDA: upstream examples use values like cu126
  - Linux AMD ROCm: upstream examples use values like rocm6.4
  - Intel XPU: xpu
  - Windows: CPU, CUDA, and XPU are supported, but upstream says Windows is
    only partially tested
- Remember that install.py defaults to optional dependency set all, which
  includes extra packages such as openeye and isim.
- Prefer -d none for minimal or smoke-test installs unless the user
  explicitly needs OpenEye ROCS or iSIM-related functionality.
- Use --dry-run before a real install whenever backend choice or dependency
  resolution is uncertain.
- Verify a finished install with reinvent --help.
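The rules above can be combined into a small helper that assembles the install command without running it. This is a sketch under stated assumptions: `build_install_command` is a hypothetical name, and the argument order follows the `python install.py <backend> --dry-run` form quoted earlier. Confirm the exact flags against `python install.py --help` in the local checkout before relying on it.

```python
def build_install_command(backend, extras="none", dry_run=True):
    """Assemble an install.py command line as an argument list.

    backend: a value such as "mac", "cpu", "xpu", or a CUDA/ROCm tag
             like "cu126" or "rocm6.4" (example values from the rules above).
    extras:  optional dependency set passed to -d; "none" keeps the install
             minimal, "all" is what install.py uses when -d is omitted.
    dry_run: append --dry-run so nothing is installed yet.
    """
    command = ["python", "install.py", backend, "-d", extras]
    if dry_run:
        command.append("--dry-run")
    return command


# Print the command for review instead of executing it.
print(" ".join(build_install_command("cpu")))
```

Defaulting to `-d none` and `--dry-run` means the helper produces the cautious variant unless the caller opts into more.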
Running REINVENT
- Main CLI entry point: reinvent [-l logfile] <config.toml>.
- Data pipeline entry point from pyproject.toml: reinvent_datapre.
- Prefer TOML because upstream ships maintained examples in configs/.
- When editing configs, update at least device selection, model/prior paths,
  SMILES inputs, output files, and TensorBoard/log paths.
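The CLI form `reinvent [-l logfile] <config.toml>` can be assembled programmatically. The following is an illustrative sketch: `build_run_command` is a hypothetical helper, and the file names in the usage line are placeholders, not files shipped upstream.

```python
import shlex


def build_run_command(config, logfile=None):
    """Assemble `reinvent [-l logfile] <config.toml>` as an argument list."""
    command = ["reinvent"]
    if logfile is not None:
        # -l is optional; include it only when a log file is requested.
        command += ["-l", str(logfile)]
    command.append(str(config))
    return command


# Placeholder paths; replace them with a locally adjusted config and log file.
print(shlex.join(build_run_command("my_run.toml", logfile="my_run.log")))
```

Returning a list rather than a string keeps paths with spaces intact if the command is later passed to `subprocess.run`.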
Troubleshooting Rules
- On macOS, remind the user that upstream documents CPU-only support and says
  macOS is only partially tested.
- If a macOS clone reports path collisions under
  contrib/tutorials/maize/adgpu_prepare, avoid relying on those collided files
  unless the user specifically needs that tutorial.
- For tests, warn that they require a JSON config in which MAIN_TEST_PATH
  points to a directory that does not yet exist; some tests also require
  OE_LICENSE to be set.
- Do not promise a full RL/TL run unless models, datasets, and optional
  licensed tools are present locally.
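The test prerequisites above can be checked before running anything. This is a minimal sketch, not upstream code: `check_test_prereqs` is a hypothetical helper, and it assumes MAIN_TEST_PATH is a top-level key in the JSON config, which should be confirmed against the example config in the local checkout.

```python
import json
import os
from pathlib import Path


def check_test_prereqs(config_file, environ=None):
    """Return a list of human-readable problems; an empty list means no issue found."""
    environ = os.environ if environ is None else environ
    problems = []

    with open(config_file, encoding="utf-8") as handle:
        config = json.load(handle)

    # Assumption: MAIN_TEST_PATH sits at the top level of the JSON document.
    main_test_path = config.get("MAIN_TEST_PATH")
    if not main_test_path:
        problems.append("MAIN_TEST_PATH is missing from the test config")
    elif Path(main_test_path).exists():
        problems.append(f"MAIN_TEST_PATH already exists: {main_test_path}")

    if not environ.get("OE_LICENSE"):
        problems.append("OE_LICENSE is not set; tests that need OpenEye may fail")

    return problems
```

A missing OE_LICENSE is reported as a problem here only as a warning signal, since the rules above say that just some tests depend on it.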
Source Files
Use these upstream files as the primary source of truth when they are present
in the local checkout:
- README.md
- install.py
- pyproject.toml
- configs/README.md
- configs/PARAMS.md
- configs/SCORING.md
- notebooks/README.md
- tests/example_config.json