Skip to content

Commit

Permalink
Bug Fix - Bug fix for cuda 12.2 dockerfile LD_LIBRARY_PATH issue (#614)
Browse files Browse the repository at this point in the history
**Description**
Cuda 12.2 image will report undfined symbol error due to incomplete
LD_LIBRARY_PATH:


![image](https://github.com/microsoft/superbenchmark/assets/25875482/1a7c48c7-cb6b-4e3a-abbe-dde23007a96b)

### How to reproduce:
1. Deploy sb with cuda12.2 image
```
sb deploy -f local.ini -i superbench/superbench:v0.10.0-cuda12.2
```
2. Enter to the container
```
sudo docker exec -it sb-workspace bash
```
3. Execute `mpirun`:
```
root@sb-container:~# mpirun
mpirun: symbol lookup error: mpirun: undefined symbol: opal_libevent2022_event_base_loop
```
### Fix to fix
* Append hpcx_load into /etc/bash.bashrc for updaing env LD_LIBRARY_PATH in each time

---------
  • Loading branch information
RyoYang authored Mar 21, 2024
1 parent 2c88db9 commit eeaa9b1
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion dockerfile/cuda12.2.dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -130,7 +130,8 @@ ENV PATH="${PATH}" \

RUN echo PATH="$PATH" > /etc/environment && \
echo LD_LIBRARY_PATH="$LD_LIBRARY_PATH" >> /etc/environment && \
echo SB_MICRO_PATH="$SB_MICRO_PATH" >> /etc/environment
echo SB_MICRO_PATH="$SB_MICRO_PATH" >> /etc/environment && \
echo "source /opt/hpcx/hpcx-init.sh && hpcx_load" >> /etc/bash.bashrc

# Add config files
ADD dockerfile/etc /opt/microsoft/
Expand Down

0 comments on commit eeaa9b1

Please sign in to comment.