Concurrent algorithms for integrating three-dimensional B-spline

functions into machines with shared memory such as GPU