Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

a major code release of rocblas #12

Merged
merged 15 commits into from
May 13, 2016
Merged

a major code release of rocblas #12

merged 15 commits into from
May 13, 2016

Conversation

tingxingdong
Copy link
Contributor

Summary of proposed changes:

  • cleaning: remove/rename the files with clblas, ablas,
  • rocBLAS:
  • provide cublas-like / batched API (four BLAS-1, GEMV, HEMV, GEMM)
  • four BLAS-1 routines implementations: scal, swap, copy, dot
  • auxilary routines: get_(set) vector, get(set)_matrix.
  • Gtest accuracy testing:
  • BLAS-1 routine scal testing
  • BLAS-3 GEMM testing
  • Benchmarking:
    • support scal, symv, dot
    • benchmarking client: accept command line
  • Common operation
    • timing function, random generator, device query...
    • cblas compatible interface

@tingxingdong
Copy link
Contributor Author

tingxingdong commented May 12, 2016

revision based on feedbacks:
1)get_wtime --> get_time_ms;
2)gflops --> gflop_count;
3)fix the indentation
4)add comments when using feature of c++11 explicitly
5) dir tests/ --> gtest/, files now ends with "_gtest.cpp"
6) remove get(set)_vector, replace them with hipMemcy
7) pointer_type --> pointer_location
8) fix typo: auxilary --> auxiliary
9) macro name CHECK_ERROR --> CHECK_HIP_ERROR, when using HIP API (e.g hipMallco, hipMemcpy)
10) delete unused files main.cpp. delete the unused expert interface related files.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants