In-Memory Acceleration for General Data Parallel Applications