My GPU GeForce GTX 1050 Ti has compute capability 6.1. According to the CUDA docs it has 96 KB of shared memory per streaming multiprocessor.
How to get this limit fr