This dataset causes memory problems, so we will replace it with a more efficient way of storing the KG matrix.
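The note does not say which storage scheme will be used. As one possible illustration, assuming the KG matrix is mostly zeros, the sketch below keeps only the nonzero entries as coordinate (row, column, value) triples instead of a dense grid. The class `SparseKG` and its methods are hypothetical names made up for this example, not part of the original code.

```python
from array import array


class SparseKG:
    """Hypothetical coordinate-format store: only nonzero entries are kept."""

    def __init__(self, n_rows, n_cols):
        self.shape = (n_rows, n_cols)
        self.rows = array("q")  # row index of each nonzero entry (8-byte ints)
        self.cols = array("q")  # column index of each nonzero entry
        self.vals = array("d")  # value of each nonzero entry (8-byte floats)

    def add(self, row, col, value=1.0):
        """Record one nonzero entry; bounds are checked against the shape."""
        if not (0 <= row < self.shape[0] and 0 <= col < self.shape[1]):
            raise IndexError("entry outside matrix shape")
        self.rows.append(row)
        self.cols.append(col)
        self.vals.append(value)

    def nnz(self):
        """Number of stored (nonzero) entries."""
        return len(self.vals)

    def row(self, i):
        """Return the nonzero entries of row i as {column: value} (linear scan)."""
        return {c: v for r, c, v in zip(self.rows, self.cols, self.vals) if r == i}


# Assumed toy example: a 1,000,000 x 1,000,000 matrix with three nonzero entries.
kg = SparseKG(1_000_000, 1_000_000)
kg.add(0, 5)
kg.add(0, 7, 2.0)
kg.add(3, 5)

# Rough payload sizes in bytes (ignores Python object overhead).
dense_bytes = kg.shape[0] * kg.shape[1] * 8
sparse_bytes = kg.nnz() * (kg.rows.itemsize + kg.cols.itemsize + kg.vals.itemsize)
print(dense_bytes, sparse_bytes)
```

With this layout, memory grows with the number of nonzero entries rather than with rows times columns. The row lookup here is a linear scan, so a real replacement would likely want an indexed format (for example a compressed sparse row layout from a library such as SciPy) if fast row access matters.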