The model can only learn from the signals present in the data and labels it receives. If the labels are inconsistent, the examples are duplicated, or the sample is badly skewed, the training process is being taught the wrong lesson from the start.
This is why many real machine-learning failures are data failures before they are architecture failures. Better modeling cannot fully rescue a dataset that is misleading the learner.