SyntaxFix
Write A Post
Hire A Developer
Questions
This is how we can join two Dataframes on same column names in PySpark.
df = df1.join(df2, ['col1','col2','col3'])
If you do printSchema() after this then you can see that duplicate columns have been removed.
printSchema()