1 year ago

#361639

test-img

Reshma Joseph

Interpreting hex value in pyspark

I was querying values from two tables, which were joined, to find out the reason for mismatch in pyspark.

Joining columns from the tables seems to have extra space in one column.

I tried trim functions(multiple combinations) to remove the white space but it was futile.

There was one suggestion to try with hex() function. Below is the code and output for the same. Can someone tell me how to interpret hex?

compare_query="""
(
select coalesce(upper(trim(b.cleaned)),upper(trim(a.tz_brandname))) as tz_brandname_mapped, 
b.source,length(b.source),hex(b.source),
a.productbrand,length(a.productbrand),hex(a.productbrand)
from ticket a
left join brand_mapping b
on trim(a.productbrand)=trim(b.source)
where ticketid='xxx'
and ticketlineid='xxx'
and storeid='xxx'
and dateclosed='xxx'
)
"""

compare_df=spark.sql(compare_query)

compare_df.show(truncate=False)

Output

FloatProgress(value=0.0, bar_style='info', description='Progress:', layout=Layout(height='25px', width='50%'),…
+-------------------+------+--------------+------------+------------+--------------------+-----------------+
|tz_brandname_mapped|source|length(source)|hex(source) |productbrand|length(productbrand)|hex(productbrand)|
+-------------------+------+--------------+------------+------------+--------------------+-----------------+
|SOURCE CANNABIS    |SOURCE|6             |534F55524345| SOURCE     |7                   |20534F55524345   |
+-------------------+------+--------------+------------+------------+--------------------+-----------------+

apache-spark

pyspark

apache-spark-sql

hex

whitespace

0 Answers

Your Answer

Accepted video resources