character sets? unicode?

Thread starter Michael
Start date Feb 3, 2005

Michael

Feb 3, 2005

I'm trying to import text from email I've received, run some regular
expressions on it, and save the text into a database. I'm trying to
figure out how to handle the issue of character sets. I've had some
problems with my regular expressions on email that has interesting
character sets. Korean text seems to be filled with a lot of '=3D=21'
type of stuff. This doesn't look like unicode (or am I wrong?) so does
anyone know how I should handle it? Do I need to do anything special
when passing text with non-ascii characters to re, MySQLdb, or any other
libraries? Is it better to save the text as-is in my db and save the
character set type too or should I try to convert all text to some
default format like UTF-8? Any advice? Thanks.

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Similar Threads

File names, character sets and Unicode	1	Dec 12, 2008
MySQLdb not playing nice with unicode	1	Mar 30, 2013
Outputting signal values to terminal Within Character Array	0	Dec 10, 2021
Python Unicode handling wins again -- mostly	67	Nov 30, 2013
Unicode questions	17	Oct 19, 2010
prob's w foreign char sets ...	9	May 20, 2012
Thinking Unicode	0	Aug 8, 2013
Python 3.3, gettext and Unicode problems	0	Dec 31, 2012

Facebook Twitter Reddit Pinterest Tumblr WhatsApp Email Link

Members online

No members online now.

Total: 48 (members: 2, guests: 46)
Robots: 342

Forum statistics

Threads: 473,995

Messages: 2,570,236

Members: 46,822

Latest member: israfaceZa

Latest Threads

How to Merge to div with each other as shadow effect?
- Started by treekmostly22
- Yesterday at 7:58 AM
Syntax error
- Started by RGIANNETTI
- Thursday at 7:13 PM
SYNTAX ERROR
- Started by RGIANNETTI
- Thursday at 7:10 PM
Right or wrong
- Started by Tobi1987
- Thursday at 6:34 AM
Hello , Im Emilio
- Started by Mercury_Dev
- Thursday at 5:55 AM
Anyone want to balance my browser-based clicker game?
- Started by timo
- Wednesday at 11:16 PM
DNS Disaster: The Server Downfall
- Started by Infinityhost
- Tuesday at 5:04 AM
Web-Based RAM Management: Real-Time Server Control in Windows 10
- Started by Infinityhost
- Tuesday at 5:00 AM
Create and Preview HTML & PDF with Custom Encryption and Micro Cloud Storage
- Started by Infinityhost
- Tuesday at 4:53 AM
Demonstration of a Self-Written HTTP Server for an Online Store
- Started by Infinityhost
- Tuesday at 4:50 AM

Top