I am reading the PDF using itextsharp dll. I had hit in to logical issue where I could not differentiate first name and last name from the string and there are no specific logic for it as i cannot split it using spaces between them.
Sample text taken from PDF after reading -
Title First Name
Pangkat Nama Depan
I can't imagine that you can differentiate first and last names without really complicated logic. Even then it might only work at only a nominal success rate. Any real attempt at parsing these will only be successful most of the time with the power of a human brain (or large amount of computing power) behind it.
Is there a way you can isolate the names positionally, potentially using some OCR product or something to identify characters at specific coordinates? I am not iTextSharp would help there, but that might be an option that I have seem used in the past.