Bekwam Blog: Java Libraries in Talend Open Studio

Thursday, January 6, 2011

Java Libraries in Talend Open Studio

Although Talend Open Studio has a rich set of StringHandling functions, I prefer those in the StringUtils class of Apache Commons Lang. One of my favorite functions is "isBlank", which checks for null, the empty String, and a String composed only of whitespace. Fortunately, Talend provides an easy way to integrate this library call.
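To make the three cases concrete, here is a minimal sketch of the behavior described above. So that it runs without the Commons Lang JAR on the classpath, it uses a local stand-in method written for this illustration (the class name BlankCheckSketch and the helper are assumptions, not part of the library). In a real job you would call StringUtils.isBlank itself.

```java
public class BlankCheckSketch {

    // Illustrative stand-in for StringUtils.isBlank: true for null,
    // for the empty String, and for a String made up only of whitespace.
    static boolean isBlank(String s) {
        if (s == null) {
            return true;
        }
        for (int i = 0; i < s.length(); i++) {
            if (!Character.isWhitespace(s.charAt(i))) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        System.out.println(isBlank(null));    // true
        System.out.println(isBlank(""));      // true
        System.out.println(isBlank("   "));   // true
        System.out.println(isBlank(" abc ")); // false
    }
}
```

A plain null check or an equals("") check alone would miss the whitespace-only case, which is why a single call covering all three is convenient.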

The Java library I'll be working with is Apache Commons Lang. Its Javadoc documents the StringUtils class.

An Example

In this Open Studio job, an input Excel file is mapped directly to an output Excel file. However, since some of the fields in the source are empty, some null/empty string checks are required to make sure that the output spreadsheet's columns are aligned.

Talend Open Studio Job with tLibraryLoad Component

Select the tLibraryLoad component. In the Component panel, the "Basic settings" tab will let you find the JAR file. Under "Advanced settings" there is a text box into which import statements can be added. Add the following import statement in this text box.

import org.apache.commons.lang.StringUtils;

Then, in the tMap component, add a Java expression that makes the StringUtils.isBlank call.

tMap with a Commons Lang StringUtils Call
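The exact expression depends on your schema, so the following is only a sketch of the kind of expression that could go into a tMap output column. The row name row1, the column city, and the choice to substitute an empty String for blank values are all assumptions made for this illustration. The nested StringUtils class is a local stand-in so the snippet compiles without the JAR; in Talend, the import added to tLibraryLoad supplies the real class.

```java
public class TMapExpressionSketch {

    // Local stand-in for org.apache.commons.lang.StringUtils (illustration only).
    static final class StringUtils {
        static boolean isBlank(String s) {
            if (s == null) {
                return true;
            }
            for (int i = 0; i < s.length(); i++) {
                if (!Character.isWhitespace(s.charAt(i))) {
                    return false;
                }
            }
            return true;
        }
    }

    // Hypothetical input row with a single column named "city".
    static final class Row {
        String city;

        Row(String city) {
            this.city = city;
        }
    }

    // The ternary in the return statement is the part that would be
    // typed into the tMap expression editor for the output column.
    static String cityOrEmpty(Row row1) {
        return StringUtils.isBlank(row1.city) ? "" : row1.city;
    }

    public static void main(String[] args) {
        System.out.println("[" + cityOrEmpty(new Row(null)) + "]");     // []
        System.out.println("[" + cityOrEmpty(new Row("   ")) + "]");    // []
        System.out.println("[" + cityOrEmpty(new Row("Boston")) + "]"); // [Boston]
    }
}
```

Writing an empty String instead of null or stray whitespace means every output row carries a value in every column, which is one way to keep the spreadsheet columns aligned.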

Deployment Note

When you use tLibraryLoad, the JAR file is copied into the Talend internals. This makes it eligible to be exported along with a job. Don't try to adjust the IDE's classpath, or any other classpath, to find your JAR.

Flexibility

There are many possibilities for integration with this kind of flexibility. This example focused on some useful string handling routines from a popular Java library. But with Java, there is so much code out there that more capable libraries like Hibernate or JUnit could find their way into integration scenarios.

12 comments:

Carl, February 2, 2011 at 5:50 AM
Talend comes with a lot of JARs including several versions of Commons Lang. Rather than browse the file system for your downloaded JAR in the tLibraryLoad component configuration panel, try scanning the popup menu for "Commons Lang 2.5".
Anonymous, February 3, 2012 at 1:01 PM
Hi Carl,

I need to bring over Chinese characters and am using tLibraryLoad to load charset.jar.

My question for you...What should be the corresponding import statement?

Thanks in Advance....
Carl, February 3, 2012 at 1:12 PM
Hi,

Consider upgrading to Java 6, which contains charsets.jar for Chinese encodings: Big5, GB18030, GB2312, and GBK. You can also try placing charsets.jar in the jre/lib/ext folder of the JRE used by Talend and by the target platform.
Anonymous, February 3, 2012 at 1:34 PM
Thanks Carl...It works for one row (If only one row is in the source)...

When there is more than one row in the source, I get ???
Anonymous, February 3, 2012 at 1:36 PM
When I use tLibraryLoad to load charset.jar, what should be the corresponding import statement (like import org.apache.commons.lang.StringUtils;)?