Upgrading from Lucene to Elastic search indexing

Question

HJetti

Member since 2011

16 posts

IP Australia

Posted: Jun 27, 2016

Last activity: Oct 4, 2018

Closed

Solved

We are currently on Pega 7.1.7 with 2 batch nodes and 4 user nodes. We have disabled Elastic indexing and are running the legacy Lucene indexing against a shared mount.

We have customised the search so that all nodes can search the indexes from the shared mount.

We are now upgrading to Pega 7.1.9 and would like to move to Elastic Search indexing.

We have a few concerns that need to be addressed:

We have around 2 million work objects, and our system needs to run 12 hours a day, 7 days a week, with an outage window of only 8 hours, so we are looking for options on how to re-index.

Option 1: Run the re-indexing through one of the batch nodes. Please confirm whether it is acceptable for the re-indexing to run during business hours while users are on the system.
Option 2: Initiate the re-indexing in batch mode from a local desktop connected to the production database, with an increased number of threads. We are not sure whether this can run during business hours while users are on the system. Please confirm.

With option 1, we could also increase the maxnumworkers setting to 10.

Please let us know if there is a better way of doing this.
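
As a rough sizing aid for the numbers above, the following sketch works out the indexing throughput needed to re-index everything inside the outage window. It assumes the full re-index has to finish within the 8-hour window, and `required_rate` is a hypothetical helper name used only for illustration, not a Pega API.

```python
def required_rate(total_objects: int, window_hours: float) -> float:
    """Return the objects-per-second rate needed to finish within the window.

    Assumption: the whole re-index must complete inside the window.
    """
    if window_hours <= 0:
        raise ValueError("window_hours must be positive")
    return total_objects / (window_hours * 3600)


# Figures from the question: about 2 million work objects, 8-hour outage.
rate = required_rate(2_000_000, 8)
print(f"{rate:.1f} objects/second")  # prints "69.4 objects/second"
```

If a trial run on a copy of the data sustains less than this rate, the re-index would not fit in a single outage window under that assumption.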


Accepted Solution

Posted: 11 Jul 2016 2:27 EDT

nistr replied to HJetti

If there are 9 nodes and 2 of them are index host nodes, how will the other 7 nodes communicate with them? Which IP address is used: pyClusterAddress or something else?

Note that Elastic Search uses the port range 9300-9399 to communicate between nodes. All nodes communicate with the index host nodes identified on the search landing page. The actual IP address and port number that each node uses for this communication are listed in the pyIndexerAddress column of the pr_sys_statusnodes table. The pyClusterAddress column is used by Hazelcast, not by Elastic Search.
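
To check that a non-host node can actually reach an index host on that port range, a simple TCP probe can help. This is a minimal sketch using only the Python standard library; `first_reachable_port` is a hypothetical helper, and the host you pass in is whatever address appears in pyIndexerAddress for the index host node. It only tests TCP reachability and does not speak the Elastic Search protocol.

```python
import socket
from typing import Optional


def first_reachable_port(host: str, start: int = 9300, end: int = 9399,
                         timeout: float = 0.5) -> Optional[int]:
    """Return the first port in [start, end] that accepts a TCP connection.

    Returns None if no port in the range is reachable. The default range
    is the one quoted in the reply above.
    """
    for port in range(start, end + 1):
        try:
            # create_connection raises OSError on refusal, timeout, or
            # name-resolution failure; the socket is closed on exit.
            with socket.create_connection((host, port), timeout=timeout):
                return port
        except OSError:
            continue
    return None
```

Calling `first_reachable_port("<index host address>")` from a user node and getting `None` back would suggest a firewall or routing problem between that node and the index host, rather than a search configuration issue.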

Posted: 28 Jun 2016 10:17 EDT

dames

Tetrasoft India Private Limited

replied to HJetti

Hi Hari,

For work indexing, it is best to run the re-indexing during non-business hours.

Anilkumar Nimmalapudi may be able to share further insights on a better approach.

Posted: 29 Jun 2016 7:47 EDT

nistr replied to dames

You can use this utility for batch re-indexing - https://docs-previous.pega.com/batch-creation-and-updating-elasticsearch-index-files-command-line

Note that a large number of threads may not give you much benefit, because everything still has to be read from the database. You may see a significant time reduction with up to 3 or 4 threads, but beyond that the improvement flattens out.

As Swati mentioned, it is advisable to run this during off-business hours. Since a lot of data is read from the database for indexing, there will be some impact.
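
One way to picture why the gain flattens is Amdahl's law: if part of the work stays serialized (for example, database reads), adding threads only speeds up the rest. The sketch below is purely illustrative; the 0.75 parallel fraction is an assumed number chosen for the example, not a measurement from Pega or from this system.

```python
def amdahl_speedup(parallel_fraction: float, threads: int) -> float:
    """Speedup predicted by Amdahl's law for a given parallel fraction."""
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be between 0 and 1")
    if threads < 1:
        raise ValueError("threads must be at least 1")
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / threads)


# Assumed: 75% of the indexing work parallelizes, 25% is serialized.
for n in (1, 2, 4, 8, 16):
    print(n, round(amdahl_speedup(0.75, n), 2))
```

Under that assumption, going from 1 to 4 threads more than doubles throughput, while going from 4 to 16 adds comparatively little. The real fraction for a given system can only be found by measuring.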

Posted: 7 Jul 2016 1:16 EDT

NarendraC9464

Wipro Technologies Ltd

replied to HJetti

Hi Rajiv,

If there are 9 nodes and 2 of them are index host nodes, how will the other 7 nodes communicate with them? Which IP address is used: pyClusterAddress or something else?

Posted: 26 Oct 2016 14:08 EDT

ThrinathJalamadugu

Accenture

replied to HJetti

Rajiv, one question: in all versions shipped after 7.1.7, does Pega enable Elastic Search by default (and disable or bypass Lucene search)? Is there a setting to check whether Elastic Search is enabled?

We are currently on 7.2.1.

Posted: 28 Oct 2016 1:50 EDT

nistr replied to HJetti

Rajiv, one question: in all versions shipped after 7.1.7, does Pega enable Elastic Search by default (and disable or bypass Lucene search)?

For a fresh installation, Elastic Search is enabled by default. For an upgrade, any Lucene indices that were present before the upgrade continue to be used until the user re-indexes from the search landing page. Once all enabled indices on the search landing page have been re-indexed, the platform automatically switches over to Elastic Search. Please refer to the upgrade guide for details.
