HelpForum

Forum (Archive)

This forum is kept largely for historical reasons and for announcements of our latest changes. (It was focused on the older EPPI Reviewer version 4.)

There are many informative posts and answers to common questions, but you may find our videos and other resources more informative if you are an EPPI Reviewer WEB user.

If you have questions or require support, please email eppisupport@ucl.ac.uk.


New Post
23/03/2012 15:01
 

Dear Trude,

Thanks for getting in touch. The problem you're experiencing is surely due to the fact that your review is breaking all known size records, with a total of 1.82 million references imported. EPPI-Reviewer was designed with big numbers in mind but I must admit that we are exploring its limits in your case.

As if size wasn't enough, you now have 27k duplicate groups already in the system. This means that the "get new duplicates" routine will have to compare its fresh results to the existing groups and work out which groups should be updated and which new groups should be created.
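To illustrate why existing groups add work, here is a minimal sketch of that kind of reconciliation. It is not EPPI-Reviewer's actual code: the function name `reconcile_groups`, the representation of groups as sets of reference IDs, and the rule of attaching to the lowest-numbered overlapping group are all assumptions made for the example.

```python
from typing import Dict, List, Set, Tuple


def reconcile_groups(
    existing: Dict[int, Set[int]], fresh: List[Set[int]]
) -> Tuple[Dict[int, Set[int]], List[Set[int]]]:
    """Split freshly found groups into updates to existing groups and new groups.

    `existing` maps a (hypothetical) group ID to the reference IDs it holds;
    `fresh` is the list of groups produced by the latest duplicate search.
    """
    # Index every reference already assigned to a group: ref ID -> group ID.
    owner = {ref: gid for gid, members in existing.items() for ref in members}

    updates: Dict[int, Set[int]] = {}
    new_groups: List[Set[int]] = []
    for group in fresh:
        hits = {owner[ref] for ref in group if ref in owner}
        if not hits:
            # No overlap with anything known: this is a brand-new group.
            new_groups.append(set(group))
            continue
        # Simplifying assumption: attach to the lowest-numbered overlapping group.
        target = min(hits)
        unseen = {ref for ref in group if ref not in owner}
        if unseen:
            updates.setdefault(target, set()).update(unseen)
    return updates, new_groups


existing = {1: {10, 11}, 2: {20, 21}}
fresh = [{10, 12}, {30, 31}, {20, 21}]
updates, new_groups = reconcile_groups(existing, fresh)
```

Every fresh group has to be looked up against the index of existing members, so the cost grows with both the number of references and the number of groups already stored.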

Quick answer: I need to look at this at our end. I don't think you can successfully complete the "get new duplicates" routine within the limits that are applied inside EPPI-Reviewer. Depending on what I find, I may ask you to purchase some personalised support: handling such a large number of references has to be seen as experimental work and is certainly not within the limits of the (free) support that we are happy to provide to all our users. The key here is simple: if I am able to help quickly, or simply by providing advice, then all is fine. If, on the other hand, we are required to write special solutions for your particular case, then this should be considered "dedicated support".

But let us not jump to conclusions. Please bear with me while I investigate some more.

Best wishes,

Sergio

 

 
New Post
23/03/2012 16:59
 

Dear Trude,

A short update: the Duplicate checking procedures are now busily evaluating your review. I doubt they will finish before the end of my working day, therefore I will have to postpone this until Tuesday. I will be away from the office most of the day on Monday. Could I ask you to avoid loading the duplicate checking window in EPPI-Reviewer until you hear back from me? It is not strictly necessary, but I would like to keep an eye on what is going on and it would help to know that I'm the only one working on it.

Many thanks for your patience,

Sergio

 

 
New Post
23/03/2012 17:37
 

Dear Sergio,

Thank you for your help. I will make sure not to use these functions until you get back to me.

 

Kind regards,

Trude

 
New Post
26/03/2012 10:14
 

Dear Trude,

The "get new duplicates" routine has now finished running. I had no time to check the results, but it is now perfectly fine for you to go and have a look. I will catch up with this tomorrow.

Best wishes,

Sergio

 

 
New Post
27/03/2012 18:39
 

Dear Trude,

I've spent some time investigating your review and spotted several issues. Some are the direct consequence of the large number of references, but others may themselves be the cause of your review's difficult-to-handle size.

Duplicate checking: the routines returned approximately 300,000 duplicate groups. Unfortunately, unless your PC is *very powerful*, the user interface will be unable to handle such a number and will eventually return an "Out of Memory" error. This is particularly unfortunate because it means you will need us to write a special solution for your case, and I am guessing you would like to avoid the extra cost that comes with this.

Despite this, I took a quick look at what your duplicate groups look like: I think they are pretty comprehensive, and should not be missing much. Some manual adjustment may be needed, but that's something we can deal with later on.

Possible import problems: looking at the groups of duplicates, I noticed that most group members frequently come from the same "source". This is unusual, because a single source typically represents a single search (or part of one), so I took the initiative and had a look at your sources as well. The first batches of imports look perfectly all right to me, but the last ones (Feb 2012) seem to be affected by some problems:

Many of the single sources appear to contain duplicates, but the real problems appear in the "Wok" sources. These all share the same two issues:

1) They all have a large number of duplicates, both within the same source and across different sources. This means that many duplicate groups will be large, containing multiple copies of what seems to be exactly the same reference.

2) I searched for a while, but couldn't find any "Wok" reference that has an abstract (i.e. all abstract fields are empty).
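If you can export the references, both symptoms can be checked per source with a few lines of scripting. The sketch below is only an illustration under assumptions: the record fields (`source`, `title`, `year`, `abstract`) and the matching rule (same normalised title and year) are hypothetical and much cruder than the duplicate checking inside EPPI-Reviewer.

```python
from collections import Counter, defaultdict


def normalise(title):
    """Lower-case a title and collapse whitespace so trivial variants match."""
    return " ".join(title.lower().split())


def source_report(records):
    """Count duplicates within and across sources, plus empty abstracts per source."""
    copies = defaultdict(Counter)  # (title, year) -> how many copies per source
    missing_abstracts = Counter()
    for rec in records:
        key = (normalise(rec["title"]), rec["year"])
        copies[key][rec["source"]] += 1
        # Treat None, "" and whitespace-only abstracts as empty.
        if not (rec.get("abstract") or "").strip():
            missing_abstracts[rec["source"]] += 1

    within_source = Counter()  # extra copies of an item inside one source
    shared_items = 0           # items that appear in more than one source
    for per_source in copies.values():
        for source, n in per_source.items():
            within_source[source] += n - 1
        if len(per_source) > 1:
            shared_items += 1
    return within_source, shared_items, missing_abstracts


records = [
    {"source": "Wok 1", "title": "A Study", "year": 2010, "abstract": ""},
    {"source": "Wok 1", "title": "a  study", "year": 2010, "abstract": ""},
    {"source": "Wok 2", "title": "A study", "year": 2010, "abstract": None},
    {"source": "Medline", "title": "Other", "year": 2011, "abstract": "Text"},
]
within_source, shared_items, missing_abstracts = source_report(records)
```

A source whose `within_source` count is high, or whose `missing_abstracts` count equals its total number of records, would be worth re-exporting before importing it again.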

This situation makes me think that there is some kind of problem with your general search/import strategy (why do many searches include the same item multiple times?) and/or the import routines.

All in all, the anomalies explained above may mean that we don't have to deal with such great numbers after all, and that we should fix whatever went wrong at the import stage instead of trying to cope with all these duplicates. This hope is reinforced by the fact that you now have 1,581,925 "group members", meaning that we may be dealing with something like 400,000 to 600,000 genuine items, and that some 1.2 million items are actually duplicates.
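For what it's worth, the estimate can be reproduced with simple arithmetic, assuming that each duplicate group stands for exactly one genuine item and that references outside any group are unique. The inputs are the rounded figures quoted in this thread (1.82 million references, roughly 300,000 groups), so the result is only indicative.

```python
def estimate_unique(total_refs, group_members, groups):
    """Estimate genuine items, assuming one genuine item per duplicate group."""
    # Every group keeps one "master" item; all other members are redundant copies.
    redundant = group_members - groups
    return total_refs - redundant, redundant


unique, redundant = estimate_unique(
    total_refs=1_820_000,     # "1.82 million references imported" (rounded)
    group_members=1_581_925,  # current number of "group members"
    groups=300_000,           # "approximately 300000 duplicate groups"
)
print(unique, redundant)  # 538075 1281925
```

That gives roughly 540,000 genuine items and about 1.28 million redundant copies, in line with the ranges above.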

What do you think? As this is a rather peculiar situation, please feel free to contact us directly through eppisupport@ioe.ac.uk: I doubt that this discussion will be particularly interesting for our typical user.

Best wishes,
Sergio

 


Copyright 2021 by EPPI-Centre