I’m copying this to solrmarc-tech in case anyone there has some ideas. Are you able to open your MARC records a tool like MarcEdit or convert them with a tool like yaz-marcdump? It might be interesting to do some cross-checking. Maybe you can fix problems with a different tool prior to importing… or perhaps another tool will give you a better idea of exactly what is wrong.
- Demian
From: kevin smith [mailto:ash...@gmail.com]
Sent: Tuesday, June 26, 2012 3:01 PM
To: hori...@mailman.xmission.com; vufind-...@lists.sourceforge.net
Subject: [VuFind-General] Structural marc problems
Hello,
I have run into some issues importing records into VuFind from Horizon. It had been going along fine for weeks, but now I am getting all sorts of errors. Something like this:
ERROR [main] (MarcImporter.java:257) - Error reading record: For input string: "ocm6"
ERROR [main] (MarcImporter.java:257) - Error reading record: unable to parse record length
ERROR [main] (MarcImporter.java:257) - Error reading record: null
ERROR [main] (MarcImporter.java:257) - Error reading record: Directory length is not a multiple of 12 bytes long. Unable to continue.
ERROR [main] (MarcImporter.java:257) - Error reading record: unable to parse record length
ERROR [main] (MarcImporter.java:257) - Error reading record: null
ERROR [main] (MarcImporter.java:257) - Error reading record: Directory length is not a multiple of 12 bytes long. Unable to continue.
ERROR [main] (MarcImporter.java:257) - Error reading record: unable to parse record length
ERROR [main] (MarcImporter.java:257) - Error reading record: null
ERROR [main] (MarcImporter.java:257) - Error reading record: Directory length is not a multiple of 12 bytes long. Unable to continue.
ERROR [main] (MarcImporter.java:257) - Error reading record: unable to parse record length
ERROR [main] (MarcImporter.java:257) - Error reading record: null
I am not sure if this has to do with a record that is too long for marc, or some other leader problems. I have been looking at this blog post about some Horizon specific issues: http://bibwild.wordpress.com/2010/02/02/structural-marc-problems-you-may-encounter/
I am just not sure where to go from here. Are there queries I can run against the Horizon database to check for, and fix the integrity of the marc records?
Thanks,
--
Kevin Smith
Digital Library Manager
Wake County Public Libraries
Are you able to export in MARC-XML? The XML format doesn’t have the same size restrictions as binary MARC, so that might help you extract workable records.
As for pinpointing the problem record, can you export specific ranges of record numbers? If nothing else, you can try extracting different chunks until you hit the problem. If the issue cropped up in the past couple of weeks, chances are the bad record was modified or added recently, which might help you track it down.
- Demian
From: kevin smith [mailto:ash...@gmail.com]
Sent: Tuesday, June 26, 2012 3:47 PM
To: Demian Katz
Cc: hori...@mailman.xmission.com; vufind-...@lists.sourceforge.net; solrma...@googlegroups.com
Subject: Re: [VuFind-General] Structural marc problems
When I try to open the file with MarcEdit, I get the error Record too large error (larger than 99,999 bytes). Error Number: -7
So, it looks like I have a record that is too large. Anyone know how I can identify which record, or records are causing the problem?
--
You received this message because you are subscribed to the Google Groups "solrmarc-tech" group.
To post to this group, send email to solrma...@googlegroups.com.
To unsubscribe from this group, send email to solrmarc-tec...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/solrmarc-tech?hl=en.
When I try to open the file with MarcEdit, I get the error Record too large error (larger than 99,999 bytes). Error Number: -7
So, it looks like I have a record that is too large. Anyone know how I can identify which record, or records are causing the problem?
On Tue, Jun 26, 2012 at 3:29 PM, Demian Katz <demia...@villanova.edu> wrote:
------------------------------------------------------------------------------ Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________ VuFind-General mailing list VuFind-...@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/vufind-general
On Jun 27, 2012 2:50 PM, "Robert Haschart" <rh...@virginia.edu> wrote:
> So the record ought to be able to be handled, (unless for instance the record creation software does something like writing out the record length using however many bytes are needed to represent it, which I think is not one of the error modes that the MarcPermissiveStreamReader is designed to handle.
I think this is what happens; I'm not 100% sure about the directory offset, but I think this will be the case.
The record can be recovered iff no data field has a length that overflows the directory slot, as a running count can be made on the assumption that there is no unallocated space. All that is needed from the directory is the tag; the rest can be inferred from the presence or absence of field terminators, (combined with an assumption that offset is monotonic increasing, since there might be field terminators in the middle of a field. cut and paste)
To unsubscribe from this group, send email to solrmarc-tec...@googlegroups.com.