mirrors/zotero - Ayakael: My personal forge

mirrors/zotero

Author	SHA1	Message	Date
Simon Kornblith	d5bc6cbe4b	- fixes a bug in capitalizeTitle - better feedback for search translator errors	2006-09-09 22:45:03 +00:00
Simon Kornblith	14c5c40a50	- closes #279 , Refer/EndNote translator - fixes a bug in text handling that was previously masked by another	2006-09-09 22:00:04 +00:00
Simon Kornblith	67f6ae3ed2	- closes #69 , notification system for broken scrapers - don't put "Page" before page in WaPo scraper	2006-09-09 19:47:47 +00:00
Simon Kornblith	d4576d3d55	addresses #69 , notification system for broken scrapers thanks to Dan for his help on the repository side of things	2006-09-09 00:12:09 +00:00
Simon Kornblith	539957a93b	- closes #281 , look for BOM when importing to override charset. the BOM is a nice way to detect UTF encodings, although it won't help distinguish, e.g., ISO 8859-1 from MacRoman. since EndNote adds a BOM to all of its export files, this means non-ASCII charaacters should now be preserved when exported from EndNote. - better error handling for translators ("Could Not Add Item" should now pop up in all circumstances)	2006-09-08 20:44:05 +00:00
Simon Kornblith	7b7d3d85e3	- added Washington Post translator - translation works properly even when a user has switched to a different page	2006-09-08 05:47:47 +00:00
Simon Kornblith	b8ddba3a67	CiteSeer translator	2006-09-08 01:59:22 +00:00
Simon Kornblith	89cf0c7235	closes #276 , fix RIS bugs - import translators no longer fail when trying to import an item with no name - the T2/BT field becomes the publication title when no JO/JF field is available (fixes newspaper issues) - Y2 is now treated as part of the date if and only if it is improperly formatted (seriously, why can't Thomson get their own specs straight?) - work around EndNote's strange behavior of putting article titles into notes for no apparent reason - RIS export gives dates as per specification - fixed a bug that could have (potentially) caused problems formatting "January" - allow translators to access strToDate function	2006-09-06 04:45:19 +00:00
Simon Kornblith	858c0145e6	closes #216 , support for non-ascii characters in word integration	2006-09-06 03:49:41 +00:00
Simon Kornblith	b3bb6b9013	remove unnecessary debug code	2006-09-05 07:59:25 +00:00
Simon Kornblith	045780d9ac	closes #250 , figure out proper text encodings for import/export MODS uses the encoding as specified in the <?xml tag, or else UTF-8 RIS uses IBM850, since the spec says "IBM Extended Character Set" and it's the only code page Mozilla supports. (should I do this? or just use unicode?) MARC uses UTF-8, since I don't think there's any way to get full MARC-8 support, and UTF-8 is now the preferred encoding anyway	2006-09-05 07:51:55 +00:00
Simon Kornblith	dd0c537ce1	closes #267 , MODS export option uses an rdf extension (should be xml) thanks to Dan for the idea	2006-09-04 22:57:23 +00:00
Simon Kornblith	7d93903e2d	closes #239 , fix embedded RDF translator	2006-09-04 21:43:23 +00:00
Simon Kornblith	0ab9e8b36c	references #268 , occasional problems with ingest of pages with multiple references i've fixed the Amazon.com bug (i think) and made the translator show a "Could Not Save Item" prompt rather than show an empty list, but if you see any other pages where this happens, let me know	2006-09-04 17:09:44 +00:00
Simon Kornblith	2f364432ef	oops. actually commit the string changes, and change comments in translate.js	2006-08-31 07:52:28 +00:00
Simon Kornblith	438ff82955	- replace storage streams with plain old strings for translate IO. there's not much of a reason to use storage streams now, and it was screwing up non-ASCII characters. - make EBSCO scraper work better through a proxy - shorten Accession Number -> Accession No, Journal Abbreviation -> Journal Abbr, Publication Title -> Publication. it does look a bit stranger, but it also makes the interface more functional (especially for those of us without giant widescreen LCDs ;-)	2006-08-31 07:45:03 +00:00
Simon Kornblith	1c8e3fcb02	closes #239 , fix embedded RDF translator modifies scrapers to use dates in the format that comes out of the page, rather than converting to SQL adds Scholar.Date.formatDate() to provide a pretty representation of dates	2006-08-31 00:04:11 +00:00
Simon Kornblith	0cd3021cf3	closes #241 , improved date handling - Scholar.strToDate() accepts a string date and returns an object containing year, month, day, and part - capture access date whenever URL is captured - updated Zotero.dot to use new namespaces	2006-08-30 21:56:52 +00:00
Simon Kornblith	27617ee152	closes #236 , Export windows should offer a default filename with extension closes #238, present dialog if no import translator is available for a file closes #240, change XML namespaces	2006-08-30 19:57:23 +00:00
Simon Kornblith	68c480b7b5	- closes #232 , University of Michigan library site does not work - improved handling of scraper errors (hopefully, the hanging should be gone)	2006-08-30 01:41:51 +00:00
Simon Kornblith	406d1d6950	references #232 , library UI does not refresh after saving new item now that the notifier issue is fixed, i'm switching this bug to deal with the University of Michigan issue	2006-08-30 00:43:09 +00:00
Simon Kornblith	d3fc9866b9	- add ABC-CLIO (America: History and Life) translator - fix a potential issue with COinS support	2006-08-26 21:36:49 +00:00
Simon Kornblith	04d05548b2	closes #103 , figure out how to store captured pages in native export format fixes ampersands in citation COinS fixes tags and seeAlso in import/export (should now work for all items)	2006-08-20 04:35:04 +00:00
Simon Kornblith	26668a6e73	closes #194 , EBSCO translator closes #160, cache regular expressions closes #188, rewrite MARC handling functions MARC-based translators should now produce item types besides "book." right now, artwork, film, and manuscript are available. MARC also has codes for various types of audio (speech, music, etc.) and maps. the EBSCO translator does not yet produce attachments. i sent them an email because their RIS export is invalid (the URLs come after the "end of record" field) and i'm waiting to see if they'll fix it before i try to fix it myself. the EBSCO translator is unfortunately a bit slow, because it has to make 5 requests in order to get RIS export. the alternative (scraping individual item pages) would be even slower. regular expression caching can be turned off by disabling extensions.scholar.cacheTranslatorData in about:config. if you leave it on, you'll have to restart Firefox after updating translators.	2006-08-19 18:58:09 +00:00
Simon Kornblith	20486d5053	addresses #103 , figure out how to store captured pages in native export format import/export of file data should work for all file types _except_ snapshots (in this situation, export is working, but import is not yet complete; see #193) also, fixes a potential security issue that could have allowed malicious web translators to post local data to remote sites (although, given we maintain the central repository and there's no easy way to install a translator, the risk would have been minimal to begin with).	2006-08-18 05:58:14 +00:00
Simon Kornblith	10ba568ee8	closes #39 , auto-ingest of associated files (as recognizable) closes #3, Overflow metadata dumps into "extra" field add "extra" data where such data is useful and conveniently accessible (not available for XML-based export or MARC formats yet) add links to permanent URLs download associated files from full text sources (if extensions.scholar.downloadAssociatedFiles preference is enabled) fix WorldCat translator improve InnoPAC translator (it now works on Georgetown search results pages, albeit slowly, because it must first realize the catalog is misconfigured) tag items from SIRSI and WorldCat return to putting the full lengths of books into "pages," because some citation styles require it fix COinS (broken a few revisions ago)	2006-08-17 07:56:01 +00:00
Simon Kornblith	410e090ecd	closes #104 , speed up multiple item adds	2006-08-15 23:03:11 +00:00
Simon Kornblith	51108446e3	closes #187 , make berkeley's library work closes #186, stop translators from hanging when a document loads inside a frameset, we now check whether we can scrape each individual frame. all functions involving tabs have been vastly simplified, because in the process of figuring this out, i discovered Firefox 2's new tab events. if a translator throws an exception inside loadDocument(), doGet(), doPost(), or processDocuments(), a translate error message will appear, and the translator will not hang	2006-08-15 19:46:42 +00:00
Simon Kornblith	feff0aa531	closes #53 , export to footnote or bibliography closes #180, make all contextual menu export/create bibliography options work right also: - add Chicago Note style output - unregister RDF data sources from cache after import	2006-08-14 20:34:13 +00:00
Simon Kornblith	3195a1c382	closes #112 , ingested items should be automatically added to selected project references #178, changes to various date fields - updates CSL to work with the latest schema. we can now (almost) generate completely valid APA style. the only issue is that there's no syntax for specifying short forms for page and creator type labels. - updates scrapers to use date field rather than year field. - removes now-unnecessary translation engine code pertaining to year field.	2006-08-14 05:12:28 +00:00
Simon Kornblith	36a402713c	rename Scholar.Utilities.Ingester.HTTPUtilities to Scholar.Utilities.Ingester.HTTP for consistency	2006-08-11 16:34:22 +00:00
Simon Kornblith	3a1ffb6174	make LOC/WebVoyage scraper and other scrapers using Scholar.loadTranslator work again	2006-08-09 18:59:38 +00:00
Simon Kornblith	6efd6d2cc4	closes #99 , add options for export	2006-08-08 23:00:33 +00:00
Simon Kornblith	3edb6e0286	closes #86 , steal EndNote download links Scholar should now attempt to process citation information from EndNote download links (MIME types application/x-endnote-refer and application/x-research-info-systems). in situations where Scholar cannot process the information, a standard helper app dialog will appear. this behavior is controlled by the preference extensions.scholar.parseEndNoteMIMETypes.	2006-08-08 21:17:07 +00:00
Simon Kornblith	504ebf8996	closes #162 , do sniffing for import formats import should now work regardless of file extensions. this should make #86 (steal EndNote download links) fairly easy to implement.	2006-08-08 02:46:52 +00:00
Simon Kornblith	216f0c7581	closes #83 , figure out how to implement OpenURL closes #76, implement extensible search/retrieval architecture for obtaining metadata OpenURL COinS lookup is now implemented using a real search architecture system. at the moment, it works with Open WorldCat for books, CrossRef for journal articles (provided the COinS object contains a DOI or an ISSN), and PubMed when a PMID is available.	2006-08-08 01:06:33 +00:00
Simon Kornblith	9144b56772	addresses #131 , make import/export symmetrical closes #163, make translator API allow creator types besides author import and export in the multi-ontology RDF format should now work properly. collections, notes, and see also are all preserved. more extensive testing will be necessary later.	2006-08-05 20:58:45 +00:00
Simon Kornblith	30af2c89df	- closes #130 , add progress bar for import/export - eliminates "unresponsive script" message on import/export i tried to make a progress bar that actually provides useful information, but for some reason, XUL interface updates are done asynchronously, and thus don't actually happen as long as the import/export operation continues. the code is there, but disabled, if there's some solution to this issue, but i searched and couldn't find one.	2006-08-02 21:06:58 +00:00
Simon Kornblith	c64e5c841f	closes #78 , figure out import/export architecture closes #100, migrate ingester to Scholar.Translate closes #88, migrate scrapers away from RDF closes #9, pull out LC subject heading tags references #87, add fromArray() and toArray() methods to item objects API changes: all translation (import/export/web) now goes through Scholar.Translate all Scholar-specific functions in scrapers start with "Scholar." rather than the jumbled up piggy bank un-namespaced confusion scrapers now longer specify items through RDF (the beginning of an item.fromArray()-like function exists in Scholar.Translate.prototype._itemDone()) scrapers can be any combination of import, export, and web (type is the sum of 1/2/4 respectively) scrapers now contain functions (doImport, doExport, doWeb) rather than loose code scrapers can call functions in other scrapers or just call the function to translate itself export accesses items item-by-item, rather than accepting a huge array of items MARC functions are now in the MARC import translator, and accessed by the web translators new features: import now works rudimentary RDF (unqualified dublin core only), RIS, and MARC import translators are implemented (although they are a little picky with respect to file extensions at the moment) items appear as they are scraped MARC import translator pulls out tags, although this seems to slow things down no icon appears next to a the URL when Scholar hasn't detected metadata, since this seemed somewhat confusing apologizes for the size of this diff. i figured if i was going to re-write the API, i might as well do it all at once and get everything working right.	2006-07-17 04:06:58 +00:00
Simon Kornblith	d65328c830	adds Biblio/DC/FOAF/PRISM/VCard RDF export type. Bruce D'Arcus, author of CiteProc and co-lead on the OpenOffice bibliographic project, is currently using this as his ontology, and we can unambiguously encode all of our metadata with it. caveats: - it's not human readable. mozilla doesn't nest blank nodes, so everything's scattered throughout the file. it would be relatively easy to do post-processing with E4X or even regexps to correct this. - there's no generic callNumber field, so all callNumbers are encoded as LCC. adds container creation routines to dataMode rdf changes Dublin Core export to Unqualified Dublin Core, and removes DC Terms qualifiers	2006-07-07 18:41:21 +00:00
Simon Kornblith	c02666fcd3	add an API for Mozilla's RDF data source, so that import/export translators will be able to create and parse RDF with minimal effort convert Dublin Core export to new API	2006-07-06 21:55:46 +00:00
Simon Kornblith	2d8ed16d88	adds export of tags to MODS. adds export of seeAlso info and project hierarchy to RDF. for now, this is embedded in the modsCollection root element. uses nodeIDs for Dublin Core RDF.	2006-07-06 03:39:32 +00:00
Simon Kornblith	45b9234996	addresses #78 , figure out import/export architecture - changes scrapers table to translators table; all import/export/web translators now belong in this table - adds Scholar.Translate to handle translation issues. eventually, Scholar.Ingester.Document will become part of this interface - adds Scholar_File_Interface (in fileInterface.js) to handle UI for export and eventually import. (David, when you have time, please connect Scholar_File_Interface.exportFile to a button.) - adds an export translator for MODS. all of our metadata, but not our hierarchy (projects, etc.) translates directly and unambiguously into valid MODS. eventually, we can use RDF or another format to handle hierarchy. - adds utilities.getVersion() and utilities.inArray() for simplified scraper coding - fixes minor interface issues with the nifty chrome scraping status window	2006-06-29 00:56:50 +00:00