These functions, grouped together and still in beta testing, start from a source text or a part of it
(no shorter than a single verse, however) and search the whole Musisque Deoque corpus, or one of its sections, for verbal or purely rhythmic similarities.
Once a source text has been chosen and a few main options set, the search engine draws scholars' attention to a number of potentially significant results,
picked out from a usually huge mass of irrelevant material.
The functions, although complementary in their goals, fall into two types, which are also handled on separate pages of the application. In detail:
Lexical co-occurrences:
search for the co-occurrence, in the target texts, of at least two words from the source text or, more commonly, of their lemmas;
the source text is a section of an entire text, namely a first-order partition (for example, one book of the Aeneid).
Metrical and verbal co-occurrences:
this tool, starting from a chosen source text (in this case, a single verse), tries to take advantage of the complete scansion of dactylic verses
made available by the
to find, in the entire corpus or in one of its parts, similarities of rhythm and sound, going beyond the limits of a verbal inquiry based on identity of forms
or on dependence on a headword.
There are three available approaches:
search for a metrical pattern: returns verses with the same metrical pattern as the source verse (quantities, pauses, hiatuses, synalephas, etc.);
search for words: a key, composed of its metrical position and its vowels, is extracted from each word of the source verse;
the key is then searched for in the whole corpus;
search for sequences of syllables: the tool extracts from a verse all the possible sequences of 4 or 5 syllables, regardless of word boundaries;
metrical position and vowels are taken into account.
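The key extraction described above can be sketched as follows. This is only an illustration under stated assumptions: each syllable is represented as a (metrical position, vowel) pair, and the names Syllable, word_keys and syllable_sequences are hypothetical, not taken from the Musisque Deoque code. The position labels and vowels in the sample are placeholders, not a real scansion.

```python
from typing import List, Tuple

# Assumed representation: one (metrical_position, vowel) pair per syllable.
Syllable = Tuple[str, str]


def word_keys(words: List[List[Syllable]]) -> List[Tuple[Syllable, ...]]:
    """Build one key per word from the position and vowel of its syllables."""
    return [tuple(word) for word in words]


def syllable_sequences(
    verse: List[Syllable], sizes: Tuple[int, ...] = (4, 5)
) -> List[Tuple[Syllable, ...]]:
    """Return every run of consecutive syllables of the given sizes,
    ignoring word boundaries."""
    sequences = []
    for size in sizes:
        for start in range(len(verse) - size + 1):
            sequences.append(tuple(verse[start:start + size]))
    return sequences


# Placeholder verse of six syllables, split into two hypothetical words.
verse = [("p1", "a"), ("p2", "e"), ("p3", "i"),
         ("p4", "o"), ("p5", "u"), ("p6", "a")]
keys = word_keys([verse[:2], verse[2:]])
sequences = syllable_sequences(verse)
```

A verse of n syllables yields n - 3 sequences of 4 and n - 4 sequences of 5, which is why the sequence search produces many more candidate keys than the word search.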
Scoring and selection criteria
Both approaches of the co-occurrence search tool (lexical; metrical and verbal) produce an overabundance of results, but scholars
should not be overwhelmed by it.
Following this assumption, our choices start from the certainty that, however much the tool's discriminating ability may be improved,
significant results will always be surrounded by background noise. We therefore think it more profitable,
if perhaps less spectacular, not to aim at a perfect tool that hands scholars a neat, ready-made result,
but to supply them with a set of filters and different ways of reading the results,
helping them, under the irreplaceable guidance of their own acumen and experience, to find a nugget among the pebbles.
In practice, this means that we do not push the machine's automatic selection to its limits, so as to avoid the risk of losing valuable results,
which are often hidden where we do not expect them. Obviously, we could not do without a vigorous initial selection of the results,
designed to discriminate more at the lower levels than at the higher ones; that is, aimed more at removing the mediocre results, so that the excellent ones come to the surface.
Main criteria for the selection:
The scoring system for lexical co-occurrences is based on: identity of forms, order of the two words,
comparison between the distance separating the words in the source text and in the target, and position within the verse.
As regards this last criterion, each word that occupies the same significant position (first, second, penultimate, last)
in the two compared texts earns one point; moreover, if only one of the words occupies the same position but the distance between the words is also the same,
the result earns two points.
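The position criterion can be illustrated with a small sketch. It encodes one possible reading of the rule, which is an assumption: one point per word found in the same significant position, and two points when only one word matches but the distance between the two words is identical in source and target. The function names and the use of 0-based word indexes are hypothetical.

```python
from typing import Set, Tuple


def significant_positions(index: int, length: int) -> Set[str]:
    """Return the significant positions held by a word at a 0-based index."""
    labels = set()
    if index == 0:
        labels.add("first")
    if index == 1:
        labels.add("second")
    if index == length - 2:
        labels.add("penultimate")
    if index == length - 1:
        labels.add("last")
    return labels


def position_score(
    source: Tuple[int, int], target: Tuple[int, int],
    source_len: int, target_len: int,
) -> int:
    """Score the position criterion for a pair of co-occurring words.

    source and target hold the indexes of the two words in each verse.
    Assumed reading of the rule, not the project's actual implementation.
    """
    matches = sum(
        1
        for s, t in zip(source, target)
        if significant_positions(s, source_len)
        & significant_positions(t, target_len)
    )
    same_distance = (source[1] - source[0]) == (target[1] - target[0])
    if matches == 1 and same_distance:
        return 2
    return matches
```

Under this reading, a pair whose first word opens both verses and whose second word stands three places later in both scores two points, even though the second word is not itself in a significant position.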
Metrical and verbal co-occurrences
search for words: only the occurrences matching at least 4 syllables in one or more words are accepted;
the occurrences are then ranked by importance, on the basis of the number of corresponding syllables, of the proximity of the words found and
also of the equivalence of consonants;
search for sequences of syllables: only the occurrences with a certain number of identities in the consonantal part
of the syllables are accepted.
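The selection and ranking of the search for words can be sketched as a filter followed by a sort. The WordMatch fields and the precedence of the three ranking criteria (syllables first, then proximity, then consonants) are assumptions made for the example; the text above names the criteria but not how they are weighted.

```python
from dataclasses import dataclass
from typing import List

MIN_SYLLABLES = 4  # threshold stated for the search for words


@dataclass
class WordMatch:
    verse_id: str             # hypothetical identifier of the target verse
    matched_syllables: int    # syllables matched across one or more words
    word_gap: int             # distance between the words found (0 = adjacent)
    matching_consonants: int  # consonants that are also equivalent


def select_and_rank(matches: List[WordMatch]) -> List[WordMatch]:
    """Keep matches of at least MIN_SYLLABLES syllables, best first."""
    accepted = [m for m in matches if m.matched_syllables >= MIN_SYLLABLES]
    return sorted(
        accepted,
        key=lambda m: (-m.matched_syllables, m.word_gap, -m.matching_consonants),
    )


candidates = [
    WordMatch("v1", 3, 0, 2),  # rejected: below the threshold
    WordMatch("v2", 4, 2, 1),
    WordMatch("v3", 5, 1, 0),
    WordMatch("v4", 4, 0, 3),
]
ranked = select_and_rank(candidates)
```

A weighted sum of the three criteria would be an equally plausible design; the lexicographic sort is used here only because it is the simplest to read.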
Lexical co-occurrences: search by lemmas
In the lexical co-occurrence search, the Search by lemmas option is set by default. It is worth pointing out that this option follows the same
rules as the Musisque Deoque advanced search: the search by lemmas does not extend to all the inflected forms belonging to a lemma, but only to
those with the same number of syllables as the source form; on the other hand, the search can also extend to other lemmas, namely compounds that differ only in their prefix (for example, advenio and pervenio).
Verbal and metrical co-occurrences: combined search
One might think that the search for sequences of syllables, being less fine-grained than the search for words, would also include the results of the latter.
This is not the case, and it is not difficult to explain why:
the search for words can catch non-consecutive words, each below the detection threshold of a syllabic sequence (4 or 5),
but together yielding a result accepted by that kind of search (> 3 syllables); for example, two non-consecutive words of two syllables each
are not found by the second approach, even when it works on sequences of 4 (it would have to work on sequences of 2, but the number of results would then be unmanageable);
the scoring criteria of the two methods are necessarily quite different, and the second can exclude results accepted by the first.
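The first point can be checked on a toy case. In the sketch below each string stands for one syllable key (metrical position plus vowel); the keys and the function name shared_windows are hypothetical. Two two-syllable words match in source and target, but they are separated by syllables that differ.

```python
from typing import List, Set, Tuple


def shared_windows(
    source: List[str], target: List[str], size: int
) -> Set[Tuple[str, ...]]:
    """Return the runs of `size` consecutive syllable keys common to both verses."""
    def windows(keys: List[str]) -> Set[Tuple[str, ...]]:
        return {tuple(keys[i:i + size]) for i in range(len(keys) - size + 1)}

    return windows(source) & windows(target)


# Words A (A1 A2) and B (B1 B2) match; the syllables between them do not.
source = ["A1", "A2", "x1", "x2", "B1", "B2"]
target = ["A1", "A2", "y1", "y2", "B1", "B2"]

by_four = shared_windows(source, target, 4)  # no run of 4 is shared
by_two = shared_windows(source, target, 2)   # the two words are recovered
```

The search for words would accept this pair, since the two words together match 4 syllables, while every window of 4 consecutive syllables crosses the non-matching stretch and is therefore missed.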
We should therefore think of these two approaches not as interchangeable or alternative, but as complementary,
because, although their results partly overlap, each one provides some significant results that the other does not.
This is why they can be combined in a single call, which merges the two series of results and removes redundancies.
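A combined call of this kind amounts to merging two result lists and dropping duplicates. The sketch below assumes that each result is a dictionary carrying a verse_id field and that two results are redundant when they point to the same target verse; both the field name and the deduplication rule are assumptions.

```python
from typing import Dict, List


def combine_results(
    word_results: List[Dict], sequence_results: List[Dict]
) -> List[Dict]:
    """Merge the two series, keeping the first result found for each verse."""
    seen = set()
    combined = []
    for result in list(word_results) + list(sequence_results):
        key = result["verse_id"]
        if key not in seen:
            seen.add(key)
            combined.append(result)
    return combined


word_results = [{"verse_id": "v1", "method": "words"},
                {"verse_id": "v2", "method": "words"}]
sequence_results = [{"verse_id": "v2", "method": "sequences"},
                    {"verse_id": "v3", "method": "sequences"}]
merged = combine_results(word_results, sequence_results)
```

Because the two methods score results differently, a real merge would also need to decide which score to keep for a verse found by both; this sketch simply keeps the first one encountered.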