Personal tools

Skip to content. | Skip to navigation

 
You are here:
Document Actions

4. Method

Up one level

Method

 

Strategic Goals

Strategic Goals

"lingua franca", long term storage, common metadata, self-explanatory, extensible,

Strategic Goals

The AKOMA NTOSO model has been informed by the following strategic goals:

  1. To create a "lingua franca' for the interchange of parliamentary, legislative and judiciary  documents between institutions in Africa. For example, Parliament/Court X should be able to easily import a piece of legislation made available in AKOMA NTOSO format by Parliament/Court Y. The goal here is to speed up the process of drafting new legislation/writing sentences/etc. by reducing the amount of re-keying, re-formatting etc. required. 
  2. To provide a long term storage and access format to parliamentary, legislative and judiciary  documents that allow search, interpretation and visualization of such documents several years from now, even in the absence of the specific applications and technologies that were used to generate them.
  3. To provide an implementable baseline for parliamentary, legislative and judiciary  systems in African institutions. It is envisaged that this will lead to one or more systems that provide a base layer of software "out of the box" that can then be customized to local needs. The goals here are twofold. Firstly, to facilitate the process of introducing IT into African institutions. Secondly, to reduce the amount of re-invention of the wheel that would result if all institutions pursued separate IT initiatives in the area of parliamentary, legislative and judiciary document production and management. 
  4. To create a common data and metadata models so that information retrieval tools & techniques used in Parliament/Court X can be also be used in Parliament/Court Y. To take a simple example, it should be possible to search across the document repositories of multiple Parliaments/Courts in a consistent and effective way. 
  5. To create  common resource naming and resource linking models so that documents produced by Parliaments/Courts can be easily cited and cross-referenced - either by other Parliaments/Courts or by other users. 
  6. To be "self-explanatory", that is to be able to provide all information for their use and meaning through a simple examination, even without the aid of specialized software.
  7. To be "extensible", that is it must be possible to allow local customisations to the models within the AKOMA NTOSO framework so that local customisation can be achieved without sacrificing interoperability with other systems.

 


Simple data model

Simple data model

identify a number of basic, fundamental classes of structures

The AKOMA NTOSO document model is designed, first and foremost, to be actually used. As a consequence, a high premium has been placed on simplicity throughout its design. Data models created to handle complex document types (as legislation) need to deal with two apparently opposed requirements: on the one hand, they need to be sufficiently sophisticated to handle all possible occurrences and situations that may occur in the actual documents. On the other, they need to be speedily understood and used by the people who would need to apply these models.

These opposed requirements can be jointly satisfied not by simplifying the vocabularies of available structures and elements, which would reduce the available descriptive sophistication of the language, but rather by simplifying the structure variability and types (in XML parlance, the content models), thereby reducing the learning time and the software complexity without compromising a full and detailed descriptive power of the language. The idea therefore is to identify a number of basic, fundamental classes of structures (containers, hierarchies, blocks, etc.) that can be immediately understood and used appropriately, regardless of their actual names.

 


Ability to evolve

Ability to evolve

built to stand evolutions and changes over time

A critical attribute of a successful XML model is its ability to evolve over time. This "evolvability' has been a key concern in the creation of the AKOMA NTOSO model. Thus, although the General Schema is built to stand evolutions and changes over time, each individual Detailed Schema can be customized at will even in time, and still be made compatible with the overall AKOMA NTOSO infrastructure and the General Schema.

Furthermore, the General Schema is built to stand evolutions and changes even regarding the number of actual functionalities provided: features such as the number of metadata, or the automatic generation of amended text, or the activation of special analysis tools on the text may require with time the evolution of the schema. In these cases, it can be guaranteed that existing documents already marked up according the initial versions of AKOMA NTOSO will be either immediately compatible with the new schemas, or easily convertible to it via a single XSLT stylesheet to be provided.

 


Validation

Validation

verifying the correctness of an XML document against specific schema

Validation is the act of checking the correctness of an XML document according to some pre-defined structural rules expressed in one or more DTDs and XML Schemas. The validation step verifies whether the XML document contains, in number and position, all the expected elements of the type this document is an instance of. The AKOMA NTOSO schema could impose a number of constraints and restrictions on the final form of the XML document, requiring for instance a specific order in the containment of parts or that any part is preceded by a heading, etc.

The problem with being too restrictive in the constraints of the schema is that, e.g. the Parliaments may have approved, and may decide to approve in the future, documents that do not conform to these rules: in most countries there are guidelines for the correct drafting of legislation, but this is just what they are: guidelines, that can be ignored and modified at will by a higher authority such as a Parliament or Court.

This fact has a very important effect on the generation of XML versions of documents: everything that gets approved by Parliaments or written by Courts has to be accepted by the system, and everything that has already been approved even more so. Therefore, failing XML validation (i.e., violating one or more of the constraints and restrictions expressed in the schemas) cannot have the effect of rejecting documents, but, at most, of pointing out issues and differences from the guidelines that the authority itself, if it wants and has time to spend on this, can consider for editing and modifications.

In reality there are two different actors in the complex issue of validating a piece of legislation: the legislator, who is writing the actual content of the document, and the marker, who is converting it into XML by identifying all interesting bits of the text and correctly marking them up using the AKOMA NTOSO vocabulary.

If we consider the validation schema a contract, then this contract clearly binds only the marker, leaving the legislator/judge absolutely free to do as he/she chooses. Thus compliance to rules such as "An identifier will always be added to each substructure of the act" or "The enactment date will be specified" can be safely required, as they bind the behaviour of the marker only, while structural rules (such as "Every subpart will have a heading", or "A section will contain paragraphs which contain clauses") cannot be imposed, as they would interfere with the authority and independence of the legislator, which in the case of Parliaments  is most often complete and not constrainable by more mundane requirements such as the adherence to an abstract document structure.

Forcing markers to fully describe in XML all document parts, and yet leaving to the legislator the maximum freedom in writing, may seem incompatible and hard-to-reach goals, but they can be and are reached in the AKOMA NTOSO framework. AKOMA NTOSO clearly separates data and metadata, thereby clearly distinguishing the contribution of the legislator (data) and the contribution of the marker (metadata); AKOMA NTOSO provides a richly evocative vocabulary of structures and elements, so that the marker can correctly and precisely describe what is actually contained in the documents. AKOMA NTOSO imposes little or no constraints on data, letting the legislator write and organize the text matter as he wishes, but imposes a number of constraints on the metadata, forcing the marker of texts to provide all bits of information that are necessary to manage and organize the documents.

Yet it might be appropriate to also give guidance and help in following the drafting guidelines enacted in each country. This is the reason to provide both a General Schema(GS) and several Detailed Schemas (DSs): the GS is fully  descriptive, only binding the marker and not the legislator, but allowing the marker to describe as precisely as possible the actual structure of the document as approved and generated by the Parliament. The DSs are more prescriptive, and are used to check whether the document actually conforms to the existing legal drafting guidelines in each individual country. Successful validation of documents will only be required against the GS, as errors would signal incorrect markup from the marker, while the DSs can be used, at the discretion of the Parliament itself, to automatically check conformance of the proposed bill against the drafting guidelines, and thus be able to modify it accordingly in case conformance is sought.

The descriptive (GS) and prescriptive (DS) schema, of course, are closely related. As mentioned, they both use the same vocabularies, and differ in the number of rules and constraints they enact. All rules enforced in the GS are also enforced in the DSs, so that, correspondingly, all documents that are valid according to one of the DSs  (i.e. are conformant to a set of stricter rules) are also valid according to the GS (i.e., they are also conformant to the set of shallower rules).

Furthermore, this pair of schema classes also allows interoperability of systems dealing with documents coming from different African countries. In fact the descriptive GS schema can work on all AKOMA NTOSO documents of all the interested African countries, and can be used as the baseline for accessing and displaying documents regardless of their provenance.

On the other hand each prescriptive DS schema is created to deal with the specific guidelines of each individual country, helping the legal drafting process and the correct preservation of cultural peculiarities of each individual country. The fundamental commonality of GS and DSs provides therefore full description of individual and country-specific document types without renouncing interoperability and document interchange.

 


Tools

Tools

editor, converter, name resolver, post-editing tools

Just as many are the users (some of whom are not even aware of the fact they are using or relying on AKOMA NTOSO-compatible systems), many also are the tools that will be created around the AKOMA NTOSO document model. Some of them are basic tools that are necessary for the AKOMA NTOSO system to work at all. Others are additional applications that will be created once the basic tasks have been catered for.

Although this is not the place to provide a full list of the foreseeable tools, a brief list of the main categories may help in explaining the breadth and variety of the AKOMA NTOSO project, and the number of issues that need to be considered in the development of the data formats.

The editor

The editor is the fundamental tool for the generation of XML versions of legislation or judgement, etc. Although not all drafting needs to be actually done on a specialized editor (much less an XML editor) in any real life scenario, there will be situations in which that will be possible and actually necessary.

The editor will be used in three different scenarios:

  1. as an interface to activate, control and verify the automatic conversion tool previously described. Through the editor "the legal drafter" will be able to verify the correctness of the conversion, and change and add whatever the conversion engine has forgot or misidentified.

  2. as a tool to manually mark-up a document provided in a different format. Depending on the sophistication of the conversion engine, this scenario will most probably blend naturally with the first one. Surely the editor will provide for functionalities to edit and add any kind of AKOMA NTOSO-conformant markup, and will be able to check validity of the intermediate result.

  3. as an application for direct insertion of both text and markup, starting off an empty document: this will probably be the rarest scenario of use, as the drafting offices will most usually work off an existing document in some other format.

The convertor

It is the converter, with the editor, the most fundamental tool for the AKOMA NTOSO system. It will take some convincing for "the drafter" to switch from her old faithful word processor and her manual system of handling amendments through a combination of glue and scissors, to use any kind of strange text editor. In the meantime, one of the most important tools will be the converter.

The converter has the double purpose of converting into AKOMA NTOSO files the documents that ?the drafter? is still producing traditionally, and, most importantly of all, of converting into AKOMA NTOSO files the legacy documents, the already approved bills and acts that form the current legislative situation of this African country, and whose conversion to XML is needed for any hypertext web of references to work at all. Since legacy documents are, by definition, in any old format, and since "the drafter" is not interested in converting them into XML using an editor, "the toolmaker" will have to create an automatic mechanism for the task anyway.

The converter is based on the idea of semi-automatic conversion, i.e., it has automatic processes to determine as correctly as possible the actual interesting structures, and has a manual process to confirm (or, if there is an error, to edit) the inferences made by the automatic process. In fact, this application could even be one of the modules of the editor, and use the editor itself for corrections to the automatic inferences of the converter. Of course, the amount of human editing is inversely proportional to the sophistication of the converter, and in theory large quantities of documents could be processed automatically with little or no manual intervention.

The converter works by examining the typographical and textual regularities of the document, and inferring a structural or semantic role for each text fragment. For every fragment that has no deducible structural or semantic role, the presentation characteristics will be recorded instead and it will be left to the human user to infer the structural or semantic role (if any) needs to be associated to the fragment.

E.g., experiences with European laws show that the basic structure of the bill (sections, subsections, clauses, preambles, conclusions, attachments, etc.) can be inferred automatically with great precision and few errors. The most important semantic elements, references and dates, can also be deduced automatically with great precision as long as the human-readable text used for them uses one of a limited number of acceptable forms. More complex structural elements (explicit modifications, specialized terms, people, etc.) might be difficult to catch automatically, but not impossible.

Name resolvers

The AKOMA NTOSO Naming Convention is a standard mechanism for creating identifiers of documents that can be used for accessing content and metadata regardless of storage options and architecture.

AKOMA NTOSO documents will be stored on networked computers and accessible by specifying their addresses. Yet these addresses are extremely dependent on the specificities of the architecture that will be in vogue or appropriate for the economic and technical context of the moment. It is extremely inappropriate, therefore, that any content or structure that is planned to last for more than a short period of time is given direct access to the physical address of the document in the form that will be eventually used for display.

For this reason, the AKOMANTOSO Naming Convention specifies an architecture-independent URI address for all relevant structures of the AKOMANTOSO standard, which cannot be used directly for accessing these structures.

A Name resolver is a software tool that can, given an architecture-independent URI, identify the resource being sought and provide the current architecture-dependent address that needs to be used at any given time for actual access.

Name resolvers are either indirect (in that they redirect the client application to the current address of the requested document) or direct (in that they immediately provide the requested document by generating the actual address and requesting the document as a proxy for the initial client application).

 

Post-editing tools

The post-editing tools are a number of validation, enrichment, and storage tools that are used after ?the legal drafter? has finished her editing job. All these tools require no user-interface to speak of, are managed automatically or by the system administrator of the storage centre for all AKOMA NTOSO documents. These tools include at least (but the list might be longer and more sophisticated):

  • A content and structure validator that checks the correctness of the document instance with regard to the AKOMA NTOSO schema document, and any additional rules that were added locally.

  • A reference validator that checks whether all references contained in the document already belong to the document collection and are correctly referenced.

  • A metadata validator that checks whether the metadata stored with the document are correct and complete.

  • A sophisticated and complex document management system, with search engines, hypertext functionalities, XSLT support and versioning facilities. 

  • An XSLT stylesheet (or a series thereof) to create visualizations of individual documents for a number of browsers and applications that will increase and get more sophisticated in time.