home code faq docu dev why research journal events download contact who links links


This is a page of links of related project, ongoing research, inspiring ideas and other.
Feel free to add your suggestions and receipts, if you guess the wiki pswd.


Madrona - Open Source web interface for Museums archives

Open Archives - Standards for Web Content Interoperability

OSHA - Collect, preserve, and analyze the history of open source as a cultural movement


MD5/pearl - Perl interface to the MD5 Algorithm

MD5 - widely used, partially insecure cryptographic hash function with a 128-bit hash value.

Change Log

changelog - This is an xsl stylesheet for generating a classic GNU-style Change Log from a subversion repository log. It is made from several changelog-like scripts using common xslt constructs found in different places.


Django - Django is a high-level Python Web framework that encourages rapid development and clean, pragmatic design.

Developed four years ago by a fast-moving online-news operation, Django was designed to handle two challenges: the intensive deadlines of a newsroom and the stringent requirements of the experienced Web developers who wrote it. It lets you build high-performing, elegant Web applications quickly.


Fenfire - Everything has to do with everything else.

Fenfire is a Free Software project developing a computing environment in which you can express these relationships and benefit from them. Fenfire is a desktop environment which allows you to express relationships.


BSD license - BSD license template [OWNER, ORGANIZATION and YEAR].

copyzero - Copyzero is a mechanism of protection of works through it uses of qualified digital signature and a time mark.

Free Art License - The Free Art License grants the right to freely copy, distribute, and transform creative works without infringing the author’s rights, allowing everyone to use creations of the human mind in a creative manner, regardless of their types and ways of expression.

Open Knowledge - The Open Knowledge Definition: “A piece of knowledge is open if you are free to use, reuse, and redistribute it”.


LSI - Latent Semantic Indexing. Latent semantic indexing adds an important step to the document indexing process. In addition to recording which keywords a document contains, the method examines the document collection as a whole, to see which other documents contain some of those same words. LSI considers documents that have many words in common to be semantically close, and ones with few words in common to be semantically distant. This simple method correlates surprisingly well with how a human being, looking at content, might classify a document collection. Although the LSI algorithm doesn’t understand anything about what the words mean, the patterns it notices can make it seem astonishingly intelligent.

LSA - Latent Semantic Analysis is a technique in natural language processing, in particular in vectorial semantics, of analyzing relationships between a set of documents and the terms they contain by producing a set of concepts related to the documents and terms.

Mapping and Representation

/etc/groups - Database Configuration, a web application made with prefuse for the visualization of big sets of data, mainly aimed at visualizing groups and organizations and the relations between them. It has been developped in the context of the memory project, a project to analyze social movements who participated to social forums. It offers features of data filtering and a graphical search engine, that makes it useful for using it as a yellow page directory.

GFP - Generic Frame Protocol, a set of functions that support a generic interface to underlying frame representation systems (FRSs).

Open Knowledge Base Connectivity - Open Knowledge Base Connectivity (OKBC) is an application programming interface for accessing knowledge bases stored in knowledge representation systems (KRSs). OKBC is a successor of Generic Frame Protocol (GFP) which was primarily aimed at systems that can be viewed as frame representation systems.

Protege - a free, open source ontology editor and knowledge-base framework.


Dublin Core - The Dublin Core Metadata Initiative is an open organization engaged in the development of interoperable online metadata standards that support a broad range of purposes and business models. The Dublin Core Metadata Element Set is a vocabulary of fifteen properties for use in resource description.

metadata transmission - The transmission collective work-group on Metadata, and the status of the art from the independent media perspective.

metadata wikipedia - Definition of metadata from Wikipedia. “Metadata is data about data”.

metadata W3 - W3 definition: “Metadata is machine understandable information for the web”.

The Meta Element (Smil) - This section defines the elements and attributes that make up the functionality in the SMIL Metainformation module. The meta element is an empty element. Each meta element specifies a single property/value pair in the name and content attributes, respectively.

Microformats Video - Video metadata model. Currently, video objects are very hard to search for on the web. There a lack of a simple taxonomy for users to implement, and most of the technology used to derive meaning is basically dependent on what text is available from a corresponding web page. The idea is to create a practical, easy-to-use element set that can both be implemented quickly by software developers and archivists, while also being intuitive to artists, photographers, film makers etc. A metadata system that makes sense to artists will encourage them to become more involved and add more information about their works. One starting point could be to begin with a dc-compatible subset that is already in use by numerous applications (and that, by definition, ensures compatibility with all W3C Metadata Formats), and build from there a larger framework capable of encompassing all media objects. Every attempt should be made to be compatible with the following vocabularies, and, most likely, several others: Dublin Core Metadata Element Set, SMIL (Synchronized Multimedia Integration Language), Media RSS, and the Internet Archive’s metadata format.

MRSS - An RSS module that supplements the <enclosure> element capabilities of RSS 2.0 to allow for more robust media syndication.

Tei - Text Encoding Initiative, one of the oldest Metadata standard. Started in 1987, TEI becomes an international standard that represents all kinds of literary and linguistic texts for online research and teaching, using an encoding scheme that is maximally expressive and minimally obsolescent. The new TEI Consortium was set up in December 2000 to continue maintaining and developing the TEI standard. TEI mainly focuses on the exchange of textual information, but other formats such as image and sound are also addressed. One of the main characteristics of TEI is that its descriptive metadata is created as the header (known as TEI header) of the encoded file (known as the TEI text proper). The format of TEI encoding scheme is based on the SGML standard.

Annual Symposium 2008: Force of Metadata @ Goldsmiths, University of London - Saturday, 29 November 2008.

In the age of digital media, metadata’s power has grown to administer access, pre-decide preferences, enable surveillance, automate transtextuality, and shape our experience. How does a social imaginary operate with the means and within the limits of metadata management? Can metadata acquire the power to generate content?


Idiki - Idiki is a wiki of ideas organised in categories. Idiki is the natural evolution of the philosophy of global, distributed and uncontrolled information. Ideas can’t be possessed. If Virtual Entity, on its side, depict the end of ownership towards a world of creations, and Creators, similarly Idiki proposes a word of independent ideas and Inventors.

Image Tracer - a research tool that archives Google image searches for the purposes of tracking their url, appearance, disappearance and rank.

Okkam - The OKKAM project aims at enabling the Web of Entities, namely a virtual space where any collection of data and information about any type of entities (e.g. people, locations, organizations, events, products, …) published on the Web can be integrated into a single virtual, decentralized, open knowledge base, like the Web did for hypertexts. The main difference between this approach and the Virtual Entity is that Ve focuses on files and Cultural Units as actors of the web: people, locations, organizations and events are not so important in its approach. If Okkam, in this sense, appears to maintain a web2.0 attitude, where the inspection is concerning the users, Ve is imagining users as satellites, bricoleurs, creators of files. The person becomes a filamentous connection between files entities.

URL - Uniform Resource Locators. Tim Berners-Lee, Internet Draft, CERN, October 1993 Uniform Resource Locators (URL) - A Unifying Syntax for the Expression of Names and addresses of Objects on the Network

imdb - Welcome to the Internet Movie Database. The Internet Movie Database originally began by collecting credits from movies. Who made them? Who was in them? As we’ve grown, we’ve started cataloging more and more information about the people who help make movie and TV magic. This is a good example of systematic use of metadata for archiving.


RFC - In computer network engineering, a request for comments (RFC) is a memorandum published by the Internet Engineering Task Force (IETF) describing methods, behaviors, research, or innovations applicable to the working of the Internet and Internet-connected systems. The RFC Editor assigns each RFC a unique serial number. Once assigned a number and published, an RFC is never rescinded or modified; if the document requires amendments, the authors publish a revised document. Therefore, some RFCs supersede others; the superseded RFCs are said to be deprecated, obsolete, or even obsoleted (sic). Together, the serialized RFCs compose a continuous historical record of the evolution of Internet standards and practices. The RFC production process differs from the standardization process of formal standards organizations such as ISO. Internet technology experts may submit an Internet Draft without support from an external institution. The official source for RFCs on the World Wide Web is the RFC Editor. Unofficially, they are obtainable from a multitude of mirrors accessible via the Hyper-Text Transfer Protocol, anonymous FTP, the gopher protocol, and other prominent application layer protocols.


RIB - RIB is a software package for creating WWW metadata repositories. Metadata, from RIB’s perspective, is information that describes reusable objects, such as software. RIB allows the user to enter metadata into a user friendly java applet which then sends the information to a RIB server via HTTP. The information is then stored in an SQL database where it is automatically made available in a fully functional web site (catalog, search page, etc). Repositories which use similar data models can use the XML processing capabilities of RIB to share information via the Internet.


BIDM - Basic Interoperability Data Model. The BIDM consists of classes that have attributes and relationships with other classes. The four BIDM classes are Asset, Element, Library, and Organization. An Asset is any reusable entity in the software life cycle process – e.g., a software package or routine. An Element is a single file. A Library contains assets. An Organization may be a person, company, research group, etc., that creates and manages an Asset or Library. The ACF extension to the BIDM includes additional classes for information about Asset Certification, also called software review or evaluation. Definitions of these additional classes are provided in the Glossary. Simple.conf FullBIDM.conf

RIB quickguide - Brief instructions on setting up and populating an interoperable software repository using the RIB software. It is beyond the scope of this document to provide detailed explanations on how to install, configure, or use the RIB software. See the RIB Installation Guide or the RIB User’s Guide. Both of these documents provide detailed directions for installing and using the software.


RSS - Wikipedia definition: RSS is a family of Web feed formats used to publish frequently updated works – such as blog entries, news headlines, audio, and video – in a standardized format. Really Simple Syndication, Rich Site Summary or RDF Site Summary.


RDF wikipedia - Resource Description Framework (RDF) is a family of World Wide Web Consortium (W3C) specifications, originally designed as a metadata data model, which has come to be used as a general method of modeling information through a variety of syntax formats.

RDF W3 - a general-purpose language for representing information in the Web.

RSS Versioning

Sublety Create an RSS Feed of a Public SVN Repository

rss/svn - Bringing RSS to SVN: subverssed script description.

SubveRSSed - Parse the script.

Subversion Importing - Importing a subversion repository into git

Semantics and Ontology

KIF - Knowledge Interchange Format is a language designed for use in the interchange of knowledge among disparate computer systems (created by different programmers, at different times, in different languages, and so forth). KIF is not intended as a primary language for interaction with human users (though it can be used for this purpose). Different computer systems can interact with their users in whatever forms are most appropriate to their applications (for example Prolog, conceptual graphs, natural language, and so forth).

LSI - Latent Semantic Indexing or Latent semantic analysis (LSA) is a technique in natural language processing, in particular in vectorial semantics, of analyzing relationships between a set of documents and the terms they contain by producing a set of concepts related to the documents and terms.

MCO - Media Contract Ontology is an ontology for representing contracts on multimedia content, in the framework of MPEG-21.

MVCO - The Media Value Chain Ontology (MVCO) is an ontology for formalizing the representation of the Media Value Chain. It represents the Intellectual Property (IP) along the Value Chain. There are different kinds of objects of the Intellectual Property (we call them IP Entities) and different actions that are performed on them, what defines the different roles that users can play regarding these IP Entities. These elements, along the permissions to execute the actions, constitute the essence of the MVCO.

Ontology - Wipedia definition: “the study of the nature of being”.

W3 Road Map - The Semantic Web Road Map. Text by Tim Berners-Lee, September 1998. An attempt to give a high-level plan of the architecture of the Semantic WWW. Editing status: Draft. “The Web was designed as an information space, with the goal that it should be useful not only for human-human communication, but also that machines would be able to participate and help. One of the major obstacles to this has been the fact that most information on the Web is designed for human consumption, and even if it was derived from a database with well defined meanings (in at least some terms) for its columns, that the structure of the data is not evident to a robot browsing the web. Leaving aside the artificial intelligence problem of training machines to behave like people, the Semantic Web approach instead develops languages for expressing information in a machine processable form.”

semanticweb.org - The Semantic Web Wiki.

SVD - Singular Value De-composition

sioc - The SIOC initiative (Semantically-Interlinked Online Communities) aims to enable the integration of online community information. SIOC provides a Semantic Web ontology for representing rich data from the Social Web in RDF. The SIOC Core Ontology definitions presented here are written using a computer language (RDF/OWL) that makes it easy for software to process some basic facts about the terms in the SIOC Core Ontology, and consequently about the things described in SIOC documents. A SIOC document, unlike a traditional Web page, can be combined with other SIOC and RDF documents to create a unified database of information.

SMIL - SMIL is a W3C Recommendation for an XML-based language for expressing the properties of largely web-based multimedia presentations. SMIL includes temporal behavior, methods for associating hyperlinks with a media object, and describing layout presentation on a screen.(Scalable Vector Graphics SVG - note that it can be used to draw video boxes).

WonderWeb - Ontology Infrastructure for the Semantic Web (2002 - 2004). The project aims to develop the INFRASTRUCTURE required for the large-scale deployment of ontologies as the foundation for the Semantic Web. This will involve not only the establishment of Web standard ontology languages, but also the parallel development of ontological engineering technology. OBJECTIVES The main objectives of the project are: - the development of a family of ontology languages that extend existing Web standards while maintaining maximum backwards compatibility; - the development of the comprehensive technical infrastructure and tool support that will be required for the development and deployment of ontologies; - the development of a set of foundational ontologies covering a wide range of application domains, each providing a carefully crafted taxonomic backbone with a sound high level structure that can be used as the basis for the development of more detailed domain ontologies; - the development of a framework of techniques and methodologies that will provide an engineering approach to the building and use of ontologies, dealing in particular with integration, migration, reconciliation and sharing.


SVD - In linear algebra, the singular value decomposition (SVD) is an important factorization of a rectangular real or complex matrix, with several applications in signal processing and statistics. Applications which employ the SVD include computing the pseudoinverse, least squares fitting of data, matrix approximation, and determining the rank, range and null space of a matrix.

SVD Tutorial -

Software Versioning Systems

GIT - Git is an open source version control system designed to handle very large projects with speed and efficiency, but just as well suited for small personal repositories.

GITHub - social code hosting

Tim Berners-Lee

Design Issues - Architectural and philosophical points. These statements of architectural principle explain the thinking behind the specifications. These are personal notes by Tim Berners-Lee: they are not endorsed by W3C. They are aimed at the technical community, to explain reasons, provide a framework to provide consistency for for future developments, and avoid repetition of discussions once resolved.

What is w3? - video presentation at Ted, February 2009.


SQL - SQL is a standard interactive and programming language for querying and modifying data and managing databases.

Virtual Square

Virtual Square - Virtual Square means that you have pieces of hardware and data pipes. The concept of computer ceases to be physical. Many processors or cases can be a single computer while one single processor can be several computers. Than you have a data pipe provider (or several connections), not an Internet provider as you can freely decide the mapping between computers and links. You are part of as many Internets as you want or like to.


Javascript Infivis Toolkit - tools for creating Interactive Data Visualizations for the Web.

Graph Visualization - Graph visualization is a way of representing structural information as diagrams of abstract graphs and networks. Automatic graph drawing has many important applications in software engineering, database and web design, networking, and in visual interfaces for many other domains.


Xanadu - This system of literature (the “Xanadu Docuverse”) must allow people to create virtual copies (“transclusions”) of any existing collection of information in the system regardless of ownership. In order to make this possible, the system must guarantee that the owner of any information will be paid their chosen royalties on any portions of their documents, no matter how small, whenever and wherever they are used.

Trans Literature - A Humanist Format for Re-Usable Documents and Media.


XML - The Extensible Markup Language (XML) is a general-purpose specification for creating custom markup languages.[1] It is classified as an extensible language because it allows its users to define their own elements. By adding semantic constraints, application languages can be implemented in XML.

XSL - XSL is a language for expressing style sheets.


URI - a Uniform Resource Identifier (URI) is a compact string of characters used to identify or name a resource on the Internet. The main purpose of this identification is to enable interaction with representations of the resource over a network, typically the World Wide Web, using specific protocols. URIs are defined in schemes defining a specific syntax and associated protocols.

Proposal for HyperText - HTML is the language that powers the Web in many respects, as the lingua franca that Web browsers are expected to be able to render. HTML has had unprecedented levels of success, and the uptake is all the more surprising when you realise that it was only invented in 1990, and few people knew about it before 1993.