Product Details

Lemmiwinks framework, MozArch application

Created: 2018

Czech title
Lemmiwinks framework, MozArch aplikace
Type
software
License
Any other entity must always acquire a license before using the result
License Fee
The licensor does not require a license fee for the result
Authors
Serečun Viliam, Ing.
Veselý Vladimír, Ing., Ph.D. (DIFS)
Keywords

Web archiving, Mozilla Archive Format, Web scraping, Web indexing, Lemmiwinks, MultiFunctional Index Scraping Tool

Description

Many legal institutions require web content to be preserved in a form that can serve as evidence. These tools address the problem of automating web page extraction and web archiving. The main goal is to provide open-source solutions that satisfy the requirements of legal institutions. This work comprises two main products. The first is the Lemmiwinks framework, which is the cornerstone for developing applications for website extraction and archiving. The second is MozArch, a prototype that demonstrates the use of the framework. The MozArch output is a MAFF file that includes the extracted web page, a screenshot of the website, and a table of meta-information such as IP addresses, ports, and a timestamp.
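
Because MAFF (Mozilla Archive Format) is a ZIP-based container, the contents of such an output file can be inspected with standard tools. The following is a minimal sketch in Python, not taken from MozArch itself. The helper names and the member names inside the sample archive (page/index.html, page/index.rdf, page/screenshot.png) are hypothetical stand-ins, and the layout MozArch actually produces may differ.

```python
import io
import zipfile


def list_maff_entries(source):
    """Return the sorted member names of a MAFF archive (a ZIP container)."""
    with zipfile.ZipFile(source) as archive:
        return sorted(archive.namelist())


def build_sample_maff():
    """Build an in-memory stand-in archive; all member names are hypothetical."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("page/index.html", "<html></html>")
        archive.writestr("page/index.rdf", "<RDF/>")
        archive.writestr("page/screenshot.png", b"")
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    # Print each member of the stand-in archive, one per line.
    for name in list_maff_entries(build_sample_maff()):
        print(name)
```

The same helper accepts a path on disk, so a real output file could be listed by passing its file name instead of the in-memory sample.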

Location

Source code for the Lemmiwinks framework: https://github.com/nesfit/Lemmiwinks
Source code for the MozArch application: https://github.com/nesfit/mozarch

License Conditions

Both programs are offered under the MIT license

Projects
Integrated platform for analysis of digital data from security incidents, MV, Bezpečnostní výzkum České republiky 2015-2020, VI20172020062, 2017-2020, running