Skip to content

Commit a02441b

Browse files
authored
Merge pull request #5 from Lab42-Team/dev
Dev
2 parents 4266126 + 17008cf commit a02441b

File tree

306 files changed

+60620
-778
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

306 files changed

+60620
-778
lines changed

LICENSE

Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
MIT License
2+
3+
Copyright (c) 2025 Lab42-Team
4+
5+
Permission is hereby granted, free of charge, to any person obtaining a copy
6+
of this software and associated documentation files (the "Software"), to deal
7+
in the Software without restriction, including without limitation the rights
8+
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9+
copies of the Software, and to permit persons to whom the Software is
10+
furnished to do so, subject to the following conditions:
11+
12+
The above copyright notice and this permission notice shall be included in all
13+
copies or substantial portions of the Software.
14+
15+
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16+
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17+
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18+
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19+
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20+
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21+
SOFTWARE.

README.md

Lines changed: 65 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -1,32 +1,80 @@
1-
# [ Home ](https://github.com/STI-Team/ontogen) / ontogen
2-
OnotoGen: Software system for analyzing and transforming spreadsheets into ontologies and related data sets
3-
This document describes the command-line interface for the <code>OntoGen(ont)</code> system.
1+
# OntoGen
42

5-
## Installation Instructions
3+
A command-line software called **OntoGen** for analysis and transformation of source spreadsheet data (CSV) to ontology (OWL/XML).
4+
5+
## Version
6+
7+
1.1
8+
9+
## Preliminaries
10+
11+
A source (input) spreadsheet represents a set of same type entities in a relational form (a subset of the Cartesian product of *K*-data domains), where:
12+
1. *Attribute (a column name)* is a name of a data domain in a relationship schema;
13+
2. *Metadata (a schema)* is an ordered set of *K*-attributes of a relational table;
14+
3. *Tuple (a record)* is an ordered set of *K*-atomic values (one for each attribute of a relation);
15+
4. *Data (a recordset)* is a set of tuples of a relational table.
16+
17+
A spreadsheet of same type entities (*a canonicalized form*) is a relational table in the third normal form (3NF), which contains an ordered set of *N*-rows and *M*-columns.
18+
19+
A table represents a set of entities of the same type, where:
20+
1. *Categorical column or Named entities column (NE-column)* contains names (text mentions) of some named entities;
21+
2. *Literal column (L-column)* contains literal values (e.g. dates, numbers);
22+
3. *Subject (thematic) column (S-column)* is a *NE*-column represented as a potential primary key and defines a subject of a source table;
23+
4. *Another (non-subject) columns* represent entity properties including their relationships with other entities.
24+
25+
**Assumption 1.** *The first row of a source spreadsheet is a header containing attribute (column) names.*
26+
27+
**Assumption 2.** *All values of column cells in a source spreadsheet have same entity types and data types.*
28+
29+
**Assumption 3.** *Source spreadsheets should be presented in the CSV format.*
30+
31+
**OntoGen** supports the process of ontology engineering based on spreadsheet data transformation.
32+
33+
**Assumption 4.** *A target ontology is presented in the [OWL2 DL](https://www.w3.org/TR/owl2-overview/) format.*
34+
35+
## Installation
36+
37+
First, you need to clone the project into your directory:
638

7-
Run the following commands in order in a cmd.exe
839
```
9-
git clone https://github.com/STI-Team/ontogen
10-
cd ontogen
40+
git clone https://github.com/Lab42-Team/ontogen.git
41+
```
42+
43+
Next, you need to install all requirements for this project:
1144

12-
python3 -m venv ont_env
13-
ont_env\Scripts\activate.bat
14-
python.exe -m pip install --upgrade pip
45+
```
1546
pip install -r requirements.txt
1647
```
17-
If python3 is not installed, find out what version of python 3 is installed and use that instead
18-
### Usage: python main.py [OPTIONS]
48+
49+
*We recommend you to use Python 3.0 or more.*
50+
51+
## Directory Structure
52+
53+
* `datasets` contains datasets of source spreadsheets in the CSV format:
54+
* `tough-tables` contains [Tough Tables (2T)](https://zenodo.org/record/4246370#.Yf5AO-pBw2w) dataset, where noise spreadsheets are excluded;
55+
* `wiki-uku-49` contains spreadsheets describing the main concepts and relationships in the field of education, in particular, universities in the United Kingdom (see [wiki-UKU-49: United Kingdom Universities from Wikipedia](https://data.mendeley.com/datasets/33v9tk6jjb/1));
56+
* `isi-167e` contains spreadsheets describing the main concepts and relationships in the field of Industrial Safety Inspection (see [ISI-167E: Entity spreadsheet tables](https://data.mendeley.com/datasets/3gjy46mx88/1)).
57+
* `examples` contains spreadsheet examples for testing.
58+
* `ontogen` contains software modules (py-scripts), including `main.py`.
59+
* `results` contains processing results (target ontologies).
60+
61+
## Usage
62+
63+
#### Usage: python main.py [OPTIONS]
1964
**Options:**
2065
- `--name=c:\userpath` -- Create ontologies
21-
### A Simple Example
66+
#### A simple example
2267
```
23-
python main.py --name=f:\test
24-
68+
python main.py --name=C:/test
2569
```
2670
or
2771

2872
```
2973
python main.py
30-
Your path name: f:\test
31-
74+
Your path to source spreadsheets: C:/test
3275
```
76+
77+
## Authors
78+
79+
* [Daria A. Denisova](mailto:daryalich@mail.ru)
80+
* [Nikita O. Dorodnykh](mailto:tualatin32@mail.ru)
Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,2 @@
1+
Карта исходных данных,Программа работ по продлению срока безопасной эксплуатации (технического диагностирования),Акт готовности аппаратов к проведению технического диагностирования,Акт оперативной (функциональной) диагностики,Протокол проведения визуального и измерительного контроля 10-ВИК,Протокол проведения толщинометрии 10-УЗТ,Протокол и заключение акустико-эмиссионного контроля,Протокол проведения дефектоскопии 10-ПВК,Протокол проведения дефектоскопии 10-УЗК,Протокол измерения твердости 10-ТВ,Протоколы химического анализа,Расчет на прочность,Расчет остаточного ресурса,Перечень нормативной технической и методической документации использованной при экспертизе,Копия лицензии,Копия приказа,Копии удостоверений экспертов и специалистов
2+
на 6 листах,на 4 листах,на 1 листе,на 1 листе,на 4 листах,на 1 листе,на 1 листе,на 1 листе,на 1 листе,на 1 листе,на 2 листах,на 3 листах,на 2 листах,на 1 листе,на 1 листе,на 2 листах,на 1 листе
Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
Единца измерения,Наименование показателя,Результата испытаний,Регламентированное значение показателя для стали 40 по ГОСТ 1050-88
2+
%,С,0.41,0.37-0.45
3+
%,Si,0.25,0.17-0.37
4+
%,Mn,0.53,0.5-0.8
5+
%,Cr,0.05,не более 0.25
6+
%,Ni,0.06,не более 0.3
7+
%,V,-,-
8+
%,Mo,-,-
9+
%,Cu,0.04,не более 0.3
10+
%,P,0.023,не более 0.035
11+
%,S,0.036,не более 0.035
Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
Единца измерения, Наименование показателя,Результата испытаний,Регламентированное значение показателя для стали 40ХФА по ГОСТ 4543-71
2+
%,C,0.39,0.37-0.44
3+
%,Si,0.3,0.17-0.37
4+
%,Mn,0.8,0.5-0.8
5+
%,Cr,1.12,0.8-1.1
6+
%,Ni,0.11,не более 0.3
7+
%,V,0.18,0.1-0.18
8+
%,Mo,-,-
9+
%,Cu,0.12,не более 0.3
10+
%,P,0.022,не более 0.025
11+
%,S,0.025,не более 0.025
Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,5 @@
1+
Наименование элемента, Кол-во шт.,Размеры Диаметр внутренний,Размеры Толщина стенки,Размеры Длина (высота),Материал Марка,Материал Гост или ТУ
2+
корпус,1,1000,130,6000,немецкая сталь - аналог сталь 25,-
3+
крышка 2 Дн = 920 280 -
4+
шпилька 16 W94x1/4'' - 640 немецкая таль CrNiMo -
5+
гайка 16 W94x1/4'' - 100 немецкая сталь S3 - аналог сталь 40
Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,5 @@
1+
Обозначение,Назначение,Кол. шт.,Условный переход мм,Условное давление МПа,Место установки,Материал Марка,Материал ГОСТ или ТУ
2+
А,вход газа 1,90,32,крышка нижняя,-,-
3+
Б,выход газа,1,90,32,крышка верхняя,-,-
4+
В,под уровнемер,2,15,32,корпус,-,-
5+
Г,выход конденсата,1,40,32,корпус,-,-
Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
Давление МПа,проектные,фактические
2+
рабочее,32.5,32
3+
расчетно,32.5,32
4+
разрешенное,32.5,32
5+
пробное гидроиспытания,42.5,40
6+
настройка предохранительного клапана,-,-
Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,4 @@
1+
Название элемента,Химический состав % С,Химический состав % Si,Химический состав % Mn,Химический состав % P,Химический состав % S,Химический состав % Cr,Химический состав % Ni,Химический состав % Mo,Химический состав % V,Химический состав % W,Механические свойства Gt_20 МПа,Механические свойства Gt_20 МПа,Механические свойства аН,Механические свойства b %,Примечание
2+
корпус,0.25,0.31,0.6,0.036,0.031,-,-,-,-,-,360,540,-,-,137-154
3+
крышка верхняя,0.26,0.2,0.51,0.033,0.019,-,-,-,-,-,230,420,-,-,151
4+
крышка нижняя,0.26,0.18,0.51,0.022,0.024,-,-,-,-,-,230,420,-,-,151
Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,2 @@
1+
наименование элемента сосуда (аппарата),проектная толщина мм,фактическая минимальная толщина мм,организация проводившая контроль; номер и дата протокола
2+
корпус,130,126.9,ОАО Рога и Копыта Прот. 10-УЗТ от 12.12.12 заключение 10-1 от 12.12.12

0 commit comments

Comments
 (0)