# Active Learning Loading Case Selection

Companion repository for the paper "Active Learning Loading Case Selection". It includes the notebooks and data needed to reproduce all results of the paper.

# Included Files

## Notebooks 
  - Spreadsheet aggregation.ipynb

     The Spreadsheet aggregation notebook performs the Extract, Transform, Load (ETL) part of the work. It converts the data from the different files into an integrated feature dataframe that encodes current and wave data in a format more amenable to Gaussian Process Regression.

  - Single-Target random and active learning for loading case selection.ipynb

     As an example
  
  - Multi-Target random and active

## Data Files 

   The project involves several files, which fall into three categories: case definition, results data and intermediary files.

   Case definition files contain the data necessary to characterize a loading case, and therefore to run the simulation. Each loading case requires the definition of a current and a wave.

   Results data corresponds to the information produced by applying the machine learning inference process over the target dataset, and iterating this process either through random sampling or through active learning sampling. 
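The difference between the two sampling schemes can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the notebooks: the toy 1D pool, the synthetic `simulate` function standing in for the riser simulation, the fixed-hyperparameter Gaussian process written directly in NumPy, and the maximum-variance selection rule used for the active strategy.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=0.2):
    """Squared-exponential kernel between the rows of a and the rows of b."""
    sq_dist = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-0.5 * sq_dist / length_scale**2)


def gp_posterior(x_train, y_train, x_query, noise=1e-6):
    """Posterior mean and variance of a zero-mean, unit-variance GP."""
    k_tt = rbf_kernel(x_train, x_train) + noise * np.eye(len(x_train))
    k_qt = rbf_kernel(x_query, x_train)
    mean = k_qt @ np.linalg.solve(k_tt, y_train)
    var = 1.0 - np.einsum("ij,ji->i", k_qt, np.linalg.solve(k_tt, k_qt.T))
    return mean, np.clip(var, 0.0, None)


def select_cases(x_pool, simulate, strategy, n_init=3, n_steps=10, seed=0):
    """Label n_init random cases, then add n_steps more with the given strategy."""
    rng = np.random.default_rng(seed)
    chosen = [int(i) for i in rng.choice(len(x_pool), size=n_init, replace=False)]
    targets = [simulate(x_pool[i]) for i in chosen]
    for _ in range(n_steps):
        remaining = [i for i in range(len(x_pool)) if i not in chosen]
        if strategy == "active":
            # Query the case whose prediction is currently most uncertain.
            _, var = gp_posterior(x_pool[chosen], np.array(targets), x_pool[remaining])
            pick = remaining[int(np.argmax(var))]
        else:
            # Baseline: query a uniformly random unlabelled case.
            pick = int(rng.choice(remaining))
        chosen.append(pick)
        targets.append(simulate(x_pool[pick]))
    return chosen, np.array(targets)


# Toy 1D pool and a synthetic stand-in for the expensive simulation (hypothetical).
x_pool = np.linspace(0.0, 1.0, 50).reshape(-1, 1)
y_true = np.sin(6.0 * x_pool[:, 0])


def simulate(x):
    return float(np.sin(6.0 * x[0]))


rmse = {}
for strategy in ("random", "active"):
    chosen, targets = select_cases(x_pool, simulate, strategy)
    mean, _ = gp_posterior(x_pool[chosen], targets, x_pool)
    rmse[strategy] = float(np.sqrt(np.mean((mean - y_true) ** 2)))
    print(f"{strategy}: RMSE over the pool = {rmse[strategy]:.4f}")
```

Both strategies share the same loop and differ only in how the next case is picked, which is what makes their error curves comparable. The notebooks may use a different kernel, fitted hyperparameters or another selection criterion.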

   ### Case definition

   cases.csv defines the (current, wave) pair that corresponds to each loading case. currents.csv contains the parameters that characterize each current, namely the 2D velocity vector at each water depth, and waves.csv contains the parameters of the JONSWAP wave model for each wave. cardinal_directions.csv is an auxiliary file used to convert the data in currents.csv to a more amenable format.
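As an illustration of the kind of conversion an auxiliary direction table makes possible, the sketch below turns a (speed, compass label) pair into a 2D velocity vector. The `BEARINGS` table, the tuple layout of `profile` and the convention that a label names the direction the current flows toward are all assumptions; the actual contents of cardinal_directions.csv and currents.csv may differ.

```python
import math

# Hypothetical stand-in for cardinal_directions.csv:
# compass label -> bearing in degrees, measured clockwise from north.
BEARINGS = {
    "N": 0.0, "NE": 45.0, "E": 90.0, "SE": 135.0,
    "S": 180.0, "SW": 225.0, "W": 270.0, "NW": 315.0,
}


def velocity_components(speed, direction):
    """Return the (east, north) components for a speed and a compass label."""
    theta = math.radians(BEARINGS[direction])
    return speed * math.sin(theta), speed * math.cos(theta)


# One made-up current profile: (depth in m, speed in m/s, direction) per layer.
profile = [(0.0, 1.2, "NE"), (100.0, 0.6, "E"), (500.0, 0.2, "S")]

# Flatten the profile into one numeric feature vector: east, north per depth.
current_features = [
    component
    for _, speed, direction in profile
    for component in velocity_components(speed, direction)
]
print(current_features)
```

Encoding direction as two velocity components avoids the discontinuity of an angle at 0/360 degrees, which tends to suit distance-based models such as Gaussian processes.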

   config0.csv, config1.csv ... config5.csv contain the results of the simulations, i.e. axial tension and DNVUF201 CLC, for 6 different riser configurations. The results are given for each of the loading cases listed in cases.csv and are, ultimately, the targets for the machine learning model.

   - cases.csv
   - cardinal_directions.csv
   - currents.csv
   - waves.csv
   - config0.csv ... config5.csv

   ### Results data
   The results of the analysis are stored in separate folders for convenience.

   - data/ : processed deviation measures for ML predictions
   - plots/ : plotted graphs analyzing data/ files
   - results/ : case-by-case data of ML prediction

   ### Intermediary files

   Intermediary files are produced by the Spreadsheet aggregation notebook, which formats the data into features amenable to machine learning processing.

   - cases_full.csv
   - cases_full.xlsx