Overview

Dataset statistics

Number of variables: 8
Number of observations: 201
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 13.3 KiB
Average record size in memory: 67.7 B

Variable types

Categorical: 5
Text: 2
Numeric: 1

Dataset

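Description: Selection status and main activities of the citizen observer group, which is run to secure public trust in nuclear power plant safety through transparent disclosure of information about nuclear power plants currently under construction.
URL: https://www.data.go.kr/data/15101053/fileData.do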

Alerts

주요활동내역 is highly overall correlated with 시행년도 and 1 other field (High correlation)
시행년도 is highly overall correlated with 기수 and 1 other field (High correlation)
기수 is highly overall correlated with 시행년도 and 1 other field (High correlation)

Reproduction

Analysis started: 2023-12-12 04:00:46.517138
Analysis finished: 2023-12-12 04:00:47.330386
Duration: 0.81 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시행년도 (implementation year)
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

2019: 51
2022: 50
2018: 40
2020: 30
2021: 30

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2018
2nd row: 2018
3rd row: 2018
4th row: 2018
5th row: 2018

Common Values

Value  Count  Frequency (%)
2019   51     25.4%
2022   50     24.9%
2018   40     19.9%
2020   30     14.9%
2021   30     14.9%
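
The Frequency (%) column is each count divided by the 201 observations, rounded to one decimal place. A minimal sketch of that arithmetic, using only the counts listed for 시행년도 (the helper name `frequency_table` is illustrative and not part of ydata-profiling):

```python
from collections import Counter

# Counts per 시행년도 as listed in the Common Values table.
year_counts = Counter({"2019": 51, "2022": 50, "2018": 40, "2020": 30, "2021": 30})


def frequency_table(counts):
    """Return (value, count, percent) rows sorted by descending count."""
    total = sum(counts.values())
    return [(value, n, round(100 * n / total, 1)) for value, n in counts.most_common()]


rows = frequency_table(year_counts)
for value, n, pct in rows:
    print(f"{value}  {n:>3}  {pct:.1f}%")
```

The same calculation applies to the other categorical variables in this report, since every column has 201 non-missing values.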

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the value counts listed under Common Values]

기수 (cohort number)
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

2: 51
5: 50
1: 40
3: 30
4: 30

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value  Count  Frequency (%)
2      51     25.4%
5      50     24.9%
1      40     19.9%
3      30     14.9%
4      30     14.9%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the value counts listed under Common Values]

성명 (name)
Text

Distinct: 180
Distinct (%): 89.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 3
Median length: 3
Mean length: 2.9701493
Min length: 2

Characters and Unicode

Total characters: 597
Distinct characters: 108
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
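
As an illustration of those character properties, the sketch below uses Python's standard `unicodedata` module on the five sample names shown in this section. It is an assumption that the profiler derives its categories the same way; the category codes themselves (`Lo` for Other Letter, `Po` for Other Punctuation, `Pd` for Dash Punctuation, `Nd` for Decimal Number) come from the Unicode Standard.

```python
import unicodedata
from collections import Counter

# The five sample values listed for 성명 (names are masked with '*').
sample_names = ["김*태", "조*현", "이*언", "장*원", "박*호"]


def category_counts(values):
    """Count Unicode general categories over every character in the values."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


counts = category_counts(sample_names)
print(counts)  # Hangul syllables fall under 'Lo', the mask character under 'Po'
```

This is consistent with the report listing exactly two categories for 성명: Other Letter for the Hangul syllables and Other Punctuation for the `*` mask.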

Unique

Unique: 163
Unique (%): 81.1%

Sample

1st row: 김*태
2nd row: 조*현
3rd row: 이*언
4th row: 장*원
5th row: 박*호

Value               Count  Frequency (%)
이*우               4      2.0%
김*원               3      1.5%
김*래               3      1.5%
김*섭               2      1.0%
이*호               2      1.0%
정*현               2      1.0%
이*영               2      1.0%
김*자               2      1.0%
윤*자               2      1.0%
박*규               2      1.0%
Other values (170)  177    88.1%

Most occurring characters

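Value              Count  Frequency (%)
*                  201    33.7%
[character lost]   37     6.2%
[character lost]   27     4.5%
[character lost]   13     2.2%
[character lost]   12     2.0%
[character lost]   12     2.0%
[character lost]   11     1.8%
[character lost]   9      1.5%
[character lost]   8      1.3%
[character lost]   8      1.3%
Other values (98)  259    43.4%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       396    66.3%
Other Punctuation  201    33.7%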

Most frequent character per category

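Other Letter
Value              Count  Frequency (%)
[character lost]   37     9.3%
[character lost]   27     6.8%
[character lost]   13     3.3%
[character lost]   12     3.0%
[character lost]   12     3.0%
[character lost]   11     2.8%
[character lost]   9      2.3%
[character lost]   8      2.0%
[character lost]   8      2.0%
[character lost]   8      2.0%
Other values (97)  251    63.4%

Other Punctuation
Value  Count  Frequency (%)
*      201    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  396    66.3%
Common  201    33.7%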

Most frequent character per script

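Hangul
Value              Count  Frequency (%)
[character lost]   37     9.3%
[character lost]   27     6.8%
[character lost]   13     3.3%
[character lost]   12     3.0%
[character lost]   12     3.0%
[character lost]   11     2.8%
[character lost]   9      2.3%
[character lost]   8      2.0%
[character lost]   8      2.0%
[character lost]   8      2.0%
Other values (97)  251    63.4%

Common
Value  Count  Frequency (%)
*      201    100.0%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  396    66.3%
ASCII   201    33.7%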

Most frequent character per block

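ASCII
Value  Count  Frequency (%)
*      201    100.0%

Hangul
Value              Count  Frequency (%)
[character lost]   37     9.3%
[character lost]   27     6.8%
[character lost]   13     3.3%
[character lost]   12     3.0%
[character lost]   12     3.0%
[character lost]   11     2.8%
[character lost]   9      2.3%
[character lost]   8      2.0%
[character lost]   8      2.0%
[character lost]   8      2.0%
Other values (97)  251    63.4%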

성별 (gender; 남성 = male, 여성 = female)
Categorical

Distinct: 2
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

남성: 143
여성: 58

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 남성
2nd row: 남성
3rd row: 남성
4th row: 남성
5th row: 남성

Common Values

Value  Count  Frequency (%)
남성   143    71.1%
여성   58     28.9%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the value counts listed under Common Values]

출생년도 (birth year)
Real number (ℝ)

Distinct: 58
Distinct (%): 28.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1974.6318
Minimum: 1935
Maximum: 2003
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 1935
5-th percentile: 1949
Q1: 1958
Median: 1971
Q3: 1995
95-th percentile: 1999
Maximum: 2003
Range: 68
Interquartile range (IQR): 37

Descriptive statistics

Standard deviation: 18.455454
Coefficient of variation (CV): 0.0093462759
Kurtosis: -1.5007783
Mean: 1974.6318
Median Absolute Deviation (MAD): 17
Skewness: 0.082410783
Sum: 396901
Variance: 340.60378
Monotonicity: Not monotonic
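
Several of these figures follow from one another: the range and IQR are differences of quantiles, the CV is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below re-derives them from the rounded values printed in this report, so agreement is only expected up to rounding; the variable names are illustrative.

```python
from math import isclose

# Values copied from the 출생년도 statistics in this report (already rounded).
n = 201
mean = 1974.6318
std = 18.455454
minimum, q1, q3, maximum = 1935, 1958, 1995, 2003

value_range = maximum - minimum  # reported Range
iqr = q3 - q1                    # reported Interquartile range (IQR)
cv = std / mean                  # reported Coefficient of variation (CV)
variance = std ** 2              # reported Variance
total = mean * n                 # reported Sum, up to rounding of the mean

print(value_range, iqr, cv, variance, round(total))
print(isclose(cv, 0.0093462759, rel_tol=1e-6))
```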

[Figure: histogram with fixed size bins (bins=50)]

Value              Count  Frequency (%)
1998               12     6.0%
1996               9      4.5%
1961               9      4.5%
1958               9      4.5%
1999               9      4.5%
1997               8      4.0%
1956               7      3.5%
1960               7      3.5%
1995               6      3.0%
1954               6      3.0%
Other values (48)  119    59.2%

Minimum 10 values

Value  Count  Frequency (%)
1935   1      0.5%
1944   2      1.0%
1946   1      0.5%
1947   1      0.5%
1948   4      2.0%
1949   2      1.0%
1950   2      1.0%
1951   2      1.0%
1952   6      3.0%
1953   4      2.0%

Maximum 10 values

Value  Count  Frequency (%)
2003   2      1.0%
2002   1      0.5%
2001   2      1.0%
2000   5      2.5%
1999   9      4.5%
1998   12     6.0%
1997   8      4.0%
1996   9      4.5%
1995   6      3.0%
1994   4      2.0%

연락처 (contact number)
Text

Distinct: 195
Distinct (%): 97.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 13
Median length: 13
Mean length: 13
Min length: 13

Characters and Unicode

Total characters: 2613
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 189
Unique (%): 94.0%

Sample

1st row: 010-****-0644
2nd row: 010-****-6901
3rd row: 010-****-4552
4th row: 010-****-1171
5th row: 010-****-2002

Value               Count  Frequency (%)
010-****-8637       2      1.0%
010-****-9239       2      1.0%
010-****-6080       2      1.0%
010-****-9923       2      1.0%
010-****-7395       2      1.0%
010-****-7057       2      1.0%
010-****-4410       1      0.5%
010-****-3371       1      0.5%
010-****-9420       1      0.5%
010-****-4365       1      0.5%
Other values (185)  185    92.0%

Most occurring characters

Value             Count  Frequency (%)
*                 804    30.8%
0                 483    18.5%
-                 402    15.4%
1                 278    10.6%
7                 92     3.5%
8                 85     3.3%
9                 84     3.2%
3                 83     3.2%
4                 81     3.1%
2                 77     2.9%
Other values (2)  144    5.5%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     1407   53.8%
Other Punctuation  804    30.8%
Dash Punctuation   402    15.4%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      483    34.3%
1      278    19.8%
7      92     6.5%
8      85     6.0%
9      84     6.0%
3      83     5.9%
4      81     5.8%
2      77     5.5%
6      76     5.4%
5      68     4.8%

Other Punctuation
Value  Count  Frequency (%)
*      804    100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      402    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  2613   100.0%

Most frequent character per script

Common
Value             Count  Frequency (%)
*                 804    30.8%
0                 483    18.5%
-                 402    15.4%
1                 278    10.6%
7                 92     3.5%
8                 85     3.3%
9                 84     3.2%
3                 83     3.2%
4                 81     3.1%
2                 77     2.9%
Other values (2)  144    5.5%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  2613   100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
*                 804    30.8%
0                 483    18.5%
-                 402    15.4%
1                 278    10.6%
7                 92     3.5%
8                 85     3.3%
9                 84     3.2%
3                 83     3.2%
4                 81     3.1%
2                 77     2.9%
Other values (2)  144    5.5%

지역 (region)
Categorical

Distinct: 14
Distinct (%): 7.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

부산: 75
울산: 62
서울: 19
경남: 17
경기: 13
Other values (9): 15

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 4
Unique (%): 2.0%

Sample

1st row: 서울
2nd row: 경남
3rd row: 울산
4th row: 부산
5th row: 울산

Common Values

Value             Count  Frequency (%)
부산              75     37.3%
울산              62     30.8%
서울              19     9.5%
경남              17     8.5%
경기              13     6.5%
경북              3      1.5%
전북              2      1.0%
충남              2      1.0%
세종              2      1.0%
대구              2      1.0%
Other values (4)  4      2.0%

Length

[Figure: histogram of lengths of the category]

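주요활동내역 (main activities)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

안전체험장 교육/건설현장 안전점검/주기기 제작공정참관/원자력발전소 주제어실 견학 등 (safety experience center training / construction site safety inspection / observation of main equipment manufacturing / nuclear power plant main control room tour, etc.): 151
안전체험장 교육/건설현장 안전점검/원자력발전소 주제어실 견학/시운전시험 참관 등 (safety experience center training / construction site safety inspection / nuclear power plant main control room tour / observation of commissioning tests, etc.): 50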

Length

Max length: 46
Median length: 46
Mean length: 45.502488
Min length: 44

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 안전체험장 교육/건설현장 안전점검/주기기 제작공정참관/원자력발전소 주제어실 견학 등
2nd row: 안전체험장 교육/건설현장 안전점검/주기기 제작공정참관/원자력발전소 주제어실 견학 등
3rd row: 안전체험장 교육/건설현장 안전점검/주기기 제작공정참관/원자력발전소 주제어실 견학 등
4th row: 안전체험장 교육/건설현장 안전점검/주기기 제작공정참관/원자력발전소 주제어실 견학 등
5th row: 안전체험장 교육/건설현장 안전점검/주기기 제작공정참관/원자력발전소 주제어실 견학 등

Common Values

Value  Count  Frequency (%)
안전체험장 교육/건설현장 안전점검/주기기 제작공정참관/원자력발전소 주제어실 견학 등  151  75.1%
안전체험장 교육/건설현장 안전점검/원자력발전소 주제어실 견학/시운전시험 참관 등  50  24.9%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of common values]

Value                      Count  Frequency (%)
안전체험장                 201    14.3%
교육/건설현장              201    14.3%
주제어실                   201    14.3%
등                         201    14.3%
안전점검/주기기            151    10.7%
제작공정참관/원자력발전소  151    10.7%
견학                       151    10.7%
안전점검/원자력발전소      50     3.6%
견학/시운전시험            50     3.6%
참관                       50     3.6%
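
The word-level counts for 주요활동내역 look like the result of splitting each value on whitespace and weighting every token by how many rows carry that value. A minimal sketch under that assumption, using the two distinct values and their row counts from this section (`token_counts` is an illustrative helper, not a ydata-profiling function):

```python
from collections import Counter

# The two distinct 주요활동내역 values and their row counts from this report.
activity_counts = {
    "안전체험장 교육/건설현장 안전점검/주기기 제작공정참관/원자력발전소 주제어실 견학 등": 151,
    "안전체험장 교육/건설현장 안전점검/원자력발전소 주제어실 견학/시운전시험 참관 등": 50,
}


def token_counts(value_counts):
    """Split each value on whitespace and weight tokens by the row count."""
    tokens = Counter()
    for text, rows in value_counts.items():
        for token in text.split():
            tokens[token] += rows
    return tokens


tokens = token_counts(activity_counts)
total_tokens = sum(tokens.values())
mean_length = sum(len(t) * n for t, n in activity_counts.items()) / sum(activity_counts.values())

for token, n in tokens.most_common():
    print(f"{token}  {n}  {100 * n / total_tokens:.1f}%")
```

Tokens shared by both values reach 201, while tokens unique to one value keep that value's row count (151 or 50). The same inputs also give the mean length reported for this variable.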

Interactions

[Figure: interactions plot]

Correlations

              시행년도  기수   성별   출생년도  지역   주요활동내역
시행년도      1.000     1.000  0.119  0.418     0.538  1.000
기수          1.000     1.000  0.119  0.418     0.538  1.000
성별          0.119     0.119  1.000  0.224     0.000  0.119
출생년도      0.418     0.418  0.224  1.000     0.397  0.314
지역          0.538     0.538  0.000  0.397     1.000  0.266
주요활동내역  1.000     1.000  0.119  0.314     0.266  1.000

              주요활동내역  성별   시행년도  지역   기수
주요활동내역  1.000         0.076  0.992     0.201  0.992
성별          0.076         1.000  0.145     0.000  0.145
시행년도      0.992         0.145  1.000     0.309  1.000
지역          0.201         0.000  0.309     1.000  0.309
기수          0.992         0.145  1.000     0.309  1.000

              출생년도  시행년도  기수   성별   지역   주요활동내역
출생년도      1.000     0.251     0.251  0.213  0.160  0.302
시행년도      0.251     1.000     1.000  0.145  0.309  0.992
기수          0.251     1.000     1.000  0.145  0.309  0.992
성별          0.213     0.145     0.145  1.000  0.000  0.076
지역          0.160     0.309     0.309  0.000  1.000  0.201
주요활동내역  0.302     0.992     0.992  0.076  0.201  1.000
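
The value of 1.000 between 시행년도 and 기수 is what one would expect if each year corresponds to exactly one cohort, which their identical count profiles (40, 51, 30, 30, 50) suggest. The sketch below computes a plain, uncorrected Cramér's V in pure Python on data rebuilt under that assumption. It is not necessarily the measure ydata-profiling used for these matrices (the profiler may apply a bias correction or a different statistic), and the reconstructed pairing of years to cohorts is an assumption rather than something read from the raw file.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    for x, row_n in row_totals.items():
        for y, col_n in col_totals.items():
            expected = row_n * col_n / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return sqrt(chi2 / (n * k))


# Assumed one-to-one pairing of year and cohort, sized by the reported counts.
sizes = [40, 51, 30, 30, 50]
years = [y for y, n in zip(["2018", "2019", "2020", "2021", "2022"], sizes) for _ in range(n)]
cohorts = [c for c, n in zip(["1", "2", "3", "4", "5"], sizes) for _ in range(n)]

v_year_cohort = cramers_v(years, cohorts)
print(round(v_year_cohort, 3))
```

When two columns are this tightly linked, keeping only one of them for downstream modelling is usually enough.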

Missing values

[Figure: count] A simple visualization of nullity by column.
[Figure: matrix] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index  시행년도  기수  성명   성별  출생년도  연락처         지역
0      2018      1     김*태  남성  1954      010-****-0644  서울
1      2018      1     조*현  남성  1991      010-****-6901  경남
2      2018      1     이*언  남성  1974      010-****-4552  울산
3      2018      1     장*원  남성  1989      010-****-1171  부산
4      2018      1     박*호  남성  1958      010-****-2002  울산
5      2018      1     양*우  남성  1952      010-****-7133  부산
6      2018      1     조*준  남성  1958      010-****-4029  경기
7      2018      1     김*지  여성  1998      010-****-3949  경기
8      2018      1     김*호  남성  1954      010-****-0891  경기
9      2018      1     이*    남성  1993      010-****-2743  경남

주요활동내역 (all ten rows): 안전체험장 교육/건설현장 안전점검/주기기 제작공정참관/원자력발전소 주제어실 견학 등

Last rows

Index  시행년도  기수  성명   성별  출생년도  연락처         지역
191    2022      5     정*현  남성  1996      010-****-4718  울산
192    2022      5     정*    남성  1982      010-****-5728  부산
193    2022      5     정*재  남성  1993      010-****-4807  울산
194    2022      5     정*나  여성  1998      010-****-2819  울산
195    2022      5     조*철  남성  1959      010-****-9009  울산
196    2022      5     최*훈  남성  1996      010-****-6115  경남
197    2022      5     최*종  남성  1944      010-****-9395  부산
198    2022      5     하*주  여성  1961      010-****-8182  울산
199    2022      5     홍*원  남성  1999      010-****-4728  부산
200    2022      5     홍*현  남성  1998      010-****-2035  부산

주요활동내역 (all ten rows): 안전체험장 교육/건설현장 안전점검/원자력발전소 주제어실 견학/시운전시험 참관 등