Overview

Dataset statistics

Number of variables5
Number of observations57
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 KiB
Average record size in memory43.3 B

Variable types

Text2
Categorical2
Numeric1

Dataset

Description울산광역시 남구 이재민수용시설 현황에 대한 데이터로 시설명, 시설유형구분, 소재지도로명주소, 수용가능인원(명), 내진설계 항목을 제공합니다.
Author울산광역시 남구
URLhttps://www.data.go.kr/data/3076222/fileData.do

Alerts

내진설계 is highly imbalanced (63.3%)Imbalance
시설명 has unique valuesUnique
소재지도로명주소 has unique valuesUnique

Reproduction

Analysis started2023-12-11 22:53:43.672628
Analysis finished2023-12-11 22:53:44.144815
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시설명
Text

UNIQUE 

Distinct57
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size588.0 B
2023-12-12T07:53:44.345441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length6.245614
Min length4

Characters and Unicode

Total characters356
Distinct characters79
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)100.0%

Sample

1st row신정1동주민센터
2nd row신정초등학교
3rd row대흥교회
4th row울산농업인회관
5th row울산서여자중학교
ValueCountFrequency (%)
신정1동주민센터 1
 
1.8%
신복복지관 1
 
1.8%
문수실버복지관 1
 
1.8%
신복초등학교 1
 
1.8%
옥산초등학교 1
 
1.8%
옥현중학교 1
 
1.8%
월계초등학교 1
 
1.8%
테크노두왕경로당 1
 
1.8%
격동초등학교 1
 
1.8%
옥동중학교 1
 
1.8%
Other values (47) 47
82.5%
2023-12-12T07:53:44.722813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
44
 
12.4%
43
 
12.1%
31
 
8.7%
27
 
7.6%
13
 
3.7%
10
 
2.8%
9
 
2.5%
9
 
2.5%
7
 
2.0%
7
 
2.0%
Other values (69) 156
43.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 354
99.4%
Decimal Number 2
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
12.4%
43
 
12.1%
31
 
8.8%
27
 
7.6%
13
 
3.7%
10
 
2.8%
9
 
2.5%
9
 
2.5%
7
 
2.0%
7
 
2.0%
Other values (67) 154
43.5%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
4 1
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 354
99.4%
Common 2
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
12.4%
43
 
12.1%
31
 
8.8%
27
 
7.6%
13
 
3.7%
10
 
2.8%
9
 
2.5%
9
 
2.5%
7
 
2.0%
7
 
2.0%
Other values (67) 154
43.5%
Common
ValueCountFrequency (%)
1 1
50.0%
4 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 354
99.4%
ASCII 2
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
44
 
12.4%
43
 
12.1%
31
 
8.8%
27
 
7.6%
13
 
3.7%
10
 
2.8%
9
 
2.5%
9
 
2.5%
7
 
2.0%
7
 
2.0%
Other values (67) 154
43.5%
ASCII
ValueCountFrequency (%)
1 1
50.0%
4 1
50.0%
Distinct10
Distinct (%)17.5%
Missing0
Missing (%)0.0%
Memory size588.0 B
체육관
26 
다목적강당
17 
본관
강당
대회의실
 
1
Other values (5)

Length

Max length7
Median length5
Mean length3.5789474
Min length2

Unique

Unique6 ?
Unique (%)10.5%

Sample

1st row강당
2nd row체육관
3rd row본관
4th row본관
5th row다목적강당

Common Values

ValueCountFrequency (%)
체육관 26
45.6%
다목적강당 17
29.8%
본관 5
 
8.8%
강당 3
 
5.3%
대회의실 1
 
1.8%
대강당/체육관 1
 
1.8%
경로당 1
 
1.8%
다목적실 1
 
1.8%
대강당 1
 
1.8%
다목적홀 1
 
1.8%

Length

2023-12-12T07:53:44.856102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T07:53:44.976861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
체육관 26
45.6%
다목적강당 17
29.8%
본관 5
 
8.8%
강당 3
 
5.3%
대회의실 1
 
1.8%
대강당/체육관 1
 
1.8%
경로당 1
 
1.8%
다목적실 1
 
1.8%
대강당 1
 
1.8%
다목적홀 1
 
1.8%
Distinct57
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size588.0 B
2023-12-12T07:53:45.226052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length20
Mean length18.105263
Min length14

Characters and Unicode

Total characters1032
Distinct characters69
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)100.0%

Sample

1st row울산광역시 남구 월평로9번길 7
2nd row울산광역시 남구 봉월로67번길 16
3rd row울산광역시 남구 문수로 461
4th row울산광역시 남구 돋질로 16
5th row울산광역시 남구 문수로 432
ValueCountFrequency (%)
울산광역시 57
25.0%
남구 57
25.0%
16 4
 
1.8%
문수로 4
 
1.8%
4 3
 
1.3%
중앙로 2
 
0.9%
18 2
 
0.9%
11 2
 
0.9%
25 2
 
0.9%
19 2
 
0.9%
Other values (86) 93
40.8%
2023-12-12T07:53:45.602095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
171
16.6%
63
 
6.1%
59
 
5.7%
58
 
5.6%
57
 
5.5%
57
 
5.5%
57
 
5.5%
57
 
5.5%
56
 
5.4%
1 51
 
4.9%
Other values (59) 346
33.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 650
63.0%
Decimal Number 205
 
19.9%
Space Separator 171
 
16.6%
Dash Punctuation 6
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
63
9.7%
59
9.1%
58
8.9%
57
8.8%
57
8.8%
57
8.8%
57
8.8%
56
8.6%
35
 
5.4%
35
 
5.4%
Other values (47) 116
17.8%
Decimal Number
ValueCountFrequency (%)
1 51
24.9%
4 23
11.2%
5 23
11.2%
2 23
11.2%
6 20
 
9.8%
3 18
 
8.8%
7 15
 
7.3%
9 13
 
6.3%
0 10
 
4.9%
8 9
 
4.4%
Space Separator
ValueCountFrequency (%)
171
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 650
63.0%
Common 382
37.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
63
9.7%
59
9.1%
58
8.9%
57
8.8%
57
8.8%
57
8.8%
57
8.8%
56
8.6%
35
 
5.4%
35
 
5.4%
Other values (47) 116
17.8%
Common
ValueCountFrequency (%)
171
44.8%
1 51
 
13.4%
4 23
 
6.0%
5 23
 
6.0%
2 23
 
6.0%
6 20
 
5.2%
3 18
 
4.7%
7 15
 
3.9%
9 13
 
3.4%
0 10
 
2.6%
Other values (2) 15
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 650
63.0%
ASCII 382
37.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
171
44.8%
1 51
 
13.4%
4 23
 
6.0%
5 23
 
6.0%
2 23
 
6.0%
6 20
 
5.2%
3 18
 
4.7%
7 15
 
3.9%
9 13
 
3.4%
0 10
 
2.6%
Other values (2) 15
 
3.9%
Hangul
ValueCountFrequency (%)
63
9.7%
59
9.1%
58
8.9%
57
8.8%
57
8.8%
57
8.8%
57
8.8%
56
8.6%
35
 
5.4%
35
 
5.4%
Other values (47) 116
17.8%

수용가능인원(명)
Real number (ℝ)

Distinct55
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean319.57895
Minimum41
Maximum1399
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size645.0 B
2023-12-12T07:53:45.732161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum41
5-th percentile89.6
Q1205
median272
Q3361
95-th percentile614.6
Maximum1399
Range1358
Interquartile range (IQR)156

Descriptive statistics

Standard deviation237.84171
Coefficient of variation (CV)0.74423458
Kurtosis11.061673
Mean319.57895
Median Absolute Deviation (MAD)75
Skewness2.9410641
Sum18216
Variance56568.677
MonotonicityNot monotonic
2023-12-12T07:53:45.899957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
316 2
 
3.5%
256 2
 
3.5%
88 1
 
1.8%
46 1
 
1.8%
197 1
 
1.8%
536 1
 
1.8%
473 1
 
1.8%
129 1
 
1.8%
41 1
 
1.8%
633 1
 
1.8%
Other values (45) 45
78.9%
ValueCountFrequency (%)
41 1
1.8%
46 1
1.8%
88 1
1.8%
90 1
1.8%
91 1
1.8%
99 1
1.8%
129 1
1.8%
154 1
1.8%
158 1
1.8%
163 1
1.8%
ValueCountFrequency (%)
1399 1
1.8%
1288 1
1.8%
633 1
1.8%
610 1
1.8%
576 1
1.8%
536 1
1.8%
506 1
1.8%
473 1
1.8%
469 1
1.8%
429 1
1.8%

내진설계
Categorical

IMBALANCE 

Distinct2
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size588.0 B
적용
53 
미적용
 
4

Length

Max length3
Median length2
Mean length2.0701754
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row적용
2nd row적용
3rd row적용
4th row적용
5th row적용

Common Values

ValueCountFrequency (%)
적용 53
93.0%
미적용 4
 
7.0%

Length

2023-12-12T07:53:46.024617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T07:53:46.136234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
적용 53
93.0%
미적용 4
 
7.0%

Interactions

2023-12-12T07:53:43.902618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T07:53:46.211666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설명시설유형구분소재지도로명주소수용가능인원(명)내진설계
시설명1.0001.0001.0001.0001.000
시설유형구분1.0001.0001.0000.0000.000
소재지도로명주소1.0001.0001.0001.0001.000
수용가능인원(명)1.0000.0001.0001.0000.269
내진설계1.0000.0001.0000.2691.000
2023-12-12T07:53:46.301158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설유형구분내진설계
시설유형구분1.0000.000
내진설계0.0001.000
2023-12-12T07:53:46.675448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수용가능인원(명)시설유형구분내진설계
수용가능인원(명)1.0000.0000.182
시설유형구분0.0001.0000.000
내진설계0.1820.0001.000

Missing values

2023-12-12T07:53:44.011846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T07:53:44.106374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시설명시설유형구분소재지도로명주소수용가능인원(명)내진설계
0신정1동주민센터강당울산광역시 남구 월평로9번길 788적용
1신정초등학교체육관울산광역시 남구 봉월로67번길 16429적용
2대흥교회본관울산광역시 남구 문수로 461308적용
3울산농업인회관본관울산광역시 남구 돋질로 16205적용
4울산서여자중학교다목적강당울산광역시 남구 문수로 432233적용
5학성중학교체육관울산광역시 남구 문수로 426385적용
6울산중앙초등학교체육관울산광역시 남구 돋질로91번길 34288적용
7월평중학교체육관울산광역시 남구 중앙로204번길 31407적용
8신정4동주민센터대회의실울산광역시 남구 수암로64번길 16174적용
9수암초등학교체육관울산광역시 남구 중앙로 47312적용
시설명시설유형구분소재지도로명주소수용가능인원(명)내진설계
47야음초등학교체육관울산광역시 남구 산업로355번길 111229적용
48수암동주민센터다목적실울산광역시 남구 중앙로 32158적용
49울산남부초등학교체육관울산광역시 남구 야음로 11610미적용
50선암호수노인복지관대강당울산광역시 남구 선암호수길 15090적용
51개운초등학교체육관울산광역시 남구 두왕로106번길 28-6264적용
52선암초등학교체육관울산광역시 남구 산업로325번길 4240적용
53장생포복지문화센터다목적강당울산광역시 남구 장생포고래로 215154적용
54남구국민체육센터다목적홀울산광역시 남구 여천로 38242적용
55장생포초등학교체육관울산광역시 남구 장생포고래로125번길 5277적용
56태화중학교체육관울산광역시 남구 화합로71번길 24196적용