Overview

Dataset statistics

Number of variables6
Number of observations378
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory18.6 KiB
Average record size in memory50.3 B

Variable types

Numeric2
Categorical3
Text1

Dataset

Description울산광역시 구군별(남구, 울주군) 제설함 위치 정보(제설함 설치구역, 위치, 제설함 수 등)를 제공하고 있습니다.
Author울산광역시
URLhttps://www.data.go.kr/data/15091260/fileData.do

Alerts

시도 has constant value ""Constant
제설함설치구역 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
시군구 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
연번 is highly overall correlated with 시군구 and 1 other fieldsHigh correlation
제설함 수 is highly overall correlated with 시군구 and 1 other fieldsHigh correlation
시군구 is highly imbalanced (71.2%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:25:18.087686
Analysis finished2023-12-12 06:25:18.999665
Duration0.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct378
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean189.5
Minimum1
Maximum378
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.5 KiB
2023-12-12T15:25:19.081732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile19.85
Q195.25
median189.5
Q3283.75
95-th percentile359.15
Maximum378
Range377
Interquartile range (IQR)188.5

Descriptive statistics

Standard deviation109.26344
Coefficient of variation (CV)0.57658809
Kurtosis-1.2
Mean189.5
Median Absolute Deviation (MAD)94.5
Skewness0
Sum71631
Variance11938.5
MonotonicityStrictly increasing
2023-12-12T15:25:19.275965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
250 1
 
0.3%
259 1
 
0.3%
258 1
 
0.3%
257 1
 
0.3%
256 1
 
0.3%
255 1
 
0.3%
254 1
 
0.3%
253 1
 
0.3%
252 1
 
0.3%
Other values (368) 368
97.4%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
378 1
0.3%
377 1
0.3%
376 1
0.3%
375 1
0.3%
374 1
0.3%
373 1
0.3%
372 1
0.3%
371 1
0.3%
370 1
0.3%
369 1
0.3%

시도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
울산광역시
378 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row울산광역시
2nd row울산광역시
3rd row울산광역시
4th row울산광역시
5th row울산광역시

Common Values

ValueCountFrequency (%)
울산광역시 378
100.0%

Length

2023-12-12T15:25:19.456891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:25:19.592632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
울산광역시 378
100.0%

시군구
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
남구
359 
울주군
 
19

Length

Max length3
Median length2
Mean length2.0502646
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남구
2nd row남구
3rd row남구
4th row남구
5th row남구

Common Values

ValueCountFrequency (%)
남구 359
95.0%
울주군 19
 
5.0%

Length

2023-12-12T15:25:19.713926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:25:19.828622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남구 359
95.0%
울주군 19
 
5.0%

제설함설치구역
Categorical

HIGH CORRELATION 

Distinct41
Distinct (%)10.8%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
야음장생포동
62 
옥동
44 
무거동
38 
수암동
35 
선암동
33 
Other values (36)
166 

Length

Max length21
Median length15
Mean length4.0132275
Min length2

Unique

Unique23 ?
Unique (%)6.1%

Sample

1st row남화동
2nd row남화동
3rd row남화동
4th row달동
5th row달동

Common Values

ValueCountFrequency (%)
야음장생포동 62
16.4%
옥동 44
11.6%
무거동 38
10.1%
수암동 35
9.3%
선암동 33
8.7%
신정2동 26
 
6.9%
신정4동 20
 
5.3%
여천동 19
 
5.0%
삼호동 17
 
4.5%
신정1동 16
 
4.2%
Other values (31) 68
18.0%

Length

2023-12-12T15:25:19.968128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
야음장생포동 62
14.2%
옥동 44
 
10.1%
무거동 38
 
8.7%
수암동 35
 
8.0%
선암동 33
 
7.6%
신정2동 26
 
5.9%
신정4동 20
 
4.6%
여천동 19
 
4.3%
삼호동 17
 
3.9%
신정1동 16
 
3.7%
Other values (60) 127
29.1%

위치
Text

Distinct369
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
2023-12-12T15:25:20.336188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length38
Mean length24.428571
Min length5

Characters and Unicode

Total characters9234
Distinct characters328
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique360 ?
Unique (%)95.2%

Sample

1st row울산광역시 남구 용잠로 566 가스산업공사 위
2nd row울산광역시 남구 용잠로 울산화력본부 입구
3rd row울산광역시 남구 삼산동 1656-2 산업로 경복궁 건너편
4th row울산광역시 남구 달동633-9(강남초등학교 정문 옆)
5th row울산광역시 남구 달동 629-5(달동 경로당 담벽)
ValueCountFrequency (%)
울산광역시 359
19.3%
남구 359
19.3%
56
 
3.0%
입구 34
 
1.8%
건너편 31
 
1.7%
문수로 30
 
1.6%
28
 
1.5%
산업로 28
 
1.5%
화합로 19
 
1.0%
두왕로 19
 
1.0%
Other values (573) 893
48.1%
2023-12-12T15:25:20.942963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1483
 
16.1%
449
 
4.9%
426
 
4.6%
405
 
4.4%
374
 
4.1%
371
 
4.0%
365
 
4.0%
362
 
3.9%
359
 
3.9%
1 204
 
2.2%
Other values (318) 4436
48.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6413
69.4%
Space Separator 1483
 
16.1%
Decimal Number 887
 
9.6%
Open Punctuation 149
 
1.6%
Close Punctuation 149
 
1.6%
Dash Punctuation 69
 
0.7%
Other Punctuation 31
 
0.3%
Uppercase Letter 29
 
0.3%
Math Symbol 22
 
0.2%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
449
 
7.0%
426
 
6.6%
405
 
6.3%
374
 
5.8%
371
 
5.8%
365
 
5.7%
362
 
5.6%
359
 
5.6%
114
 
1.8%
103
 
1.6%
Other values (289) 3085
48.1%
Uppercase Letter
ValueCountFrequency (%)
K 7
24.1%
S 5
17.2%
T 4
13.8%
C 3
10.3%
G 3
10.3%
L 2
 
6.9%
N 1
 
3.4%
E 1
 
3.4%
P 1
 
3.4%
R 1
 
3.4%
Decimal Number
ValueCountFrequency (%)
1 204
23.0%
2 128
14.4%
3 95
10.7%
4 79
 
8.9%
6 71
 
8.0%
8 67
 
7.6%
7 67
 
7.6%
9 61
 
6.9%
5 60
 
6.8%
0 55
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 20
64.5%
@ 11
35.5%
Space Separator
ValueCountFrequency (%)
1483
100.0%
Open Punctuation
ValueCountFrequency (%)
( 149
100.0%
Close Punctuation
ValueCountFrequency (%)
) 149
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 69
100.0%
Math Symbol
ValueCountFrequency (%)
~ 22
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6415
69.5%
Common 2790
30.2%
Latin 29
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
449
 
7.0%
426
 
6.6%
405
 
6.3%
374
 
5.8%
371
 
5.8%
365
 
5.7%
362
 
5.6%
359
 
5.6%
114
 
1.8%
103
 
1.6%
Other values (290) 3087
48.1%
Common
ValueCountFrequency (%)
1483
53.2%
1 204
 
7.3%
( 149
 
5.3%
) 149
 
5.3%
2 128
 
4.6%
3 95
 
3.4%
4 79
 
2.8%
6 71
 
2.5%
- 69
 
2.5%
8 67
 
2.4%
Other values (7) 296
 
10.6%
Latin
ValueCountFrequency (%)
K 7
24.1%
S 5
17.2%
T 4
13.8%
C 3
10.3%
G 3
10.3%
L 2
 
6.9%
N 1
 
3.4%
E 1
 
3.4%
P 1
 
3.4%
R 1
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6413
69.4%
ASCII 2819
30.5%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1483
52.6%
1 204
 
7.2%
( 149
 
5.3%
) 149
 
5.3%
2 128
 
4.5%
3 95
 
3.4%
4 79
 
2.8%
6 71
 
2.5%
- 69
 
2.4%
8 67
 
2.4%
Other values (18) 325
 
11.5%
Hangul
ValueCountFrequency (%)
449
 
7.0%
426
 
6.6%
405
 
6.3%
374
 
5.8%
371
 
5.8%
365
 
5.7%
362
 
5.6%
359
 
5.6%
114
 
1.8%
103
 
1.6%
Other values (289) 3085
48.1%
None
ValueCountFrequency (%)
2
100.0%

제설함 수
Real number (ℝ)

HIGH CORRELATION 

Distinct13
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.968254
Minimum1
Maximum66
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.5 KiB
2023-12-12T15:25:21.409467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile1.6
Maximum66
Range65
Interquartile range (IQR)0

Descriptive statistics

Standard deviation6.2425805
Coefficient of variation (CV)3.1716337
Kurtosis76.350511
Mean1.968254
Median Absolute Deviation (MAD)0
Skewness8.5048688
Sum744
Variance38.969812
MonotonicityNot monotonic
2023-12-12T15:25:21.530686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
1 359
95.0%
8 4
 
1.1%
12 3
 
0.8%
9 2
 
0.5%
6 2
 
0.5%
63 1
 
0.3%
7 1
 
0.3%
25 1
 
0.3%
51 1
 
0.3%
66 1
 
0.3%
Other values (3) 3
 
0.8%
ValueCountFrequency (%)
1 359
95.0%
5 1
 
0.3%
6 2
 
0.5%
7 1
 
0.3%
8 4
 
1.1%
9 2
 
0.5%
12 3
 
0.8%
15 1
 
0.3%
25 1
 
0.3%
51 1
 
0.3%
ValueCountFrequency (%)
66 1
 
0.3%
63 1
 
0.3%
55 1
 
0.3%
51 1
 
0.3%
25 1
 
0.3%
15 1
 
0.3%
12 3
0.8%
9 2
0.5%
8 4
1.1%
7 1
 
0.3%

Interactions

2023-12-12T15:25:18.610754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:18.409413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:18.710006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:18.512020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:25:21.629371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군구제설함설치구역제설함 수
연번1.0000.8460.9840.369
시군구0.8461.0001.0000.813
제설함설치구역0.9841.0001.0001.000
제설함 수0.3690.8131.0001.000
2023-12-12T15:25:21.738396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
제설함설치구역시군구
제설함설치구역1.0000.947
시군구0.9471.000
2023-12-12T15:25:21.833374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번제설함 수시군구제설함설치구역
연번1.0000.3780.6710.831
제설함 수0.3781.0000.8760.953
시군구0.6710.8761.0000.947
제설함설치구역0.8310.9530.9471.000

Missing values

2023-12-12T15:25:18.845963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:25:18.957951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시도시군구제설함설치구역위치제설함 수
01울산광역시남구남화동울산광역시 남구 용잠로 566 가스산업공사 위1
12울산광역시남구남화동울산광역시 남구 용잠로 울산화력본부 입구1
23울산광역시남구남화동울산광역시 남구 삼산동 1656-2 산업로 경복궁 건너편1
34울산광역시남구달동울산광역시 남구 달동633-9(강남초등학교 정문 옆)1
45울산광역시남구달동울산광역시 남구 달동 629-5(달동 경로당 담벽)1
56울산광역시남구대현동울산광역시 남구 수암로169번길 28, 울산감리교회 인근1
67울산광역시남구대현동울산광역시 남구 수암로 170, 신선여고 인근1
78울산광역시남구대현동울산광역시 남구 야음로 4, 신선아파트 관리실 옆1
89울산광역시남구대현동울산광역시 남구 수암로149번길 16-5, 고려맨션 앞1
910울산광역시남구대현동울산광역시 남구 수암로155번길 20-1,1
연번시도시군구제설함설치구역위치제설함 수
368369울산광역시울주군대암댐 앞도로군도 10호선6
369370울산광역시울주군웅촌 통천리 ~ 청량 동천리군도 18호선8
370371울산광역시울주군상북 지내리 ~ 언양 다개리군도 20호선5
371372울산광역시울주군온양 대안리 ~ 서생 화산리군도 22호선6
372373울산광역시울주군두동 삼정리 ~ 두동 구미리군도 28호선12
373374울산광역시울주군온양 발리 ~ 서생 화정리군도 33호선12
374375울산광역시울주군웅촌 대대리군도 36호선8
375376울산광역시울주군범서 중리 ~ 서사리농어촌 102호선8
376377울산광역시울주군상북 덕현리농어촌 206호선15
377378울산광역시울주군상북 이천, 범서 굴화, 두동 만화 등기타 노선55