Overview

Dataset statistics

Number of variables8
Number of observations1004
Missing cells0
Missing cells (%)0.0%
Duplicate rows181
Duplicate rows (%)18.0%
Total size in memory63.9 KiB
Average record size in memory65.1 B

Variable types

Categorical7
Numeric1

Dataset

Description경상남도 사천시 공간정보시스템의 신호등 자료입니다.(행정읍면동, 설치일자, 신호등종류, 신호등재질 등)
Author경상남도 사천시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15091549

Alerts

Dataset has 181 (18.0%) duplicate rowsDuplicates
설치일자 is highly overall correlated with 신호등규격 and 1 other fieldsHigh correlation
신호등재질 is highly overall correlated with 신호등종류 and 1 other fieldsHigh correlation
신호등규격 is highly overall correlated with 행정읍면동 and 4 other fieldsHigh correlation
행정읍면동 is highly overall correlated with 신호등규격 and 1 other fieldsHigh correlation
신호등종류 is highly overall correlated with 신호등재질 and 3 other fieldsHigh correlation
신호등구분 is highly overall correlated with 신호등종류 and 1 other fieldsHigh correlation
잔여시간표시기유무 is highly overall correlated with 행정읍면동 and 2 other fieldsHigh correlation
신호등재질 is highly imbalanced (87.7%)Imbalance

Reproduction

Analysis started2023-12-10 23:36:03.711371
Analysis finished2023-12-10 23:36:04.636862
Duration0.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정읍면동
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
용현면
165 
향촌동
137 
사천읍
106 
사남면
93 
정동면
80 
Other values (11)
423 

Length

Max length4
Median length3
Mean length3.0438247
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row향촌동
2nd row향촌동
3rd row향촌동
4th row향촌동
5th row향촌동

Common Values

ValueCountFrequency (%)
용현면 165
16.4%
향촌동 137
13.6%
사천읍 106
10.6%
사남면 93
9.3%
정동면 80
8.0%
곤양면 71
7.1%
벌용동 61
 
6.1%
동서동 58
 
5.8%
선구동 45
 
4.5%
동서금동 44
 
4.4%
Other values (6) 144
14.3%

Length

2023-12-11T08:36:04.715897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
용현면 165
16.4%
향촌동 137
13.6%
사천읍 106
10.6%
사남면 93
9.3%
정동면 80
8.0%
곤양면 71
7.1%
벌용동 61
 
6.1%
동서동 58
 
5.8%
선구동 45
 
4.5%
동서금동 44
 
4.4%
Other values (6) 144
14.3%

설치일자
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
1960-01-01
407 
<NA>
339 
2010-01-01
85 
2011-01-01
65 
1900-01-01
 
38
Other values (9)
70 

Length

Max length10
Median length10
Mean length7.9741036
Min length4

Unique

Unique4 ?
Unique (%)0.4%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
1960-01-01 407
40.5%
<NA> 339
33.8%
2010-01-01 85
 
8.5%
2011-01-01 65
 
6.5%
1900-01-01 38
 
3.8%
2004-02-24 27
 
2.7%
2008-01-01 16
 
1.6%
1999-05-20 12
 
1.2%
2002-05-14 9
 
0.9%
2000-01-27 2
 
0.2%
Other values (4) 4
 
0.4%

Length

2023-12-11T08:36:04.860102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1960-01-01 407
40.5%
na 339
33.8%
2010-01-01 85
 
8.5%
2011-01-01 65
 
6.5%
1900-01-01 38
 
3.8%
2004-02-24 27
 
2.7%
2008-01-01 16
 
1.6%
1999-05-20 12
 
1.2%
2002-05-14 9
 
0.9%
2000-01-27 2
 
0.2%
Other values (4) 4
 
0.4%

신호등종류
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
횡형 4 색등
332 
횡형 3 색등
221 
종형 2 색등
161 
횡형 2 색등
136 
종형 3 색등
109 
Other values (2)
45 

Length

Max length7
Median length7
Mean length6.811753
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종형 2 색등
2nd row종형 2 색등
3rd row종형 2 색등
4th row종형 3 색등
5th row종형 3 색등

Common Values

ValueCountFrequency (%)
횡형 4 색등 332
33.1%
횡형 3 색등 221
22.0%
종형 2 색등 161
16.0%
횡형 2 색등 136
13.5%
종형 3 색등 109
 
10.9%
기타 36
 
3.6%
속성나중입력 9
 
0.9%

Length

2023-12-11T08:36:05.001942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:36:05.138320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
색등 959
32.8%
횡형 689
23.6%
4 332
 
11.4%
3 330
 
11.3%
2 297
 
10.2%
종형 270
 
9.2%
기타 36
 
1.2%
속성나중입력 9
 
0.3%

신호등재질
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
강관
964 
강판
 
29
속성나중입력
 
9
주물형
 
1
스테인레스
 
1

Length

Max length6
Median length2
Mean length2.0398406
Min length2

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row강관
2nd row강관
3rd row강관
4th row강관
5th row강관

Common Values

ValueCountFrequency (%)
강관 964
96.0%
강판 29
 
2.9%
속성나중입력 9
 
0.9%
주물형 1
 
0.1%
스테인레스 1
 
0.1%

Length

2023-12-11T08:36:05.286751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:36:05.415281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강관 964
96.0%
강판 29
 
2.9%
속성나중입력 9
 
0.9%
주물형 1
 
0.1%
스테인레스 1
 
0.1%

신호등규격
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
<NA>
571 
Ø250X8.0
239 
150
99 
200
74 
50
 
8
Other values (5)
 
13

Length

Max length22
Median length4
Mean length4.8476096
Min length2

Unique

Unique3 ?
Unique (%)0.3%

Sample

1st row150
2nd row150
3rd row150
4th row150
5th row150

Common Values

ValueCountFrequency (%)
<NA> 571
56.9%
Ø250X8.0 239
23.8%
150 99
 
9.9%
200 74
 
7.4%
50 8
 
0.8%
Ø220X8.0 8
 
0.8%
710X355(6) 2
 
0.2%
Ø270X8.0 1
 
0.1%
1065X355(4),355X710(1) 1
 
0.1%
1065X355(6),355X710(1) 1
 
0.1%

Length

2023-12-11T08:36:05.566903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:36:05.766902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 571
56.9%
ø250x8.0 239
23.8%
150 99
 
9.9%
200 74
 
7.4%
50 8
 
0.8%
ø220x8.0 8
 
0.8%
710x355(6 2
 
0.2%
ø270x8.0 1
 
0.1%
1065x355(4),355x710(1 1
 
0.1%
1065x355(6),355x710(1 1
 
0.1%

신호등구분
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
차량신호등
532 
보행등
263 
점멸등
138 
기타
69 
차량,보행등
 
2

Length

Max length6
Median length5
Mean length3.997012
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보행등
2nd row보행등
3rd row보행등
4th row보행등
5th row보행등

Common Values

ValueCountFrequency (%)
차량신호등 532
53.0%
보행등 263
26.2%
점멸등 138
 
13.7%
기타 69
 
6.9%
차량,보행등 2
 
0.2%

Length

2023-12-11T08:36:05.951728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:36:06.064621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
차량신호등 532
53.0%
보행등 263
26.2%
점멸등 138
 
13.7%
기타 69
 
6.9%
차량,보행등 2
 
0.2%

신호등수
Real number (ℝ)

Distinct13
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.9163347
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.0 KiB
2023-12-11T08:36:06.174203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q34
95-th percentile6
Maximum16
Range15
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.0669194
Coefficient of variation (CV)0.70873874
Kurtosis9.8245529
Mean2.9163347
Median Absolute Deviation (MAD)1
Skewness2.3780951
Sum2928
Variance4.2721556
MonotonicityNot monotonic
2023-12-11T08:36:06.288712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
1 272
27.1%
2 239
23.8%
4 192
19.1%
3 164
16.3%
6 57
 
5.7%
5 46
 
4.6%
8 14
 
1.4%
7 5
 
0.5%
14 5
 
0.5%
10 3
 
0.3%
Other values (3) 7
 
0.7%
ValueCountFrequency (%)
1 272
27.1%
2 239
23.8%
3 164
16.3%
4 192
19.1%
5 46
 
4.6%
6 57
 
5.7%
7 5
 
0.5%
8 14
 
1.4%
10 3
 
0.3%
12 3
 
0.3%
ValueCountFrequency (%)
16 3
 
0.3%
15 1
 
0.1%
14 5
 
0.5%
12 3
 
0.3%
10 3
 
0.3%
8 14
 
1.4%
7 5
 
0.5%
6 57
 
5.7%
5 46
 
4.6%
4 192
19.1%

잔여시간표시기유무
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
없음
741 
있음
219 
미분류
 
44

Length

Max length3
Median length2
Mean length2.0438247
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row없음
2nd row없음
3rd row없음
4th row있음
5th row있음

Common Values

ValueCountFrequency (%)
없음 741
73.8%
있음 219
 
21.8%
미분류 44
 
4.4%

Length

2023-12-11T08:36:06.426473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:36:06.535513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
없음 741
73.8%
있음 219
 
21.8%
미분류 44
 
4.4%

Interactions

2023-12-11T08:36:04.269821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:36:06.921537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정읍면동설치일자신호등종류신호등재질신호등규격신호등구분신호등수잔여시간표시기유무
행정읍면동1.0000.7440.6590.6160.9060.5330.4330.720
설치일자0.7441.0000.6380.7101.0000.5970.3390.798
신호등종류0.6590.6381.0000.6600.8560.7920.5970.795
신호등재질0.6160.7100.6601.0001.0000.0800.0000.078
신호등규격0.9061.0000.8561.0001.0000.8770.7720.226
신호등구분0.5330.5970.7920.0800.8771.0000.5870.454
신호등수0.4330.3390.5970.0000.7720.5871.0000.634
잔여시간표시기유무0.7200.7980.7950.0780.2260.4540.6341.000
2023-12-11T08:36:07.062443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신호등구분잔여시간표시기유무설치일자신호등재질신호등규격행정읍면동신호등종류
신호등구분1.0000.3860.3900.0300.7470.3040.669
잔여시간표시기유무0.3861.0000.6490.0580.2240.5300.744
설치일자0.3900.6491.0000.4830.9840.3800.361
신호등재질0.0300.0580.4831.0000.9920.3700.503
신호등규격0.7470.2240.9840.9921.0000.5240.684
행정읍면동0.3040.5300.3800.3700.5241.0000.374
신호등종류0.6690.7440.3610.5030.6840.3741.000
2023-12-11T08:36:07.187083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신호등수행정읍면동설치일자신호등종류신호등재질신호등규격신호등구분잔여시간표시기유무
신호등수1.0000.1700.1500.3610.0000.4120.3880.360
행정읍면동0.1701.0000.3800.3740.3700.5240.3040.530
설치일자0.1500.3801.0000.3610.4830.9840.3900.649
신호등종류0.3610.3740.3611.0000.5030.6840.6690.744
신호등재질0.0000.3700.4830.5031.0000.9920.0300.058
신호등규격0.4120.5240.9840.6840.9921.0000.7470.224
신호등구분0.3880.3040.3900.6690.0300.7471.0000.386
잔여시간표시기유무0.3600.5300.6490.7440.0580.2240.3861.000

Missing values

2023-12-11T08:36:04.438929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:36:04.572424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정읍면동설치일자신호등종류신호등재질신호등규격신호등구분신호등수잔여시간표시기유무
0향촌동<NA>종형 2 색등강관150보행등1없음
1향촌동<NA>종형 2 색등강관150보행등1없음
2향촌동<NA>종형 2 색등강관150보행등1없음
3향촌동<NA>종형 3 색등강관150보행등1있음
4향촌동<NA>종형 3 색등강관150보행등2있음
5향촌동<NA>종형 3 색등강관150보행등1있음
6향촌동<NA>종형 2 색등강관150보행등1없음
7향촌동<NA>종형 3 색등강관150보행등1있음
8향촌동<NA>종형 2 색등강관150보행등1없음
9향촌동<NA>횡형 3 색등강관Ø250X8.0차량신호등2없음
행정읍면동설치일자신호등종류신호등재질신호등규격신호등구분신호등수잔여시간표시기유무
994서포면1900-01-01속성나중입력속성나중입력Ø220X8.0차량신호등3없음
995서포면1900-01-01속성나중입력속성나중입력Ø220X8.0보행등2없음
996서포면1900-01-01속성나중입력속성나중입력Ø220X8.0차량신호등3없음
997서포면1900-01-01속성나중입력속성나중입력Ø220X8.0보행등1없음
998사남면1900-01-01속성나중입력속성나중입력Ø270X8.0점멸등6없음
999서포면1900-01-01속성나중입력속성나중입력Ø220X8.0차량신호등3없음
1000사천읍<NA>횡형 2 색등강관710X355(6)점멸등6없음
1001사천읍<NA>횡형 2 색등강관710X355(6)점멸등6없음
1002사천읍<NA>기타강관1065X355(4),355X710(1)차량,보행등5있음
1003사천읍<NA>기타강관1065X355(6),355X710(1)차량,보행등7있음

Duplicate rows

Most frequently occurring

행정읍면동설치일자신호등종류신호등재질신호등규격신호등구분신호등수잔여시간표시기유무# duplicates
31동서동1960-01-01횡형 4 색등강관<NA>차량신호등4없음29
167향촌동<NA>종형 2 색등강관150보행등1없음24
6곤양면<NA>종형 3 색등강관Ø250X8.0보행등1있음20
67사천읍1960-01-01횡형 3 색등강관<NA>차량신호등3없음18
168향촌동<NA>종형 3 색등강관150보행등1있음18
177향촌동<NA>횡형 4 색등강관Ø250X8.0차량신호등2없음18
43사남면1960-01-01종형 2 색등강관<NA>보행등1없음17
35벌용동1960-01-01종형 2 색등강관<NA>보행등1없음16
41벌용동1960-01-01횡형 4 색등강관<NA>차량신호등4없음15
179향촌동<NA>횡형 4 색등강관Ø250X8.0차량신호등3없음15