Overview

Dataset statistics

Number of variables: 8
Number of observations: 6778
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): < 0.1%
Total size in memory: 443.6 KiB
Average record size in memory: 67.0 B
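
The memory and duplicate figures above are tied together by simple division. The sketch below re-derives them, assuming 1 KiB = 1024 bytes and that the report rounds to one decimal place; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
total_size_kib = 443.6   # total size in memory, in KiB
n_rows = 6778            # number of observations
duplicate_rows = 1       # duplicate rows reported

# Average record size in bytes, assuming 1 KiB = 1024 B.
avg_record_bytes = total_size_kib * 1024 / n_rows

# Share of duplicate rows, as a percentage of all rows.
duplicate_pct = duplicate_rows / n_rows * 100

print(f"average record size: {avg_record_bytes:.1f} B")
print(f"duplicate rows: {duplicate_pct:.3f}%")
```

The duplicate share comes out well under 0.1%, which is why the report shows it as "< 0.1%" rather than as a rounded number.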

Variable types

Numeric: 3
Text: 2
Categorical: 2
DateTime: 1

Dataset

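Description: Safety data analysis material provided by Jeju Special Self-Governing Province. It contains the hazardous spots selected around elementary schools (school name, latitude, longitude, reason for the hazard, and so on).
Author: Jeju Special Self-Governing Province (제주특별자치도)
URL: https://www.data.go.kr/data/15076629/fileData.do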

Alerts

데이터기준일자 (data reference date) has constant value "2021-01-25" (Constant)
Dataset has 1 (< 0.1%) duplicate rows (Duplicates)

Reproduction

Analysis started: 2023-12-12 02:39:33.803022
Analysis finished: 2023-12-12 02:39:36.139657
Duration: 2.34 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
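
The reported duration follows directly from the two timestamps. A minimal check, using only the values printed above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-12 02:39:33.803022")
finished = datetime.fromisoformat("2023-12-12 02:39:36.139657")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()

print(f"{duration_seconds:.2f} seconds")
```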

Variables

분석아이디 (analysis ID)
Real number (ℝ)

Distinct: 2710
Distinct (%): 40.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8421.1508
Minimum: 6050
Maximum: 11148
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 59.7 KiB

Quantile statistics

Minimum: 6050
5-th percentile: 6216.55
Q1: 7187.25
Median: 8349
Q3: 9636
95-th percentile: 10924
Maximum: 11148
Range: 5098
Interquartile range (IQR): 2448.75
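
Range and IQR are differences of the quantiles listed above. The sketch below recomputes them and, as an illustration only, applies the common 1.5 × IQR fence rule for flagging outliers; the report itself does not state that it uses this rule.

```python
# Quantiles of 분석아이디, copied from the list above.
minimum, q1, q3, maximum = 6050, 7187.25, 9636, 11148

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range

# Tukey-style fences (a convention assumed here, not taken from the report).
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(value_range, iqr)
print(lower_fence, upper_fence)
```

Both the minimum and the maximum fall inside the fences, so under this convention no value of the variable would be flagged.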

Descriptive statistics

Standard deviation: 1475.5185
Coefficient of variation (CV): 0.17521578
Kurtosis: -1.0612245
Mean: 8421.1508
Median Absolute Deviation (MAD): 1233
Skewness: 0.1918089
Sum: 57078560
Variance: 2177154.9
Monotonicity: Increasing
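
Several of these statistics can be derived from one another. The sketch below recomputes the coefficient of variation, the variance and the sum from the reported mean, standard deviation and row count. Small differences from the report are expected because the inputs are already rounded.

```python
# Values copied from the report for 분석아이디.
mean = 8421.1508
std = 1475.5185
n_rows = 6778

cv = std / mean          # coefficient of variation
variance = std ** 2      # variance is the squared standard deviation
total = mean * n_rows    # sum of all values

print(f"CV       = {cv:.8f}")
print(f"variance = {variance:.1f}")
print(f"sum      = {total:.0f}")
```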
Figure: histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
7467 | 15 | 0.2%
6152 | 15 | 0.2%
6421 | 15 | 0.2%
8277 | 12 | 0.2%
7786 | 12 | 0.2%
9711 | 12 | 0.2%
6151 | 12 | 0.2%
7932 | 12 | 0.2%
8088 | 12 | 0.2%
6331 | 12 | 0.2%
Other values (2700) | 6649 | 98.1%

Minimum 10 values

Value | Count | Frequency (%)
6050 | 4 | 0.1%
6051 | 5 | 0.1%
6055 | 6 | 0.1%
6057 | 1 | < 0.1%
6058 | 2 | < 0.1%
6059 | 5 | 0.1%
6060 | 3 | < 0.1%
6061 | 4 | 0.1%
6062 | 4 | 0.1%
6063 | 4 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
11148 | 1 | < 0.1%
11147 | 2 | < 0.1%
11145 | 2 | < 0.1%
11144 | 5 | 0.1%
11143 | 1 | < 0.1%
11142 | 2 | < 0.1%
11139 | 3 | < 0.1%
11138 | 2 | < 0.1%
11137 | 1 | < 0.1%
11135 | 1 | < 0.1%

학교명 (school name)
Text

Distinct: 114
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 53.1 KiB

Length

Max length: 11
Median length: 6
Mean length: 6.2446149
Min length: 6

Characters and Unicode

Total characters: 42326
Distinct characters: 122
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
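
As an illustration of those character properties, the sketch below looks up the general category of each character in the first sample value of this variable, using Python's standard `unicodedata` module. The category code `Lo` corresponds to "Other Letter" in the tables of this report. Script and block are not exposed by `unicodedata`, so they are left out here.

```python
import unicodedata
from collections import Counter

# First sample value of the school-name variable.
school_name = "고산초등학교"

# Print each character with its general category and official name.
for ch in school_name:
    print(ch, unicodedata.category(ch), unicodedata.name(ch))

# Count characters per general category.
categories = Counter(unicodedata.category(ch) for ch in school_name)
print(categories)
```

Every character in the sample is a Hangul syllable in category `Lo`, which is consistent with the single distinct category reported above.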

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 고산초등학교
2nd row: 고산초등학교
3rd row: 고산초등학교
4th row: 고산초등학교
5th row: 고산초등학교

Words

Value | Count | Frequency (%)
외도초등학교 | 323 | 4.8%
이도초등학교 | 320 | 4.7%
노형초등학교 | 205 | 3.0%
인화초등학교 | 195 | 2.9%
오라초등학교 | 186 | 2.7%
신광초등학교 | 184 | 2.7%
도남초등학교 | 180 | 2.7%
한라초등학교 | 162 | 2.4%
삼화초등학교 | 146 | 2.2%
동홍초등학교 | 145 | 2.1%
Other values (104) | 4732 | 69.8%

Most occurring characters

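Value | Count | Frequency (%)
(not preserved) | 6920 | 16.3%
(not preserved) | 6778 | 16.0%
(not preserved) | 6732 | 15.9%
(not preserved) | 6732 | 15.9%
(not preserved) | 1145 | 2.7%
(not preserved) | 610 | 1.4%
(not preserved) | 610 | 1.4%
(not preserved) | 575 | 1.4%
(not preserved) | 561 | 1.3%
(not preserved) | 555 | 1.3%
Other values (112) | 11108 | 26.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 42326 | 100.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 6920 | 16.3%
(not preserved) | 6778 | 16.0%
(not preserved) | 6732 | 15.9%
(not preserved) | 6732 | 15.9%
(not preserved) | 1145 | 2.7%
(not preserved) | 610 | 1.4%
(not preserved) | 610 | 1.4%
(not preserved) | 575 | 1.4%
(not preserved) | 561 | 1.3%
(not preserved) | 555 | 1.3%
Other values (112) | 11108 | 26.2%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 42326 | 100.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 6920 | 16.3%
(not preserved) | 6778 | 16.0%
(not preserved) | 6732 | 15.9%
(not preserved) | 6732 | 15.9%
(not preserved) | 1145 | 2.7%
(not preserved) | 610 | 1.4%
(not preserved) | 610 | 1.4%
(not preserved) | 575 | 1.4%
(not preserved) | 561 | 1.3%
(not preserved) | 555 | 1.3%
Other values (112) | 11108 | 26.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 42326 | 100.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not preserved) | 6920 | 16.3%
(not preserved) | 6778 | 16.0%
(not preserved) | 6732 | 15.9%
(not preserved) | 6732 | 15.9%
(not preserved) | 1145 | 2.7%
(not preserved) | 610 | 1.4%
(not preserved) | 610 | 1.4%
(not preserved) | 575 | 1.4%
(not preserved) | 561 | 1.3%
(not preserved) | 555 | 1.3%
Other values (112) | 11108 | 26.2%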

위도 (latitude)
Real number (ℝ)

Distinct: 6567
Distinct (%): 96.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 33.441419
Minimum: 33.200445
Maximum: 33.557405
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 59.7 KiB

Quantile statistics

Minimum: 33.200445
5-th percentile: 33.246418
Q1: 33.434683
Median: 33.488167
Q3: 33.50282
95-th percentile: 33.522221
Maximum: 33.557405
Range: 0.35696021
Interquartile range (IQR): 0.06813678

Descriptive statistics

Standard deviation: 0.09821985
Coefficient of variation (CV): 0.0029370718
Kurtosis: -0.20663215
Mean: 33.441419
Median Absolute Deviation (MAD): 0.02142113
Skewness: -1.2182272
Sum: 226665.94
Variance: 0.009647139
Monotonicity: Not monotonic
Figure: histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
33.48910225 | 3 | < 0.1%
33.48716107 | 3 | < 0.1%
33.51890797 | 3 | < 0.1%
33.48662714 | 3 | < 0.1%
33.48784316 | 3 | < 0.1%
33.48927384 | 3 | < 0.1%
33.49241659 | 3 | < 0.1%
33.48906423 | 3 | < 0.1%
33.27422858 | 2 | < 0.1%
33.48865182 | 2 | < 0.1%
Other values (6557) | 6750 | 99.6%

Minimum 10 values

Value | Count | Frequency (%)
33.20044486 | 1 | < 0.1%
33.21901587 | 1 | < 0.1%
33.21955247 | 1 | < 0.1%
33.22083917 | 1 | < 0.1%
33.22114942 | 1 | < 0.1%
33.22118529 | 1 | < 0.1%
33.22139263 | 1 | < 0.1%
33.22147594 | 1 | < 0.1%
33.22149342 | 1 | < 0.1%
33.22201438 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
33.55740507 | 1 | < 0.1%
33.55673448 | 1 | < 0.1%
33.55646365 | 1 | < 0.1%
33.55644234 | 1 | < 0.1%
33.55640551 | 1 | < 0.1%
33.55632781 | 1 | < 0.1%
33.55619004 | 1 | < 0.1%
33.55611755 | 1 | < 0.1%
33.5557791 | 1 | < 0.1%
33.55567197 | 1 | < 0.1%

경도 (longitude)
Real number (ℝ)

Distinct: 6538
Distinct (%): 96.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 126.52466
Minimum: 126.17638
Maximum: 126.93539
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 59.7 KiB

Quantile statistics

Minimum: 126.17638
5-th percentile: 126.2775
Q1: 126.47453
Median: 126.52634
Q3: 126.56938
95-th percentile: 126.80074
Maximum: 126.93539
Range: 0.7590112
Interquartile range (IQR): 0.09485025

Descriptive statistics

Standard deviation: 0.13024548
Coefficient of variation (CV): 0.0010294078
Kurtosis: 1.8369305
Mean: 126.52466
Median Absolute Deviation (MAD): 0.0481896
Skewness: 0.43498243
Sum: 857584.15
Variance: 0.016963884
Monotonicity: Not monotonic
Figure: histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
126.5291232 | 3 | < 0.1%
126.5140188 | 3 | < 0.1%
126.4329869 | 3 | < 0.1%
126.4323643 | 3 | < 0.1%
126.4335059 | 3 | < 0.1%
126.5350303 | 3 | < 0.1%
126.56783 | 3 | < 0.1%
126.5103818 | 3 | < 0.1%
126.4325028 | 3 | < 0.1%
126.5319857 | 2 | < 0.1%
Other values (6528) | 6749 | 99.6%

Minimum 10 values

Value | Count | Frequency (%)
126.1763798 | 1 | < 0.1%
126.1767218 | 1 | < 0.1%
126.1768493 | 1 | < 0.1%
126.1775595 | 1 | < 0.1%
126.1777633 | 1 | < 0.1%
126.1777962 | 1 | < 0.1%
126.1779473 | 1 | < 0.1%
126.1780253 | 1 | < 0.1%
126.1780703 | 1 | < 0.1%
126.1780717 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
126.935391 | 1 | < 0.1%
126.9353158 | 1 | < 0.1%
126.9350971 | 1 | < 0.1%
126.934802 | 1 | < 0.1%
126.9347454 | 1 | < 0.1%
126.9346604 | 1 | < 0.1%
126.9346054 | 1 | < 0.1%
126.9345866 | 1 | < 0.1%
126.9339508 | 1 | < 0.1%
126.9334994 | 1 | < 0.1%

위험지역 사유 (reason for hazard)
Categorical

Distinct: 14
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 53.1 KiB

Value | Count
신호등이 없음 | 2361
길가에 차들이 많이 세워져 있어 앞을 가림 | 874
차가 다니는 길가로만 다닐 수 있음 | 789
골목 등 좁은 길에서 갑자기 차가 튀어나옴 | 779
인도에 차가 서 있음 | 623
Other values (9) | 1352

Length

Max length: 27
Median length: 25
Mean length: 14.330333
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가게에서 놓은 물건 때문에 길을 다니기 어려움
2nd row: 신호등이 없음
3rd row: 차가 다니는 길가로만 다닐 수 있음
4th row: 횡단보도 초록불 신호가 너무 짧음
5th row: 골목 등 좁은 길에서 갑자기 차가 튀어나옴

Common Values

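Value | Count | Frequency (%)
신호등이 없음 (no traffic light) | 2361 | 34.8%
길가에 차들이 많이 세워져 있어 앞을 가림 (many cars parked along the roadside block the view) | 874 | 12.9%
차가 다니는 길가로만 다닐 수 있음 (pedestrians can only walk along the edge of the roadway) | 789 | 11.6%
골목 등 좁은 길에서 갑자기 차가 튀어나옴 (cars suddenly emerge from alleys and other narrow streets) | 779 | 11.5%
인도에 차가 서 있음 (cars are parked on the sidewalk) | 623 | 9.2%
횡단보도 초록불 신호가 너무 짧음 (the crosswalk green signal is too short) | 341 | 5.0%
기타 (other) | 298 | 4.4%
길 주변에 깨진 병 조각 같은 위험한 물건이 많음 (many dangerous objects such as broken glass near the road) | 209 | 3.1%
횡단보도가 적어서 많이 돌아가야 함 (few crosswalks, so long detours are needed) | 145 | 2.1%
위험한 곳이라는 안내시설이 없음 (no signage warning that the place is dangerous) | 105 | 1.5%
Other values (4) | 254 | 3.7%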

Length

Figure: histogram of lengths of the category

Words

Value | Count | Frequency (%)
없음 | 2479 | 8.3%
신호등이 | 2361 | 7.9%
차가 | 2191 | 7.4%
있음 | 1509 | 5.1%
많이 | 1019 | 3.4%
차들이 | 874 | 2.9%
세워져 | 874 | 2.9%
있어 | 874 | 2.9%
앞을 | 874 | 2.9%
가림 | 874 | 2.9%
Other values (48) | 15821 | 53.2%

기타 상세 사유 (other detailed reason)
Text

Distinct: 227
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 53.1 KiB

Length

Max length: 57
Median length: 4
Mean length: 4.324432
Min length: 1

Characters and Unicode

Total characters: 29311
Distinct characters: 328
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 181
Unique (%): 2.7%

Sample

1st row: 해당없음
2nd row: 해당없음
3rd row: 해당없음
4th row: 해당없음
5th row: 해당없음

Words

Value | Count | Frequency (%)
해당없음 | 6474 | 87.7%
차가 | 42 | 0.6%
없음 | 40 | 0.5%
있음 | 28 | 0.4%
인도가 | 27 | 0.4%
너무 | 18 | 0.2%
개가 | 17 | 0.2%
차들이 | 16 | 0.2%
많이 | 13 | 0.2%
다님 | 13 | 0.2%
Other values (428) | 695 | 9.4%

Most occurring characters

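Value | Count | Frequency (%)
(not preserved) | 6585 | 22.5%
(not preserved) | 6550 | 22.3%
(not preserved) | 6482 | 22.1%
(not preserved) | 6474 | 22.1%
(space) | 605 | 2.1%
(not preserved) | 189 | 0.6%
(not preserved) | 146 | 0.5%
(not preserved) | 121 | 0.4%
(not preserved) | 95 | 0.3%
(not preserved) | 86 | 0.3%
Other values (318) | 1978 | 6.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 28672 | 97.8%
Space Separator | 605 | 2.1%
Other Punctuation | 21 | 0.1%
Uppercase Letter | 6 | < 0.1%
Decimal Number | 3 | < 0.1%
Lowercase Letter | 2 | < 0.1%
Close Punctuation | 1 | < 0.1%
Open Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 6585 | 23.0%
(not preserved) | 6550 | 22.8%
(not preserved) | 6482 | 22.6%
(not preserved) | 6474 | 22.6%
(not preserved) | 189 | 0.7%
(not preserved) | 146 | 0.5%
(not preserved) | 121 | 0.4%
(not preserved) | 95 | 0.3%
(not preserved) | 86 | 0.3%
(not preserved) | 60 | 0.2%
Other values (306) | 1884 | 6.6%

Decimal Number
Value | Count | Frequency (%)
2 | 1 | 33.3%
0 | 1 | 33.3%
5 | 1 | 33.3%

Other Punctuation
Value | Count | Frequency (%)
. | 18 | 85.7%
, | 3 | 14.3%

Uppercase Letter
Value | Count | Frequency (%)
U | 3 | 50.0%
C | 3 | 50.0%

Lowercase Letter
Value | Count | Frequency (%)
t | 1 | 50.0%
k | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 605 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%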

Most occurring scripts

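Value | Count | Frequency (%)
Hangul | 28672 | 97.8%
Common | 631 | 2.2%
Latin | 8 | < 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 6585 | 23.0%
(not preserved) | 6550 | 22.8%
(not preserved) | 6482 | 22.6%
(not preserved) | 6474 | 22.6%
(not preserved) | 189 | 0.7%
(not preserved) | 146 | 0.5%
(not preserved) | 121 | 0.4%
(not preserved) | 95 | 0.3%
(not preserved) | 86 | 0.3%
(not preserved) | 60 | 0.2%
Other values (306) | 1884 | 6.6%

Common
Value | Count | Frequency (%)
(space) | 605 | 95.9%
. | 18 | 2.9%
, | 3 | 0.5%
2 | 1 | 0.2%
) | 1 | 0.2%
( | 1 | 0.2%
0 | 1 | 0.2%
5 | 1 | 0.2%

Latin
Value | Count | Frequency (%)
U | 3 | 37.5%
C | 3 | 37.5%
t | 1 | 12.5%
k | 1 | 12.5%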

Most occurring blocks

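Value | Count | Frequency (%)
Hangul | 28667 | 97.8%
ASCII | 639 | 2.2%
Compat Jamo | 5 | < 0.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not preserved) | 6585 | 23.0%
(not preserved) | 6550 | 22.8%
(not preserved) | 6482 | 22.6%
(not preserved) | 6474 | 22.6%
(not preserved) | 189 | 0.7%
(not preserved) | 146 | 0.5%
(not preserved) | 121 | 0.4%
(not preserved) | 95 | 0.3%
(not preserved) | 86 | 0.3%
(not preserved) | 60 | 0.2%
Other values (303) | 1879 | 6.6%

ASCII
Value | Count | Frequency (%)
(space) | 605 | 94.7%
. | 18 | 2.8%
, | 3 | 0.5%
U | 3 | 0.5%
C | 3 | 0.5%
2 | 1 | 0.2%
t | 1 | 0.2%
k | 1 | 0.2%
) | 1 | 0.2%
( | 1 | 0.2%
Other values (2) | 2 | 0.3%

Compat Jamo
Value | Count | Frequency (%)
(not preserved) | 2 | 40.0%
(not preserved) | 2 | 40.0%
(not preserved) | 1 | 20.0%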

행위 구분 (activity type)
Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 53.1 KiB

Value | Count
등교 | 3798
하교 | 2018
활동 | 962

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 하교
2nd row: 등교
3rd row: 등교
4th row: 하교
5th row: 하교

Common Values

Value | Count | Frequency (%)
등교 (going to school) | 3798 | 56.0%
하교 (leaving school) | 2018 | 29.8%
활동 (activity) | 962 | 14.2%
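
The frequency column is each count divided by the total number of observations. A short check, using the three counts from the table above:

```python
# Category counts for 행위 구분, copied from the report.
counts = {"등교": 3798, "하교": 2018, "활동": 962}

# The counts should add up to the number of observations (6778).
total = sum(counts.values())

# Percentage share of each category.
shares = {label: count / total * 100 for label, count in counts.items()}

for label, share in shares.items():
    print(f"{label}: {share:.1f}%")
```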

Length

Figure: histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
등교 | 3798 | 56.0%
하교 | 2018 | 29.8%
활동 | 962 | 14.2%

데이터기준일자 (data reference date)
Date

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 53.1 KiB
Minimum: 2021-01-25 00:00:00
Maximum: 2021-01-25 00:00:00
Figure: histogram with fixed size bins (bins=1)

Interactions


Correlations

Variable | 분석아이디 | 위도 | 경도 | 위험지역 사유 | 행위 구분
분석아이디 | 1.000 | 0.751 | 0.855 | 0.129 | 0.023
위도 | 0.751 | 1.000 | 0.856 | 0.093 | 0.054
경도 | 0.855 | 0.856 | 1.000 | 0.141 | 0.058
위험지역 사유 | 0.129 | 0.093 | 0.141 | 1.000 | 0.135
행위 구분 | 0.023 | 0.054 | 0.058 | 0.135 | 1.000

Variable | 위험지역 사유 | 행위 구분
위험지역 사유 | 1.000 | 0.075
행위 구분 | 0.075 | 1.000

Variable | 분석아이디 | 위도 | 경도 | 위험지역 사유 | 행위 구분
분석아이디 | 1.000 | 0.353 | -0.059 | 0.052 | 0.013
위도 | 0.353 | 1.000 | 0.325 | 0.037 | 0.032
경도 | -0.059 | 0.325 | 1.000 | 0.057 | 0.034
위험지역 사유 | 0.052 | 0.037 | 0.057 | 1.000 | 0.075
행위 구분 | 0.013 | 0.032 | 0.034 | 0.075 | 1.000
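
The text of the report does not name the coefficient behind each matrix. For a pair of categorical variables such as 위험지역 사유 and 행위 구분, one widely used association measure is Cramér's V. The sketch below is a plain, uncorrected implementation run on made-up toy pairs. It is an assumption that a measure of this kind produced the 0.075 shown above, and the function name `cramers_v` is hypothetical.

```python
import math
from collections import Counter


def cramers_v(pairs):
    """Uncorrected Cramér's V for a list of (x, y) category pairs."""
    n = len(pairs)
    joint = Counter(pairs)
    row_totals = Counter(x for x, _ in pairs)
    col_totals = Counter(y for _, y in pairs)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_count in row_totals.items():
        for y, col_count in col_totals.items():
            expected = row_count * col_count / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    # Normalise by n times (smaller table dimension - 1).
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Toy data (not from the dataset): perfect association and independence.
perfect = [("a", "x")] * 5 + [("b", "y")] * 5
independent = [("a", "x"), ("a", "y"), ("b", "x"), ("b", "y")]

print(cramers_v(perfect))
print(cramers_v(independent))
```

Values near 0 indicate little association and values near 1 a strong one, so a coefficient of 0.075 on this scale would point to a weak relationship between the two variables.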

Missing values

Figure: a simple visualization of nullity by column.
Figure: nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

Index | 분석아이디 | 학교명 | 위도 | 경도 | 위험지역 사유 | 기타 상세 사유 | 행위 구분 | 데이터기준일자
0 | 6050 | 고산초등학교 | 33.307456 | 126.178191 | 가게에서 놓은 물건 때문에 길을 다니기 어려움 | 해당없음 | 하교 | 2021-01-25
1 | 6050 | 고산초등학교 | 33.307275 | 126.176722 | 신호등이 없음 | 해당없음 | 등교 | 2021-01-25
2 | 6050 | 고산초등학교 | 33.307267 | 126.178247 | 차가 다니는 길가로만 다닐 수 있음 | 해당없음 | 등교 | 2021-01-25
3 | 6050 | 고산초등학교 | 33.307282 | 126.177763 | 횡단보도 초록불 신호가 너무 짧음 | 해당없음 | 하교 | 2021-01-25
4 | 6051 | 고산초등학교 | 33.305677 | 126.17638 | 골목 등 좁은 길에서 갑자기 차가 튀어나옴 | 해당없음 | 하교 | 2021-01-25
5 | 6051 | 고산초등학교 | 33.305349 | 126.181349 | 돌이 떨어질 것 같음 | 해당없음 | 활동 | 2021-01-25
6 | 6051 | 고산초등학교 | 33.305641 | 126.179097 | 인도에 차가 서 있음 | 해당없음 | 활동 | 2021-01-25
7 | 6051 | 고산초등학교 | 33.304065 | 126.178113 | 차가 다니는 길가로만 다닐 수 있음 | 해당없음 | 등교 | 2021-01-25
8 | 6051 | 고산초등학교 | 33.305707 | 126.178248 | 횡단보도 초록불 신호가 너무 짧음 | 해당없음 | 등교 | 2021-01-25
9 | 6055 | 곽금초등학교 | 33.444893 | 126.30576 | 나쁜 형, 아저씨 있음 | 해당없음 | 등교 | 2021-01-25

Last rows

Index | 분석아이디 | 학교명 | 위도 | 경도 | 위험지역 사유 | 기타 상세 사유 | 행위 구분 | 데이터기준일자
6768 | 11144 | 제주중앙초등학교 | 33.504385 | 126.51673 | 신호등이 없음 | 해당없음 | 등교 | 2021-01-25
6769 | 11144 | 제주중앙초등학교 | 33.504281 | 126.516585 | 신호등이 없음 | 해당없음 | 등교 | 2021-01-25
6770 | 11144 | 제주중앙초등학교 | 33.503104 | 126.516576 | 신호등이 없음 | 해당없음 | 등교 | 2021-01-25
6771 | 11144 | 제주중앙초등학교 | 33.504437 | 126.517989 | 신호등이 없음 | 해당없음 | 활동 | 2021-01-25
6772 | 11144 | 제주중앙초등학교 | 33.50432 | 126.517882 | 신호등이 없음 | 해당없음 | 활동 | 2021-01-25
6773 | 11145 | 제주중앙초등학교 | 33.499256 | 126.517766 | 신호등이 없음 | 해당없음 | 등교 | 2021-01-25
6774 | 11145 | 제주중앙초등학교 | 33.503245 | 126.516496 | 신호등이 없음 | 해당없음 | 활동 | 2021-01-25
6775 | 11147 | 제주중앙초등학교 | 33.504296 | 126.514702 | 신호등이 없음 | 해당없음 | 등교 | 2021-01-25
6776 | 11147 | 제주중앙초등학교 | 33.504657 | 126.514764 | 신호등이 없음 | 해당없음 | 하교 | 2021-01-25
6777 | 11148 | 제주중앙초등학교 | 33.503589 | 126.519294 | 횡단보도가 적어서 많이 돌아가야 함 | 해당없음 | 하교 | 2021-01-25

Duplicate rows

Most frequently occurring

Index | 분석아이디 | 학교명 | 위도 | 경도 | 위험지역 사유 | 기타 상세 사유 | 행위 구분 | 데이터기준일자 | # duplicates
0 | 7865 | 삼화초등학교 | 33.517631 | 126.585346 | 길 주변에 깨진 병 조각 같은 위험한 물건이 많음 | 해당없음 | 등교 | 2021-01-25 | 2
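
A "# duplicates" value of 2 means that one record appears twice, which matches the single duplicate row counted in the overview. The sketch below shows one way such rows could be found with a plain counter. The `rows` list is illustrative: it repeats the record above twice and adds one unrelated sample row, with coordinates as displayed (rounded) in this report.

```python
from collections import Counter

# Illustrative rows as tuples; the first two are the duplicated record.
rows = [
    (7865, "삼화초등학교", 33.517631, 126.585346,
     "길 주변에 깨진 병 조각 같은 위험한 물건이 많음", "해당없음", "등교", "2021-01-25"),
    (7865, "삼화초등학교", 33.517631, 126.585346,
     "길 주변에 깨진 병 조각 같은 위험한 물건이 많음", "해당없음", "등교", "2021-01-25"),
    (6050, "고산초등학교", 33.307275, 126.176722,
     "신호등이 없음", "해당없음", "등교", "2021-01-25"),
]

# Count identical rows and keep those that occur more than once.
occurrences = Counter(rows)
duplicates = {row: count for row, count in occurrences.items() if count > 1}

for row, count in duplicates.items():
    print(row[0], row[1], "occurs", count, "times")
```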