Overview

Dataset statistics

Number of variables: 6
Number of observations: 274
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 7
Duplicate rows (%): 2.6%
Total size in memory: 13.8 KiB
Average record size in memory: 51.5 B
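
The percentage and size figures above follow from the raw counts. A minimal sketch that re-derives them; the variable names are illustrative and the rounding to one decimal place is an assumption about how the report formats values:

```python
# Figures copied from the "Dataset statistics" table.
n_rows = 274
n_duplicate_rows = 7
avg_record_bytes = 51.5

# Share of duplicate rows, as a percentage rounded to one decimal.
duplicate_pct = round(100 * n_duplicate_rows / n_rows, 1)

# Total memory implied by the average record size, in KiB (1 KiB = 1024 B).
total_kib = round(n_rows * avg_record_bytes / 1024, 1)

print(duplicate_pct, total_kib)
```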

Variable types

Numeric: 3
Categorical: 2
Text: 1

Dataset

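Description: Overpass and bridge information registered in Goesan-gun's National Spatial Data Integration System (국가공간정보통합체계). It may differ from actual conditions. For details, please contact Goesan-gun.
URL: https://www.data.go.kr/data/15118962/fileData.do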

Alerts

구분 has constant value "교량" [Constant]
Dataset has 7 (2.6%) duplicate rows [Duplicates]
종류 is highly imbalanced (96.5%) [Imbalance]
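
The 96.5% imbalance figure is consistent with a normalized-entropy score applied to the 종류 counts (273 and 1). The sketch below is an illustration under that assumption, not a confirmed copy of the profiler's implementation; `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy).

    0.0 means perfectly balanced classes; values near 1.0 mean one class
    dominates. Hypothetical helper, assumed to mirror the report's metric.
    """
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # a single class has no balance to measure
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)

# Counts for 종류 taken from the report: 콘크리트 = 273, 석재 = 1.
score = imbalance_score([273, 1])
print(f"{100 * score:.1f}%")
```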

Reproduction

Analysis started: 2023-12-12 07:37:55.530834
Analysis finished: 2023-12-12 07:37:56.731072
Duration: 1.2 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

맵번호 (map number)
Real number (ℝ)

Distinct: 89
Distinct (%): 32.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 36704767
Minimum: 36703060
Maximum: 36708046
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.5 KiB

Quantile statistics

Minimum: 36703060
5-th percentile: 36703070
Q1: 36704053
Median: 36704078
Q3: 36704100
95-th percentile: 36708034
Maximum: 36708046
Range: 4986
Interquartile range (IQR): 47

Descriptive statistics

Standard deviation: 1715.507
Coefficient of variation (CV): 4.6737988 × 10^-5
Kurtosis: -0.41344446
Mean: 36704767
Median Absolute Deviation (MAD): 25
Skewness: 1.0996822
Sum: 1.0057106 × 10^10
Variance: 2942964.2
Monotonicity: Increasing
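
Several of these descriptive statistics are tied together by simple identities: the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. A short check using the rounded values reported for 맵번호 (small differences in the last digits come from that rounding):

```python
# Rounded values copied from the 맵번호 statistics.
std = 1715.507
mean = 36704767

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the square of the standard deviation

print(f"CV = {cv:.7e}")
print(f"Variance = {variance:.1f}")
```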
Figure: Histogram with fixed size bins (bins=50)
Common values

Value              Count  Frequency (%)
36704100           14     5.1%
36704058           11     4.0%
36704072           11     4.0%
36704061           9      3.3%
36704090           8      2.9%
36703060           7      2.6%
36704082           7      2.6%
36704087           6      2.2%
36704065           6      2.2%
36704089           6      2.2%
Other values (79)  189    69.0%

Minimum 10 values

Value     Count  Frequency (%)
36703060  7      2.6%
36703066  2      0.7%
36703067  3      1.1%
36703069  1      0.4%
36703070  3      1.1%
36703076  6      2.2%
36703077  3      1.1%
36703078  2      0.7%
36703080  3      1.1%
36703086  6      2.2%

Maximum 10 values

Value     Count  Frequency (%)
36708046  2      0.7%
36708043  1      0.4%
36708042  2      0.7%
36708041  3      1.1%
36708036  2      0.7%
36708035  4      1.5%
36708034  1      0.4%
36708033  2      0.7%
36708032  1      0.4%
36708031  2      0.7%

구분 (category)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

교량 (bridge): 274

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 교량
2nd row: 교량
3rd row: 교량
4th row: 교량
5th row: 교량

Common Values

Value  Count  Frequency (%)
교량   274    100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of common values (교량: 274, 100.0%)

육교명 (bridge name)
Text

Distinct: 226
Distinct (%): 82.5%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 7
Median length: 3
Mean length: 3.3978102
Min length: 2

Characters and Unicode

Total characters: 931
Distinct characters: 158
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 187
Unique (%): 68.2%

Sample

1st row: 궁굴교
2nd row: 밀선교
3rd row: 새터교
4th row: 소수1교
5th row: 소수2교

Value               Count  Frequency (%)
적석교              4      1.5%
행촌교              4      1.5%
광덕교              4      1.5%
후평교              3      1.1%
광전교              3      1.1%
장암교              3      1.1%
강천1교             2      0.7%
아성교              2      0.7%
유산교              2      0.7%
백봉교              2      0.7%
Other values (216)  245    89.4%

Most occurring characters

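Value                      Count  Frequency (%)
(character not preserved)  276    29.6%
1                          25     2.7%
(character not preserved)  24     2.6%
2                          19     2.0%
(character not preserved)  18     1.9%
(character not preserved)  16     1.7%
(character not preserved)  14     1.5%
(character not preserved)  13     1.4%
(character not preserved)  13     1.4%
(character not preserved)  12     1.3%
Other values (148)         501    53.8%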

Most occurring categories

Value              Count  Frequency (%)
Other Letter       876    94.1%
Decimal Number     47     5.0%
Uppercase Letter   4      0.4%
Open Punctuation   2      0.2%
Close Punctuation  2      0.2%
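
The category names above correspond to Unicode general categories (Other Letter = Lo, Decimal Number = Nd, Uppercase Letter = Lu, Open Punctuation = Ps, Close Punctuation = Pe). As a rough illustration of how such a breakdown can be obtained, the sketch below tallies categories with Python's standard `unicodedata` module over the five sample bridge names shown earlier. It covers only those five names, so its counts do not match the full-column totals, and it is not necessarily how the profiler itself does the tally.

```python
import unicodedata
from collections import Counter

# The five sample values listed for the bridge-name column.
names = ["궁굴교", "밀선교", "새터교", "소수1교", "소수2교"]

# Normalize to NFC so each Hangul syllable is a single code point,
# then count the general category of every character.
category_counts = Counter(
    unicodedata.category(ch)
    for name in names
    for ch in unicodedata.normalize("NFC", name)
)

print(category_counts)  # Hangul syllables fall under "Lo", digits under "Nd"
```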

Most frequent character per category

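Other Letter

Value                      Count  Frequency (%)
(character not preserved)  276    31.5%
(character not preserved)  24     2.7%
(character not preserved)  18     2.1%
(character not preserved)  16     1.8%
(character not preserved)  14     1.6%
(character not preserved)  13     1.5%
(character not preserved)  13     1.5%
(character not preserved)  12     1.4%
(character not preserved)  12     1.4%
(character not preserved)  11     1.3%
Other values (141)         467    53.3%

Decimal Number

Value  Count  Frequency (%)
1      25     53.2%
2      19     40.4%
4      3      6.4%

Uppercase Letter

Value  Count  Frequency (%)
C      2      50.0%
I      2      50.0%

Open Punctuation

Value  Count  Frequency (%)
(      2      100.0%

Close Punctuation

Value  Count  Frequency (%)
)      2      100.0%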

Most occurring scripts

Value   Count  Frequency (%)
Hangul  876    94.1%
Common  51     5.5%
Latin   4      0.4%

Most frequent character per script

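Hangul

Value                      Count  Frequency (%)
(character not preserved)  276    31.5%
(character not preserved)  24     2.7%
(character not preserved)  18     2.1%
(character not preserved)  16     1.8%
(character not preserved)  14     1.6%
(character not preserved)  13     1.5%
(character not preserved)  13     1.5%
(character not preserved)  12     1.4%
(character not preserved)  12     1.4%
(character not preserved)  11     1.3%
Other values (141)         467    53.3%

Common

Value  Count  Frequency (%)
1      25     49.0%
2      19     37.3%
4      3      5.9%
(      2      3.9%
)      2      3.9%

Latin

Value  Count  Frequency (%)
C      2      50.0%
I      2      50.0%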

Most occurring blocks

Value   Count  Frequency (%)
Hangul  876    94.1%
ASCII   55     5.9%

Most frequent character per block

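Hangul

Value                      Count  Frequency (%)
(character not preserved)  276    31.5%
(character not preserved)  24     2.7%
(character not preserved)  18     2.1%
(character not preserved)  16     1.8%
(character not preserved)  14     1.6%
(character not preserved)  13     1.5%
(character not preserved)  13     1.5%
(character not preserved)  12     1.4%
(character not preserved)  12     1.4%
(character not preserved)  11     1.3%
Other values (141)         467    53.3%

ASCII

Value  Count  Frequency (%)
1      25     45.5%
2      19     34.5%
4      3      5.5%
(      2      3.6%
)      2      3.6%
C      2      3.6%
I      2      3.6%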

육교길이 (overpass/bridge length)
Real number (ℝ)

Distinct: 128
Distinct (%): 46.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 67.208504
Minimum: 5
Maximum: 600
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.5 KiB

Quantile statistics

Minimum: 5
5-th percentile: 11.825
Q1: 24.125
Median: 40
Q3: 71.5
95-th percentile: 210.445
Maximum: 600
Range: 595
Interquartile range (IQR): 47.375
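
The range and IQR are plain differences of the quantiles listed above. The sketch below re-derives them and, as an extra illustration that is not part of the report, computes the upper fence from the common 1.5 × IQR rule of thumb, which gives a feel for how far the maximum of 600 sits from the bulk of the data.

```python
# Quantiles copied from the 육교길이 table.
minimum, q1, q3, maximum = 5.0, 24.125, 71.5, 600.0

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range

# Assumption: Tukey's 1.5 * IQR convention; the report does not compute this.
upper_fence = q3 + 1.5 * iqr

print(value_range, iqr, upper_fence)
```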

Descriptive statistics

Standard deviation: 83.956278
Coefficient of variation (CV): 1.2491913
Kurtosis: 14.767318
Mean: 67.208504
Median Absolute Deviation (MAD): 19
Skewness: 3.4802945
Sum: 18415.13
Variance: 7048.6566
Monotonicity: Not monotonic
Figure: Histogram with fixed size bins (bins=50)
Common values

Value               Count  Frequency (%)
40.0                13     4.7%
60.0                10     3.6%
12.0                9      3.3%
33.0                7      2.6%
50.0                7      2.6%
150.0               7      2.6%
22.0                7      2.6%
30.0                7      2.6%
24.0                7      2.6%
28.0                6      2.2%
Other values (118)  194    70.8%

Minimum 10 values

Value  Count  Frequency (%)
5.0    1      0.4%
6.0    1      0.4%
7.0    1      0.4%
7.6    1      0.4%
9.0    1      0.4%
9.5    2      0.7%
10.0   4      1.5%
10.7   1      0.4%
11.5   2      0.7%
12.0   9      3.3%

Maximum 10 values

Value  Count  Frequency (%)
600.0  1      0.4%
570.0  1      0.4%
450.0  2      0.7%
400.0  2      0.7%
360.0  2      0.7%
300.0  1      0.4%
250.0  1      0.4%
240.0  1      0.4%
225.0  1      0.4%
220.0  2      0.7%

육교너비 (overpass/bridge width)
Real number (ℝ)

Distinct: 52
Distinct (%): 19.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10.185438
Minimum: 3
Maximum: 90
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.5 KiB

Quantile statistics

Minimum: 3
5-th percentile: 4.93
Q1: 6
Median: 9
Q3: 11
95-th percentile: 20.175
Maximum: 90
Range: 87
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 7.2113713
Coefficient of variation (CV): 0.70800797
Kurtosis: 55.346772
Mean: 10.185438
Median Absolute Deviation (MAD): 2.9
Skewness: 5.7123587
Sum: 2790.81
Variance: 52.003875
Monotonicity: Not monotonic
Figure: Histogram with fixed size bins (bins=50)
Common values

Value              Count  Frequency (%)
5.0                30     10.9%
6.0                29     10.6%
10.0               23     8.4%
9.0                21     7.7%
7.0                15     5.5%
11.0               13     4.7%
9.5                13     4.7%
7.5                13     4.7%
19.5               11     4.0%
12.6               10     3.6%
Other values (42)  96     35.0%

Minimum 10 values

Value  Count  Frequency (%)
3.0    2      0.7%
3.5    1      0.4%
4.0    9      3.3%
4.5    1      0.4%
4.8    1      0.4%
5.0    30     10.9%
5.5    1      0.4%
5.6    1      0.4%
6.0    29     10.6%
6.1    1      0.4%

Maximum 10 values

Value  Count  Frequency (%)
90.0   1      0.4%
36.0   1      0.4%
35.0   1      0.4%
31.2   1      0.4%
28.0   1      0.4%
27.0   1      0.4%
24.3   1      0.4%
22.8   1      0.4%
22.5   1      0.4%
22.4   1      0.4%

종류 (type)
Categorical

IMBALANCE

Distinct: 2
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

콘크리트 (concrete): 273
석재 (stone): 1

Length

Max length: 4
Median length: 4
Mean length: 3.9927007
Min length: 2
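
The mean length is a count-weighted average of the string lengths of the two categories. A small sketch that reproduces the figure from the category counts reported for this variable (273 and 1):

```python
# Category counts for 종류 as reported: 콘크리트 = 273, 석재 = 1.
counts = {"콘크리트": 273, "석재": 1}

total_rows = sum(counts.values())
# Weight each category's character length by how often it occurs.
mean_length = sum(len(value) * n for value, n in counts.items()) / total_rows

print(round(mean_length, 7))
```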

Unique

Unique: 1
Unique (%): 0.4%

Sample

1st row: 콘크리트
2nd row: 콘크리트
3rd row: 콘크리트
4th row: 콘크리트
5th row: 콘크리트

Common Values

Value     Count  Frequency (%)
콘크리트  273    99.6%
석재      1      0.4%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of common values (콘크리트: 273, 99.6%; 석재: 1, 0.4%)

Interactions

Figure: nine pairwise interaction plots (images not preserved)

Correlations

          맵번호  육교길이  육교너비  종류
맵번호    1.000   0.265     0.123     0.041
육교길이  0.265   1.000     0.476     0.000
육교너비  0.123   0.476     1.000     0.000
종류      0.041   0.000     0.000     1.000

          맵번호  육교길이  육교너비  종류
맵번호    1.000   0.090     -0.152    0.000
육교길이  0.090   1.000     0.286     0.000
육교너비  -0.152  0.286     1.000     0.000
종류      0.000   0.000     0.000     1.000

Missing values

Figure: A simple visualization of nullity by column.
Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

     맵번호    구분  육교명   육교길이  육교너비  종류
0    36703060  교량  궁굴교   34.0      10.0      콘크리트
1    36703060  교량  밀선교   20.0      5.0       콘크리트
2    36703060  교량  새터교   27.0      5.0       콘크리트
3    36703060  교량  소수1교  150.0     21.0      콘크리트
4    36703060  교량  소수2교  20.9      20.0      콘크리트
5    36703060  교량  수리교   30.0      6.0       콘크리트
6    36703060  교량  수리교   30.0      9.0       콘크리트
7    36703066  교량  산정교   40.0      9.0       콘크리트
8    36703066  교량  중흥교   24.0      7.0       콘크리트
9    36703067  교량  둔기2교  15.0      9.5       콘크리트

Last rows

     맵번호    구분  육교명   육교길이  육교너비  종류
264  36708036  교량  삼동교   45.0      5.0       콘크리트
265  36708036  교량  삼송1교  50.0      4.0       콘크리트
266  36708041  교량  가락교   50.0      9.0       콘크리트
267  36708041  교량  동평교   45.2      6.5       콘크리트
268  36708041  교량  박명교   47.5      4.5       콘크리트
269  36708042  교량  신월교   55.0      7.0       콘크리트
270  36708042  교량  신월교   80.0      10.0      콘크리트
271  36708043  교량  사담교   53.0      6.5       콘크리트
272  36708046  교량  다보교   52.0      7.5       콘크리트
273  36708046  교량  삼송2교  38.6      7.5       콘크리트

Duplicate rows

Most frequently occurring

   맵번호    구분  육교명    육교길이  육교너비  종류      # duplicates
0  36704046  교량  남달천교  450.0     12.6      콘크리트  2
1  36704046  교량  문주4교   360.0     12.6      콘크리트  2
2  36704058  교량  추점1교   60.0      12.6      콘크리트  2
3  36704098  교량  독정교    11.5      13.0      콘크리트  2
4  36704098  교량  후동교    60.0      10.5      콘크리트  2
5  36704100  교량  적석교    40.0      12.0      콘크리트  2
6  36704100  교량  행촌교    400.0     11.9      콘크리트  2
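
A row counts as a duplicate only when every column matches. The sketch below illustrates this with a few rows taken from the tables in this report: 남달천교 appears twice with identical values, while the two 수리교 rows differ in width and so are not duplicates. It uses plain tuples and `collections.Counter` as a simplified stand-in; how the profiler detects duplicates internally is not shown in the report.

```python
from collections import Counter

# Columns: 맵번호, 구분, 육교명, 육교길이, 육교너비, 종류
rows = [
    (36704046, "교량", "남달천교", 450.0, 12.6, "콘크리트"),
    (36704046, "교량", "남달천교", 450.0, 12.6, "콘크리트"),  # exact repeat
    (36703060, "교량", "수리교", 30.0, 6.0, "콘크리트"),
    (36703060, "교량", "수리교", 30.0, 9.0, "콘크리트"),      # width differs
]

row_counts = Counter(rows)

# Keep only rows that occur more than once.
duplicated = {row: n for row, n in row_counts.items() if n > 1}

# Number of surplus copies beyond the first occurrence of each row.
extra_rows = sum(n - 1 for n in duplicated.values())

print(len(duplicated), extra_rows)
```

With seven row groups that each occur twice, the same arithmetic gives the 7 duplicate rows reported in the overview.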