Overview

Dataset statistics

Number of variables: 4
Number of observations: 247
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.3 KiB
Average record size in memory: 34.5 B

Variable types

Text: 1
Categorical: 1
Numeric: 2

Dataset

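Description: Public cultural and sports facilities in Cheonan-si, Chungcheongnam-do, as recorded in the Urban Planning Information System (UPIS). The fields include the status-shape management number (현황도형 관리번호) and the label name (라벨명).
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=14&beforeMenuCd=DOM_000000201001001000&publicdatapk=15123190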

Alerts

면적_도형 has a high overall correlation with 길이_도형 and 1 other field (High correlation)
길이_도형 has a high overall correlation with 면적_도형 (High correlation)
라벨명 has a high overall correlation with 면적_도형 (High correlation)
현황도형 관리번호 has unique values (Unique)

Reproduction

Analysis started: 2024-01-09 22:24:04.534502
Analysis finished: 2024-01-09 22:24:05.263391
Duration: 0.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
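
A report like this one could be regenerated along the following lines. This is a minimal sketch, not the configuration that produced the report above: the helper name `build_profile`, the file paths, the report title, and the default encoding are all assumptions. Only `pandas.read_csv`, `ProfileReport`, and `ProfileReport.to_file` are real library calls.

```python
from pathlib import Path


def build_profile(csv_path: str, html_path: str, encoding: str = "utf-8") -> Path:
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so that defining the helper does not require the
    # third-party packages; calling it does.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the source file.
    df = pd.read_csv(csv_path, encoding=encoding)

    # The title is a placeholder, not the title of the original report.
    report = ProfileReport(df, title="Profiling report")
    report.to_file(html_path)
    return Path(html_path)
```

For example, `build_profile("facilities.csv", "report.html")` would write the report next to the script, where `facilities.csv` stands for a local copy of the dataset.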

Variables

현황도형 관리번호
Text

UNIQUE

Distinct: 247
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 24
Median length: 24
Mean length: 24
Min length: 24

Characters and Unicode

Total characters: 5928
Distinct characters: 14
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
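
The category breakdown can be illustrated with Python's standard `unicodedata` module. The sketch below is not the profiler's own implementation; it only counts general categories over the five sample identifiers listed in this report, and the function name `category_counts` is made up for the example. `Nd` is the Unicode code for Decimal Number and `Lu` for Uppercase Letter.

```python
import unicodedata
from collections import Counter

# Five sample identifiers copied from the report's sample rows.
SAMPLE_IDS = [
    "44130UQ155PS201207050098",
    "44130UQ155PS200912020155",
    "44130UQ155PS200804300005",
    "44130UQ155PS201305130003",
    "44130UQ155PS201705220012",
]


def category_counts(values):
    """Count Unicode general categories over every character in `values`."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


counts = category_counts(SAMPLE_IDS)
total = sum(counts.values())
for category, count in counts.most_common():
    # Print each category with its share of all characters.
    print(f"{category}: {count} ({count / total:.1%})")
```

Each identifier has 20 digits and 4 uppercase letters, so the sample gives the same 83.3% / 16.7% split that the report shows for the full column.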

Unique

Unique: 247
Unique (%): 100.0%

Sample

1st row: 44130UQ155PS201207050098
2nd row: 44130UQ155PS200912020155
3rd row: 44130UQ155PS200804300005
4th row: 44130UQ155PS201305130003
5th row: 44130UQ155PS201705220012

Value                     Count  Frequency (%)
44130uq155ps201207050098  1      0.4%
44130uq155ps200403100001  1      0.4%
44130uq155ps200508220024  1      0.4%
44130uq155ps200812010852  1      0.4%
44130uq155ps200508220020  1      0.4%
44130uq155ps200201210001  1      0.4%
44130uq155ps200101020105  1      0.4%
44130uq155ps200408300002  1      0.4%
44130uq155ps199508070009  1      0.4%
44130uq155ps200501060033  1      0.4%
Other values (237)        237    96.0%

Most occurring characters

Value             Count  Frequency (%)
0                 1359   22.9%
1                 1018   17.2%
5                 589    9.9%
4                 578    9.8%
2                 471    7.9%
3                 369    6.2%
U                 247    4.2%
Q                 247    4.2%
P                 247    4.2%
S                 247    4.2%
Other values (4)  556    9.4%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    4940   83.3%
Uppercase Letter  988    16.7%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      1359   27.5%
1      1018   20.6%
5      589    11.9%
4      578    11.7%
2      471    9.5%
3      369    7.5%
9      205    4.1%
8      155    3.1%
6      117    2.4%
7      79     1.6%

Uppercase Letter
Value  Count  Frequency (%)
U      247    25.0%
Q      247    25.0%
P      247    25.0%
S      247    25.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  4940   83.3%
Latin   988    16.7%

Most frequent character per script

Common
Value  Count  Frequency (%)
0      1359   27.5%
1      1018   20.6%
5      589    11.9%
4      578    11.7%
2      471    9.5%
3      369    7.5%
9      205    4.1%
8      155    3.1%
6      117    2.4%
7      79     1.6%

Latin
Value  Count  Frequency (%)
U      247    25.0%
Q      247    25.0%
P      247    25.0%
S      247    25.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  5928   100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
0                 1359   22.9%
1                 1018   17.2%
5                 589    9.9%
4                 578    9.8%
2                 471    7.9%
3                 369    6.2%
U                 247    4.2%
Q                 247    4.2%
P                 247    4.2%
S                 247    4.2%
Other values (4)  556    9.4%

라벨명
Categorical

HIGH CORRELATION 

Distinct: 21
Distinct (%): 8.5%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

초등학교: 82
청사: 31
중학교: 29
고등학교: 19
대학: 17
Other values (16): 69

Length

Max length: 10
Median length: 9
Mean length: 4.0283401
Min length: 2

Unique

Unique: 7
Unique (%): 2.8%

Sample

1st row: 청사
2nd row: 청사
3rd row: 기타공공청사시설
4th row: 기타체육시설
5th row: 기타공공청사시설

Common Values

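Value                                              Count  Frequency (%)
초등학교 (elementary school)                       82     33.2%
청사 (government office building)                  31     12.6%
중학교 (middle school)                             29     11.7%
고등학교 (high school)                             19     7.7%
대학 (university)                                  17     6.9%
기타공공청사시설 (other public office facilities)  16     6.5%
기타체육시설 (other sports facilities)             12     4.9%
유치원 (kindergarten)                              10     4.0%
기타사회복지시설 (other social welfare facilities) 6      2.4%
골프장 (golf course)                               5      2.0%
Other values (11)                                  20     8.1%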
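
The Frequency (%) column is each count divided by the 247 observations. The sketch below recomputes it from the counts reported for 라벨명; the dictionary and the function name `frequency_table` are illustrative only and are not part of the profiler.

```python
# Counts copied from the common-values table for 라벨명.
LABEL_COUNTS = {
    "초등학교": 82,
    "청사": 31,
    "중학교": 29,
    "고등학교": 19,
    "대학": 17,
    "기타공공청사시설": 16,
    "기타체육시설": 12,
    "유치원": 10,
    "기타사회복지시설": 6,
    "골프장": 5,
    "Other values (11)": 20,
}


def frequency_table(counts):
    """Return each label's share of the total, as a percentage with one decimal."""
    total = sum(counts.values())
    return {label: round(100 * count / total, 1) for label, count in counts.items()}


frequencies = frequency_table(LABEL_COUNTS)
for label, share in frequencies.items():
    print(f"{label}: {LABEL_COUNTS[label]} ({share}%)")
```

The counts sum to 247, which matches the number of observations in the dataset statistics.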

Length

Figure: Histogram of lengths of the category

면적_도형
Real number (ℝ)

HIGH CORRELATION 

Distinct: 246
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 72809.745
Minimum: 305.7154
Maximum: 3919998.4
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 305.7154
5-th percentile: 770.13444
Q1: 9781.2296
Median: 14353.967
Q3: 23210.686
95-th percentile: 304137.63
Maximum: 3919998.4
Range: 3919692.7
Interquartile range (IQR): 13429.457

Descriptive statistics

Standard deviation: 294169.68
Coefficient of variation (CV): 4.0402514
Kurtosis: 121.59548
Mean: 72809.745
Median Absolute Deviation (MAD): 8004.6822
Skewness: 9.9677413
Sum: 17984007
Variance: 8.6535799 × 10^10
Monotonicity: Not monotonic
Figure: Histogram with fixed size bins (bins=50)

Common values
Value               Count  Frequency (%)
17839.10718         2      0.8%
1025.1455           1      0.4%
23102.74063         1      0.4%
1009.636408         1      0.4%
14343.28589         1      0.4%
13000.04134         1      0.4%
11450.57009         1      0.4%
112554.5547         1      0.4%
20768.42013         1      0.4%
11380.60423         1      0.4%
Other values (236)  236    95.5%

Minimum 10 values
Value        Count  Frequency (%)
305.7154     1      0.4%
330.501      1      0.4%
499.9924884  1      0.4%
514.6866562  1      0.4%
601.0478369  1      0.4%
633.8187161  1      0.4%
642.197883   1      0.4%
648.0224245  1      0.4%
659.8227109  1      0.4%
661.0320305  1      0.4%

Maximum 10 values
Value        Count  Frequency (%)
3919998.368  1      0.4%
1327253.691  1      0.4%
1055953.44   1      0.4%
1004631.345  1      0.4%
915933.9535  1      0.4%
630544.7423  1      0.4%
616651.2372  1      0.4%
569312.6613  1      0.4%
495802.9566  1      0.4%
416685.6011  1      0.4%

길이_도형
Real number (ℝ)

HIGH CORRELATION 

Distinct: 246
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 840.88066
Minimum: 69.966452
Maximum: 9340.7348
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 69.966452
5-th percentile: 120.78127
Q1: 419.81108
Median: 489.56347
Q3: 654.15704
95-th percentile: 3057.5071
Maximum: 9340.7348
Range: 9270.7684
Interquartile range (IQR): 234.34596

Descriptive statistics

Standard deviation: 1241.4961
Coefficient of variation (CV): 1.4764237
Kurtosis: 19.044525
Mean: 840.88066
Median Absolute Deviation (MAD): 139.69852
Skewness: 4.0713982
Sum: 207697.52
Variance: 1541312.7
Monotonicity: Not monotonic
Figure: Histogram with fixed size bins (bins=50)

Common values
Value               Count  Frequency (%)
585.0529872         2      0.8%
128.5852471         1      0.4%
624.0712333         1      0.4%
123.0369963         1      0.4%
473.0223642         1      0.4%
454.7660731         1      0.4%
424.0089786         1      0.4%
1653.134708         1      0.4%
617.198548          1      0.4%
432.7057954         1      0.4%
Other values (236)  236    95.5%

Minimum 10 values
Value        Count  Frequency (%)
69.9664521   1      0.4%
73.244       1      0.4%
88.00302857  1      0.4%
89.99049499  1      0.4%
99.42092725  1      0.4%
100.1945122  1      0.4%
101.5256762  1      0.4%
103.6556226  1      0.4%
104.2068757  1      0.4%
105.4202673  1      0.4%

Maximum 10 values
Value        Count  Frequency (%)
9340.734813  1      0.4%
8188.033474  1      0.4%
7133.198359  1      0.4%
6464.306315  1      0.4%
5831.966507  1      0.4%
5120.475608  1      0.4%
5095.434931  1      0.4%
4635.889327  1      0.4%
3984.266202  1      0.4%
3964.564051  1      0.4%

Interactions

Figure: interaction plots (4 panels)

Correlations

           라벨명  면적_도형  길이_도형
라벨명     1.000   0.792      0.788
면적_도형  0.792   1.000      0.988
길이_도형  0.788   0.988      1.000

           면적_도형  길이_도형  라벨명
면적_도형  1.000      0.987      0.520
길이_도형  0.987      1.000      0.424
라벨명     0.520      0.424      1.000

Missing values

Figure: A simple visualization of nullity by column.
Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows
     현황도형 관리번호         라벨명            면적_도형     길이_도형
0    44130UQ155PS201207050098  청사              1025.1455     128.585247
1    44130UQ155PS200912020155  청사              1000.0809     124.430471
2    44130UQ155PS200804300005  기타공공청사시설  2145.0996     184.704759
3    44130UQ155PS201305130003  기타체육시설      31553.62719   1011.930165
4    44130UQ155PS201705220012  기타공공청사시설  960.430303    127.835177
5    44130UQ155PS202006220066  초등학교          16627.7596    539.312073
6    44130UQ155PS200101020109  대학              170631.8382   2344.120852
7    44130UQ155PS200101180238  초등학교          14690.78066   575.00203
8    44130UQ155PS200101180243  초등학교          15864.935     508.607553
9    44130UQ155PS200101180241  특수학교          22036.8643    649.924223

Last rows
     현황도형 관리번호         라벨명            면적_도형     길이_도형
237  44130UQ155PS198706120073  초등학교          30482.34266   770.518525
238  44130UQ155PS198706120067  초등학교          25546.29221   651.244428
239  44130UQ155PS198706120066  초등학교          22608.49354   728.844884
240  44130UQ155PS200912020165  고등학교          15153.91765   488.775654
241  44130UQ155PS200912020163  중학교            14849.95335   490.003633
242  44130UQ155PS200912020160  초등학교          14442.4758    474.540408
243  44130UQ155PS200912020159  초등학교          14849.98355   490.163343
244  44130UQ155PS201505060910  유치원            3103.05935    221.79243
245  44130UQ155PS201505060909  유치원            3205.0925     230.458
246  44130UQ155PS200912020158  기타사회복지시설  305.7154      69.966452