Overview

Dataset statistics

Number of variables7
Number of observations108
Missing cells22
Missing cells (%)2.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.2 KiB
Average record size in memory59.2 B

Variable types

Categorical4
Text1
Numeric2

Alerts

데이터기준일자 has constant value ""Constant
정제WGS84위도 is highly overall correlated with 시군명High correlation
정제WGS84경도 is highly overall correlated with 시군명High correlation
시군명 is highly overall correlated with 정제WGS84위도 and 1 other fieldsHigh correlation
유형 is highly overall correlated with 용도High correlation
용도 is highly overall correlated with 유형High correlation
정제WGS84위도 has 11 (10.2%) missing valuesMissing
정제WGS84경도 has 11 (10.2%) missing valuesMissing
명소명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 21:51:02.165739
Analysis finished2023-12-10 21:51:03.017433
Duration0.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct24
Distinct (%)22.2%
Missing0
Missing (%)0.0%
Memory size996.0 B
고양시
12 
광주시
12 
여주시
11 
남양주시
수원시
Other values (19)
57 

Length

Max length4
Median length3
Mean length3.0833333
Min length3

Unique

Unique6 ?
Unique (%)5.6%

Sample

1st row하남시
2nd row화성시
3rd row고양시
4th row고양시
5th row고양시

Common Values

ValueCountFrequency (%)
고양시 12
 
11.1%
광주시 12
 
11.1%
여주시 11
 
10.2%
남양주시 9
 
8.3%
수원시 7
 
6.5%
파주시 7
 
6.5%
김포시 6
 
5.6%
화성시 6
 
5.6%
용인시 4
 
3.7%
과천시 4
 
3.7%
Other values (14) 30
27.8%

Length

2023-12-11T06:51:03.081469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
고양시 12
 
11.1%
광주시 12
 
11.1%
여주시 11
 
10.2%
남양주시 9
 
8.3%
수원시 7
 
6.5%
파주시 7
 
6.5%
김포시 6
 
5.6%
화성시 6
 
5.6%
용인시 4
 
3.7%
과천시 4
 
3.7%
Other values (14) 30
27.8%

명소명
Text

UNIQUE 

Distinct108
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size996.0 B
2023-12-11T06:51:03.355612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length13
Mean length7.6018519
Min length2

Characters and Unicode

Total characters821
Distinct characters243
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)100.0%

Sample

1st row하남춘궁동 동사지
2nd row남양향교
3rd row고양 벽제관지
4th row북한산성_중흥사지
5th row월산대군사당(月山大君祠堂)
ValueCountFrequency (%)
고택 2
 
1.4%
고가 2
 
1.4%
양주 2
 
1.4%
가옥 2
 
1.4%
종택 2
 
1.4%
동구릉(東九陵 1
 
0.7%
남한산성_현절사(顯節祠 1
 
0.7%
남한산성_침괘정(枕戈亭 1
 
0.7%
남한산성_수어장대(守禦將臺 1
 
0.7%
남한산성_청량당(淸凉堂 1
 
0.7%
Other values (125) 125
89.3%
2023-12-11T06:51:03.767913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 38
 
4.6%
) 38
 
4.6%
35
 
4.3%
32
 
3.9%
29
 
3.5%
27
 
3.3%
_ 24
 
2.9%
18
 
2.2%
18
 
2.2%
17
 
2.1%
Other values (233) 545
66.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 685
83.4%
Open Punctuation 38
 
4.6%
Close Punctuation 38
 
4.6%
Space Separator 32
 
3.9%
Connector Punctuation 24
 
2.9%
Other Punctuation 3
 
0.4%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
5.1%
29
 
4.2%
27
 
3.9%
18
 
2.6%
18
 
2.6%
17
 
2.5%
16
 
2.3%
16
 
2.3%
15
 
2.2%
14
 
2.0%
Other values (227) 480
70.1%
Open Punctuation
ValueCountFrequency (%)
( 38
100.0%
Close Punctuation
ValueCountFrequency (%)
) 38
100.0%
Space Separator
ValueCountFrequency (%)
32
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 24
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 568
69.2%
Common 136
 
16.6%
Han 117
 
14.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
6.2%
29
 
5.1%
27
 
4.8%
18
 
3.2%
18
 
3.2%
17
 
3.0%
16
 
2.8%
16
 
2.8%
15
 
2.6%
14
 
2.5%
Other values (144) 363
63.9%
Han
ValueCountFrequency (%)
12
 
10.3%
3
 
2.6%
西 3
 
2.6%
3
 
2.6%
3
 
2.6%
3
 
2.6%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
Other values (73) 82
70.1%
Common
ValueCountFrequency (%)
( 38
27.9%
) 38
27.9%
32
23.5%
_ 24
17.6%
, 3
 
2.2%
3 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 568
69.2%
ASCII 136
 
16.6%
CJK 113
 
13.8%
CJK Compat Ideographs 4
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 38
27.9%
) 38
27.9%
32
23.5%
_ 24
17.6%
, 3
 
2.2%
3 1
 
0.7%
Hangul
ValueCountFrequency (%)
35
 
6.2%
29
 
5.1%
27
 
4.8%
18
 
3.2%
18
 
3.2%
17
 
3.0%
16
 
2.8%
16
 
2.8%
15
 
2.6%
14
 
2.5%
Other values (144) 363
63.9%
CJK
ValueCountFrequency (%)
12
 
10.6%
3
 
2.7%
西 3
 
2.7%
3
 
2.7%
3
 
2.7%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
Other values (71) 79
69.9%
CJK Compat Ideographs
ValueCountFrequency (%)
3
75.0%
1
 
25.0%

유형
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size996.0 B
공간환경
54 
건축물
54 

Length

Max length4
Median length3.5
Mean length3.5
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공간환경
2nd row건축물
3rd row공간환경
4th row공간환경
5th row건축물

Common Values

ValueCountFrequency (%)
공간환경 54
50.0%
건축물 54
50.0%

Length

2023-12-11T06:51:03.929500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:51:04.018924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공간환경 54
50.0%
건축물 54
50.0%

용도
Categorical

HIGH CORRELATION 

Distinct18
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size996.0 B
사찰
20 
전통가옥
16 
15 
향교
13 
성곽
11 
Other values (13)
33 

Length

Max length8
Median length2
Mean length2.4444444
Min length1

Unique

Unique6 ?
Unique (%)5.6%

Sample

1st row사찰
2nd row향교
3rd row주거건물지
4th row사찰
5th row사당

Common Values

ValueCountFrequency (%)
사찰 20
18.5%
전통가옥 16
14.8%
15
13.9%
향교 13
12.0%
성곽 11
10.2%
서원 7
 
6.5%
누정 6
 
5.6%
사당 4
 
3.7%
행궁 3
 
2.8%
관아 3
 
2.8%
Other values (8) 10
9.3%

Length

2023-12-11T06:51:04.112231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
사찰 20
18.5%
전통가옥 16
14.8%
15
13.9%
향교 13
12.0%
성곽 11
10.2%
서원 7
 
6.5%
누정 6
 
5.6%
사당 4
 
3.7%
관아 3
 
2.8%
행궁 3
 
2.8%
Other values (8) 10
9.3%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size996.0 B
2021-02-19
108 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-02-19
2nd row2021-02-19
3rd row2021-02-19
4th row2021-02-19
5th row2021-02-19

Common Values

ValueCountFrequency (%)
2021-02-19 108
100.0%

Length

2023-12-11T06:51:04.218755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:51:04.296103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-02-19 108
100.0%

정제WGS84위도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct96
Distinct (%)99.0%
Missing11
Missing (%)10.2%
Infinite0
Infinite (%)0.0%
Mean37.484088
Minimum36.939795
Maximum38.023713
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-11T06:51:04.393073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.939795
5-th percentile37.034225
Q137.288543
median37.478856
Q337.65887
95-th percentile37.871373
Maximum38.023713
Range1.0839178
Interquartile range (IQR)0.37032666

Descriptive statistics

Standard deviation0.2435382
Coefficient of variation (CV)0.0064971089
Kurtosis-0.44655819
Mean37.484088
Median Absolute Deviation (MAD)0.18972632
Skewness-0.13414259
Sum3635.9565
Variance0.059310855
MonotonicityNot monotonic
2023-12-11T06:51:04.524783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.2596759342 2
 
1.9%
37.6454472747 1
 
0.9%
37.6851651996 1
 
0.9%
37.7390088945 1
 
0.9%
37.7169565963 1
 
0.9%
37.6246481239 1
 
0.9%
37.62525541 1
 
0.9%
37.4788559698 1
 
0.9%
37.4797463077 1
 
0.9%
37.4796557605 1
 
0.9%
Other values (86) 86
79.6%
(Missing) 11
 
10.2%
ValueCountFrequency (%)
36.9397947684 1
0.9%
36.9644789638 1
0.9%
36.9648846827 1
0.9%
37.0138868798 1
0.9%
37.0263280823 1
0.9%
37.0361987088 1
0.9%
37.1331141093 1
0.9%
37.1451271205 1
0.9%
37.1522505766 1
0.9%
37.1629511234 1
0.9%
ValueCountFrequency (%)
38.023712603 1
0.9%
37.9911199573 1
0.9%
37.9855113305 1
0.9%
37.8897027254 1
0.9%
37.8876773885 1
0.9%
37.8672973033 1
0.9%
37.7998048272 1
0.9%
37.7856760794 1
0.9%
37.7711844771 1
0.9%
37.7537165759 1
0.9%

정제WGS84경도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct96
Distinct (%)99.0%
Missing11
Missing (%)10.2%
Infinite0
Infinite (%)0.0%
Mean127.10293
Minimum126.53887
Maximum127.65543
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-11T06:51:04.664040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.53887
5-th percentile126.70786
Q1126.95052
median127.09901
Q3127.19854
95-th percentile127.61351
Maximum127.65543
Range1.1165554
Interquartile range (IQR)0.24801814

Descriptive statistics

Standard deviation0.25566751
Coefficient of variation (CV)0.0020114997
Kurtosis-0.13248577
Mean127.10293
Median Absolute Deviation (MAD)0.13412356
Skewness0.28478864
Sum12328.984
Variance0.065365877
MonotonicityNot monotonic
2023-12-11T06:51:04.789757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.6554257107 2
 
1.9%
127.1984176852 1
 
0.9%
127.099008921 1
 
0.9%
126.538870329 1
 
0.9%
126.5501928022 1
 
0.9%
126.7095635949 1
 
0.9%
127.1420093016 1
 
0.9%
127.1811261788 1
 
0.9%
127.176176382 1
 
0.9%
127.1764814274 1
 
0.9%
Other values (86) 86
79.6%
(Missing) 11
 
10.2%
ValueCountFrequency (%)
126.538870329 1
0.9%
126.5501928022 1
0.9%
126.6292288831 1
0.9%
126.6943578432 1
0.9%
126.7010381519 1
0.9%
126.7095635949 1
0.9%
126.7110037568 1
0.9%
126.7132581753 1
0.9%
126.8003555625 1
0.9%
126.80549435 1
0.9%
ValueCountFrequency (%)
127.6554257107 2
1.9%
127.6524637426 1
0.9%
127.6333132791 1
0.9%
127.6287596217 1
0.9%
127.6096988467 1
0.9%
127.5725345535 1
0.9%
127.5572029298 1
0.9%
127.5293096634 1
0.9%
127.4576052824 1
0.9%
127.4505932728 1
0.9%

Interactions

2023-12-11T06:51:02.622057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:51:02.465542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:51:02.694591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:51:02.540859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:51:04.892930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명유형용도정제WGS84위도정제WGS84경도
시군명1.0000.6530.6170.9520.904
유형0.6531.0000.9160.5950.280
용도0.6170.9161.0000.0000.000
정제WGS84위도0.9520.5950.0001.0000.602
정제WGS84경도0.9040.2800.0000.6021.000
2023-12-11T06:51:04.989566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명유형용도
시군명1.0000.4660.204
유형0.4661.0000.719
용도0.2040.7191.000
2023-12-11T06:51:05.067301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정제WGS84위도정제WGS84경도시군명유형용도
정제WGS84위도1.000-0.3240.6990.4390.000
정제WGS84경도-0.3241.0000.5710.2030.000
시군명0.6990.5711.0000.4660.204
유형0.4390.2030.4661.0000.719
용도0.0000.0000.2040.7191.000

Missing values

2023-12-11T06:51:02.793748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:51:02.895610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T06:51:02.976424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명명소명유형용도데이터기준일자정제WGS84위도정제WGS84경도
0하남시하남춘궁동 동사지공간환경사찰2021-02-1937.518175127.186077
1화성시남양향교건축물향교2021-02-1937.204403126.800356
2고양시고양 벽제관지공간환경주거건물지2021-02-1937.70456126.900945
3고양시북한산성_중흥사지공간환경사찰2021-02-1937.645856126.976627
4고양시월산대군사당(月山大君祠堂)건축물사당2021-02-1937.676017126.872842
5고양시공양왕릉(恭讓王陵)공간환경2021-02-1937.680354126.839098
6고양시행주산성_행주서원지공간환경서원2021-02-1937.599604126.818626
7과천시연주대_연주암(3층석탑)공간환경사찰2021-02-1937.441846126.964882
8과천시연주대_관악사지공간환경사찰2021-02-1937.44358126.965502
9광주시남한산성_망월사지공간환경사찰2021-02-1937.478316127.196197
시군명명소명유형용도데이터기준일자정제WGS84위도정제WGS84경도
98용인시전음애이자 고택건축물전통가옥2021-02-1937.238481127.134297
99용인시심곡서원건축물서원2021-02-1937.305767127.076625
100의왕시임영대군이구묘역및사당공간환경사당및묘역2021-02-1937.375624126.995995
101의정부서계 박세당 사랑채,종가,사당건축물전통가옥2021-02-1937.700286127.057568
102이천시설성산성공간환경성곽2021-02-1937.145127127.557203
103파주시보광사건축물사찰2021-02-1937.753717126.920471
104파주시자운서원건축물서원2021-02-1937.867297126.872006
105파주시장릉(長陵)공간환경2021-02-1937.771184126.711004
106평택시팽성읍 객사건축물관아2021-02-1936.964885127.063287
107포천시포천향교(抱川鄕校)건축물향교2021-02-1937.889703127.221343