Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells20008
Missing cells (%)33.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory556.6 KiB
Average record size in memory57.0 B

Variable types

Numeric1
Text3
Categorical2

Dataset

Description춘천도시공사에서 관리하는 안식원, 안식공원 시설에서 제공하는 데이터(코드, 안치장소, 안치유형, 안치구분)
Author춘천도시공사
URLhttps://www.data.go.kr/data/15094407/fileData.do

Alerts

Unnamed: 5 has constant value ""Constant
안치유형 is highly overall correlated with 안치구분High correlation
안치구분 is highly overall correlated with 안치유형High correlation
안치유형 is highly imbalanced (51.7%)Imbalance
Unnamed: 4 has 9994 (99.9%) missing valuesMissing
Unnamed: 5 has 9998 (> 99.9%) missing valuesMissing
코드 has unique valuesUnique

Reproduction

Analysis started2024-04-17 10:58:35.575908
Analysis finished2024-04-17 10:58:36.412172
Duration0.84 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

코드
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7310.1429
Minimum3
Maximum14116
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-17T19:58:36.464631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile586.95
Q13357.75
median7822.5
Q311069.25
95-th percentile13509.05
Maximum14116
Range14113
Interquartile range (IQR)7711.5

Descriptive statistics

Standard deviation4264.3461
Coefficient of variation (CV)0.58334648
Kurtosis-1.2759173
Mean7310.1429
Median Absolute Deviation (MAD)3755
Skewness-0.14847872
Sum73101429
Variance18184648
MonotonicityNot monotonic
2024-04-17T19:58:36.566497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11091 1
 
< 0.1%
6998 1
 
< 0.1%
11387 1
 
< 0.1%
10353 1
 
< 0.1%
11572 1
 
< 0.1%
4649 1
 
< 0.1%
11547 1
 
< 0.1%
8091 1
 
< 0.1%
13826 1
 
< 0.1%
10770 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
15 1
< 0.1%
ValueCountFrequency (%)
14116 1
< 0.1%
14114 1
< 0.1%
14113 1
< 0.1%
14112 1
< 0.1%
14111 1
< 0.1%
14110 1
< 0.1%
14109 1
< 0.1%
14108 1
< 0.1%
14107 1
< 0.1%
14106 1
< 0.1%
Distinct9814
Distinct (%)98.3%
Missing16
Missing (%)0.2%
Memory size156.2 KiB
2024-04-17T19:58:36.863518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length6.3390425
Min length1

Characters and Unicode

Total characters63289
Distinct characters37
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9654 ?
Unique (%)96.7%

Sample

1st row22-298
2nd row4-12-A45
3rd row26-192
4th row19-51
5th row5.3.A3
ValueCountFrequency (%)
10월 35
 
0.3%
09월 34
 
0.3%
11월 33
 
0.3%
03월 30
 
0.3%
05월 29
 
0.3%
07월 29
 
0.3%
06월 28
 
0.3%
08월 28
 
0.3%
02월 26
 
0.3%
04월 25
 
0.2%
Other values (9574) 10035
97.1%
2024-04-17T19:58:37.270106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 10508
16.6%
2 8724
13.8%
- 8664
13.7%
. 5660
8.9%
3 4976
7.9%
4 4047
 
6.4%
5 3529
 
5.6%
6 2902
 
4.6%
7 2646
 
4.2%
0 2502
 
4.0%
Other values (27) 9131
14.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 44317
70.0%
Dash Punctuation 8664
 
13.7%
Other Punctuation 5660
 
8.9%
Uppercase Letter 3590
 
5.7%
Other Letter 693
 
1.1%
Space Separator 352
 
0.6%
Lowercase Letter 11
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
338
48.8%
338
48.8%
2
 
0.3%
2
 
0.3%
2
 
0.3%
2
 
0.3%
1
 
0.1%
1
 
0.1%
1
 
0.1%
1
 
0.1%
Other values (5) 5
 
0.7%
Decimal Number
ValueCountFrequency (%)
1 10508
23.7%
2 8724
19.7%
3 4976
11.2%
4 4047
 
9.1%
5 3529
 
8.0%
6 2902
 
6.5%
7 2646
 
6.0%
0 2502
 
5.6%
8 2405
 
5.4%
9 2078
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
A 2122
59.1%
B 845
 
23.5%
C 343
 
9.6%
D 280
 
7.8%
Lowercase Letter
ValueCountFrequency (%)
c 6
54.5%
b 4
36.4%
a 1
 
9.1%
Math Symbol
ValueCountFrequency (%)
+ 1
50.0%
~ 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 8664
100.0%
Other Punctuation
ValueCountFrequency (%)
. 5660
100.0%
Space Separator
ValueCountFrequency (%)
352
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 58995
93.2%
Latin 3601
 
5.7%
Hangul 693
 
1.1%

Most frequent character per script

Common
ValueCountFrequency (%)
1 10508
17.8%
2 8724
14.8%
- 8664
14.7%
. 5660
9.6%
3 4976
8.4%
4 4047
 
6.9%
5 3529
 
6.0%
6 2902
 
4.9%
7 2646
 
4.5%
0 2502
 
4.2%
Other values (5) 4837
8.2%
Hangul
ValueCountFrequency (%)
338
48.8%
338
48.8%
2
 
0.3%
2
 
0.3%
2
 
0.3%
2
 
0.3%
1
 
0.1%
1
 
0.1%
1
 
0.1%
1
 
0.1%
Other values (5) 5
 
0.7%
Latin
ValueCountFrequency (%)
A 2122
58.9%
B 845
 
23.5%
C 343
 
9.5%
D 280
 
7.8%
c 6
 
0.2%
b 4
 
0.1%
a 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 62596
98.9%
Hangul 692
 
1.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 10508
16.8%
2 8724
13.9%
- 8664
13.8%
. 5660
9.0%
3 4976
7.9%
4 4047
 
6.5%
5 3529
 
5.6%
6 2902
 
4.6%
7 2646
 
4.2%
0 2502
 
4.0%
Other values (12) 8438
13.5%
Hangul
ValueCountFrequency (%)
338
48.8%
338
48.8%
2
 
0.3%
2
 
0.3%
2
 
0.3%
2
 
0.3%
1
 
0.1%
1
 
0.1%
1
 
0.1%
1
 
0.1%
Other values (4) 4
 
0.6%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

안치유형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
봉안당
6366 
잔디장
1432 
매장묘
1271 
봉안묘
907 
<NA>
 
20
Other values (4)
 
4

Length

Max length5
Median length3
Mean length3.002
Min length1

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row봉안당
2nd row봉안묘
3rd row봉안당
4th row봉안당
5th row매장묘

Common Values

ValueCountFrequency (%)
봉안당 6366
63.7%
잔디장 1432
 
14.3%
매장묘 1271
 
12.7%
봉안묘 907
 
9.1%
<NA> 20
 
0.2%
7 1
 
< 0.1%
10.A8 1
 
< 0.1%
10 1
 
< 0.1%
B39. 1
 
< 0.1%

Length

2024-04-17T19:58:37.388358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T19:58:37.486306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
봉안당 6366
63.7%
잔디장 1432
 
14.3%
매장묘 1271
 
12.7%
봉안묘 907
 
9.1%
na 20
 
0.2%
7 1
 
< 0.1%
10.a8 1
 
< 0.1%
10 1
 
< 0.1%
b39 1
 
< 0.1%

안치구분
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
전체
5139 
합장
1600 
부부단
1266 
단장
1127 
납골묘
841 
Other values (5)
 
27

Length

Max length4
Median length2
Mean length2.2156
Min length2

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row전체
2nd row전체
3rd row전체
4th row부부단
5th row합장

Common Values

ValueCountFrequency (%)
전체 5139
51.4%
합장 1600
 
16.0%
부부단 1266
 
12.7%
단장 1127
 
11.3%
납골묘 841
 
8.4%
<NA> 21
 
0.2%
매장묘 3
 
< 0.1%
A47. 1
 
< 0.1%
A24 1
 
< 0.1%
잔디장 1
 
< 0.1%

Length

2024-04-17T19:58:37.597842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T19:58:37.695345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전체 5139
51.4%
합장 1600
 
16.0%
부부단 1266
 
12.7%
단장 1127
 
11.3%
납골묘 841
 
8.4%
na 21
 
0.2%
매장묘 3
 
< 0.1%
a47 1
 
< 0.1%
a24 1
 
< 0.1%
잔디장 1
 
< 0.1%

Unnamed: 4
Text

MISSING 

Distinct3
Distinct (%)50.0%
Missing9994
Missing (%)99.9%
Memory size156.2 KiB
2024-04-17T19:58:37.816964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length2
Mean length2.3333333
Min length2

Characters and Unicode

Total characters14
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row합장
2nd row단장
3rd row매장묘
4th row합장
5th row매장묘
ValueCountFrequency (%)
합장 2
33.3%
단장 2
33.3%
매장묘 2
33.3%
2024-04-17T19:58:38.026428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6
42.9%
2
 
14.3%
2
 
14.3%
2
 
14.3%
2
 
14.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
42.9%
2
 
14.3%
2
 
14.3%
2
 
14.3%
2
 
14.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
42.9%
2
 
14.3%
2
 
14.3%
2
 
14.3%
2
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6
42.9%
2
 
14.3%
2
 
14.3%
2
 
14.3%
2
 
14.3%

Unnamed: 5
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)50.0%
Missing9998
Missing (%)> 99.9%
Memory size156.2 KiB
2024-04-17T19:58:38.097700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters4
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row합장
2nd row합장
ValueCountFrequency (%)
합장 2
100.0%
2024-04-17T19:58:38.253734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2
50.0%
2
50.0%

Interactions

2024-04-17T19:58:36.098789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T19:58:38.326717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
코드안치유형안치구분Unnamed: 4
코드1.0000.6980.6370.193
안치유형0.6981.0000.9741.000
안치구분0.6370.9741.0000.568
Unnamed: 40.1931.0000.5681.000
2024-04-17T19:58:38.406431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
안치유형안치구분
안치유형1.0000.920
안치구분0.9201.000
2024-04-17T19:58:38.477091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
코드안치유형안치구분
코드1.0000.4280.355
안치유형0.4281.0000.920
안치구분0.3550.9201.000

Missing values

2024-04-17T19:58:36.192865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T19:58:36.272891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-17T19:58:36.357157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

코드안치장소안치유형안치구분Unnamed: 4Unnamed: 5
87331109122-298봉안당전체<NA><NA>
8895112564-12-A45봉안묘전체<NA><NA>
4937672726-192봉안당전체<NA><NA>
86291097919-51봉안당부부단<NA><NA>
4474585.3.A3매장묘합장<NA><NA>
7132930524-368봉안당전체<NA><NA>
470163448-348봉안당전체<NA><NA>
1907199110-76봉안당부부단<NA><NA>
86241097322-250봉안당전체<NA><NA>
1619165412-171봉안당부부단<NA><NA>
코드안치장소안치유형안치구분Unnamed: 4Unnamed: 5
10609130661-22-D6잔디장합장<NA><NA>
2236235401월 14일봉안당전체<NA><NA>
425255447-240봉안당전체<NA><NA>
8349106644-11-A3봉안묘납골묘<NA><NA>
569177282.7.A9.봉안묘납골묘<NA><NA>
5306729925-166봉안당전체<NA><NA>
236525651-225봉안당전체<NA><NA>
7294948413-498봉안당전체<NA><NA>
289133533-180봉안당전체<NA><NA>
2503277402월 02일봉안당전체<NA><NA>