Overview

Dataset statistics

Number of variables13
Number of observations10000
Missing cells26134
Missing cells (%)20.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 MiB
Average record size in memory114.0 B

Variable types

Text2
Categorical7
DateTime3
Boolean1

Dataset

Description부산시설공단_영락공원묘지사용현황_20220125
Author부산시설공단
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15067559

Alerts

개장여부 has constant value ""Constant
순번 is highly overall correlated with 구분 and 1 other fieldsHigh correlation
구분 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
매장종류 is highly overall correlated with 묘지수납구분High correlation
감면구분 is highly overall correlated with 묘지수납구분High correlation
묘지수납구분 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
순번 is highly imbalanced (80.3%)Imbalance
시도구분 is highly imbalanced (76.2%)Imbalance
매장종류 is highly imbalanced (81.1%)Imbalance
감면구분 is highly imbalanced (86.6%)Imbalance
묘지수납구분 is highly imbalanced (79.3%)Imbalance
사용료 is highly imbalanced (98.4%)Imbalance
만료일자 has 9374 (93.7%) missing valuesMissing
개장여부 has 8348 (83.5%) missing valuesMissing
개장일자 has 8348 (83.5%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:43:27.362888
Analysis finished2023-12-10 16:43:29.054514
Duration1.69 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9816
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T01:43:29.398238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length15
Mean length15
Min length15

Characters and Unicode

Total characters150000
Distinct characters16
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9634 ?
Unique (%)96.3%

Sample

1st row06묘원 32블럭 0668호
2nd row07묘원 37블럭 0032호
3rd row07묘원 36블럭 0669호
4th row05묘원 30블럭 0507호
5th row05묘원 31블럭 0224호
ValueCountFrequency (%)
01묘원 1424
 
4.7%
14묘원 987
 
3.3%
11묘원 978
 
3.3%
07묘원 885
 
2.9%
08묘원 880
 
2.9%
02묘원 728
 
2.4%
03묘원 658
 
2.2%
09묘원 603
 
2.0%
13묘원 594
 
2.0%
04묘원 563
 
1.9%
Other values (950) 21700
72.3%
2023-12-11T01:43:29.876467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 23561
15.7%
20000
13.3%
1 12165
8.1%
10000
 
6.7%
10000
 
6.7%
10000
 
6.7%
10000
 
6.7%
10000
 
6.7%
3 7714
 
5.1%
2 7365
 
4.9%
Other values (6) 29195
19.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 80000
53.3%
Other Letter 50000
33.3%
Space Separator 20000
 
13.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 23561
29.5%
1 12165
15.2%
3 7714
 
9.6%
2 7365
 
9.2%
4 7118
 
8.9%
5 6209
 
7.8%
6 4958
 
6.2%
7 3881
 
4.9%
8 3732
 
4.7%
9 3297
 
4.1%
Other Letter
ValueCountFrequency (%)
10000
20.0%
10000
20.0%
10000
20.0%
10000
20.0%
10000
20.0%
Space Separator
ValueCountFrequency (%)
20000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 100000
66.7%
Hangul 50000
33.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 23561
23.6%
20000
20.0%
1 12165
12.2%
3 7714
 
7.7%
2 7365
 
7.4%
4 7118
 
7.1%
5 6209
 
6.2%
6 4958
 
5.0%
7 3881
 
3.9%
8 3732
 
3.7%
Hangul
ValueCountFrequency (%)
10000
20.0%
10000
20.0%
10000
20.0%
10000
20.0%
10000
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100000
66.7%
Hangul 50000
33.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 23561
23.6%
20000
20.0%
1 12165
12.2%
3 7714
 
7.7%
2 7365
 
7.4%
4 7118
 
7.1%
5 6209
 
6.2%
6 4958
 
5.0%
7 3881
 
3.9%
8 3732
 
3.7%
Hangul
ValueCountFrequency (%)
10000
20.0%
10000
20.0%
10000
20.0%
10000
20.0%
10000
20.0%

순번
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
9464 
2
 
520
3
 
16

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 9464
94.6%
2 520
 
5.2%
3 16
 
0.2%

Length

2023-12-11T01:43:30.051822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:30.171522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 9464
94.6%
2 520
 
5.2%
3 16
 
0.2%
Distinct5388
Distinct (%)54.2%
Missing64
Missing (%)0.6%
Memory size156.2 KiB
2023-12-11T01:43:30.472534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters99360
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2949 ?
Unique (%)29.7%

Sample

1st row1974-05-29
2nd row1982-05-09
3rd row1975-07-28
4th row1952-02-02
5th row1973-12-22
ValueCountFrequency (%)
1901-01-01 57
 
0.6%
1927-08-27 15
 
0.2%
1979-04-05 9
 
0.1%
1981-03-09 8
 
0.1%
1981-02-13 7
 
0.1%
1977-12-11 7
 
0.1%
1977-12-29 7
 
0.1%
1978-01-30 7
 
0.1%
1974-05-29 7
 
0.1%
1980-04-30 7
 
0.1%
Other values (5378) 9805
98.7%
2023-12-11T01:43:30.899615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 19872
20.0%
1 19738
19.9%
0 14760
14.9%
9 12494
12.6%
7 8731
8.8%
2 7220
 
7.3%
8 4483
 
4.5%
6 3245
 
3.3%
3 3197
 
3.2%
5 2882
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 79488
80.0%
Dash Punctuation 19872
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 19738
24.8%
0 14760
18.6%
9 12494
15.7%
7 8731
11.0%
2 7220
 
9.1%
8 4483
 
5.6%
6 3245
 
4.1%
3 3197
 
4.0%
5 2882
 
3.6%
4 2738
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 19872
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 99360
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 19872
20.0%
1 19738
19.9%
0 14760
14.9%
9 12494
12.6%
7 8731
8.8%
2 7220
 
7.3%
8 4483
 
4.5%
6 3245
 
3.3%
3 3197
 
3.2%
5 2882
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 99360
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 19872
20.0%
1 19738
19.9%
0 14760
14.9%
9 12494
12.6%
7 8731
8.8%
2 7220
 
7.3%
8 4483
 
4.5%
6 3245
 
3.3%
3 3197
 
3.2%
5 2882
 
2.9%
Distinct5137
Distinct (%)51.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1967-09-11 00:00:00
Maximum2020-09-14 00:00:00
2023-12-11T01:43:31.076492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:31.223848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

만료일자
Date

MISSING 

Distinct600
Distinct (%)95.8%
Missing9374
Missing (%)93.7%
Memory size156.2 KiB
Minimum2023-05-09 00:00:00
Maximum2050-09-13 00:00:00
2023-12-11T01:43:31.349270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:31.491103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
신규
7837 
개장
1652 
재사용
 
511

Length

Max length3
Median length2
Mean length2.0511
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신규
2nd row신규
3rd row신규
4th row신규
5th row신규

Common Values

ValueCountFrequency (%)
신규 7837
78.4%
개장 1652
 
16.5%
재사용 511
 
5.1%

Length

2023-12-11T01:43:31.622955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:31.736224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신규 7837
78.4%
개장 1652
 
16.5%
재사용 511
 
5.1%

개장여부
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)0.1%
Missing8348
Missing (%)83.5%
Memory size97.7 KiB
True
1652 
(Missing)
8348 
ValueCountFrequency (%)
True 1652
 
16.5%
(Missing) 8348
83.5%
2023-12-11T01:43:31.828013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

개장일자
Date

MISSING 

Distinct1264
Distinct (%)76.5%
Missing8348
Missing (%)83.5%
Memory size156.2 KiB
Minimum1900-01-01 00:00:00
Maximum2020-12-31 00:00:00
2023-12-11T01:43:31.933766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:32.080527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

시도구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
자시
9418 
인접
 
326
타시
 
256

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인접
2nd row자시
3rd row자시
4th row자시
5th row자시

Common Values

ValueCountFrequency (%)
자시 9418
94.2%
인접 326
 
3.3%
타시 256
 
2.6%

Length

2023-12-11T01:43:32.195920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:32.282784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자시 9418
94.2%
인접 326
 
3.3%
타시 256
 
2.6%

매장종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
시체
9364 
개장유골
 
374
부부합장
 
145
화장유골
 
92
유골합장
 
25

Length

Max length4
Median length2
Mean length2.1272
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row시체
2nd row시체
3rd row시체
4th row개장유골
5th row시체

Common Values

ValueCountFrequency (%)
시체 9364
93.6%
개장유골 374
 
3.7%
부부합장 145
 
1.5%
화장유골 92
 
0.9%
유골합장 25
 
0.2%

Length

2023-12-11T01:43:32.397652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:32.498382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
시체 9364
93.6%
개장유골 374
 
3.7%
부부합장 145
 
1.5%
화장유골 92
 
0.9%
유골합장 25
 
0.2%

감면구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반
9444 
기타면제
 
414
사전예매
 
95
생활수급
 
24
지역주민
 
15
Other values (2)
 
8

Length

Max length4
Median length2
Mean length2.1112
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
일반 9444
94.4%
기타면제 414
 
4.1%
사전예매 95
 
0.9%
생활수급 24
 
0.2%
지역주민 15
 
0.1%
참전유공 5
 
0.1%
국가유공 3
 
< 0.1%

Length

2023-12-11T01:43:32.607768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:32.712395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 9444
94.4%
기타면제 414
 
4.1%
사전예매 95
 
0.9%
생활수급 24
 
0.2%
지역주민 15
 
0.1%
참전유공 5
 
< 0.1%
국가유공 3
 
< 0.1%

묘지수납구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영구(신규불가)
9403 
최초(15년)
 
595
<NA>
 
2

Length

Max length8
Median length8
Mean length7.9397
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영구(신규불가)
2nd row영구(신규불가)
3rd row영구(신규불가)
4th row영구(신규불가)
5th row영구(신규불가)

Common Values

ValueCountFrequency (%)
영구(신규불가) 9403
94.0%
최초(15년) 595
 
5.9%
<NA> 2
 
< 0.1%

Length

2023-12-11T01:43:32.818992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:32.912936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영구(신규불가 9403
94.0%
최초(15년 595
 
5.9%
na 2
 
< 0.1%

사용료
Categorical

IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
300000
9969 
600000
 
26
0
 
3
<NA>
 
2

Length

Max length6
Median length6
Mean length5.9981
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row300000
2nd row300000
3rd row300000
4th row300000
5th row300000

Common Values

ValueCountFrequency (%)
300000 9969
99.7%
600000 26
 
0.3%
0 3
 
< 0.1%
<NA> 2
 
< 0.1%

Length

2023-12-11T01:43:32.992804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:33.071091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
300000 9969
99.7%
600000 26
 
0.3%
0 3
 
< 0.1%
na 2
 
< 0.1%

Correlations

2023-12-11T01:43:33.127727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구분시도구분매장종류감면구분묘지수납구분사용료
순번1.0000.9360.0000.4680.4200.5210.279
구분0.9361.0000.0280.4510.4200.5090.286
시도구분0.0000.0281.0000.0470.0900.0160.360
매장종류0.4680.4510.0471.0000.2870.4630.150
감면구분0.4200.4200.0900.2871.0000.5480.295
묘지수납구분0.5210.5090.0160.4630.5481.0000.077
사용료0.2790.2860.3600.1500.2950.0771.000
2023-12-11T01:43:33.434265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도구분사용료구분매장종류순번묘지수납구분감면구분
시도구분1.0000.1270.0080.0350.0000.0270.060
사용료0.1271.0000.0950.1130.0920.1280.207
구분0.0080.0951.0000.3820.6900.7720.312
매장종류0.0350.1130.3821.0000.4000.5620.188
순번0.0000.0920.6900.4001.0000.7860.312
묘지수납구분0.0270.1280.7720.5620.7861.0000.589
감면구분0.0600.2070.3120.1880.3120.5891.000
2023-12-11T01:43:33.525340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구분시도구분매장종류감면구분묘지수납구분사용료
순번1.0000.6900.0000.4000.3120.7860.092
구분0.6901.0000.0080.3820.3120.7720.095
시도구분0.0000.0081.0000.0350.0600.0270.127
매장종류0.4000.3820.0351.0000.1880.5620.113
감면구분0.3120.3120.0600.1881.0000.5890.207
묘지수납구분0.7860.7720.0270.5620.5891.0000.128
사용료0.0920.0950.1270.1130.2070.1281.000

Missing values

2023-12-11T01:43:28.522934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:43:28.800140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T01:43:28.960471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

묘지정보순번사망일자매장일자만료일자구분개장여부개장일자시도구분매장종류감면구분묘지수납구분사용료
1203606묘원 32블럭 0668호11974-05-291974-05-30<NA>신규<NA><NA>인접시체일반영구(신규불가)300000
1372907묘원 37블럭 0032호11982-05-091982-05-11<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
1367307묘원 36블럭 0669호11975-07-281975-07-30<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
1072705묘원 30블럭 0507호11952-02-021973-10-28<NA>신규<NA><NA>자시개장유골일반영구(신규불가)300000
1104005묘원 31블럭 0224호11973-12-221973-12-24<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
2134611묘원 53블럭 0124호11979-12-181979-12-21<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
751603묘원 22블럭 0103호11972-12-031972-12-05<NA>개장Y2015-07-18자시시체일반영구(신규불가)300000
1952109묘원 49블럭 0295호11977-09-201977-09-22<NA>개장Y2008-05-01타시시체일반영구(신규불가)300000
1273507묘원 35블럭 0107호11974-10-231974-10-25<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
1460207묘원 38블럭 0339호11975-09-141975-09-16<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
묘지정보순번사망일자매장일자만료일자구분개장여부개장일자시도구분매장종류감면구분묘지수납구분사용료
2488013묘원 58블럭 0008호11980-03-021980-03-04<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
95001묘원 03블럭 0119호11962-01-041969-07-09<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
2577413묘원 59블럭 0193호11980-04-151980-04-17<NA>개장Y2012-04-28자시시체일반영구(신규불가)300000
1695008묘원 43블럭 0271호11976-08-041976-08-06<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
1837309묘원 47블럭 0236호11977-04-301977-05-02<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
1405907묘원 37블럭 0350호11975-11-061975-11-15<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
230301묘원 07블럭 0054호22000-04-242000-04-26<NA>재사용<NA><NA>자시시체기타면제영구(신규불가)300000
2563213묘원 59블럭 0062호11980-04-111980-04-13<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
2173111묘원 53블럭 0493호11980-01-011980-01-04<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
2358611묘원 56블럭 0097호11979-01-151979-01-17<NA>신규<NA><NA>자시시체일반영구(신규불가)300000