Overview

Dataset statistics

Number of variables13
Number of observations10000
Missing cells26217
Missing cells (%)20.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 MiB
Average record size in memory114.0 B

Variable types

Text2
Categorical7
DateTime3
Boolean1

Dataset

Description부산시설공단_영락공원묘지사용현황_20230125
Author부산시설공단
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15067559

Alerts

개장여부 has constant value ""Constant
순번 is highly overall correlated with 구분 and 1 other fieldsHigh correlation
구분 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
매장종류 is highly overall correlated with 묘지수납구분High correlation
감면구분 is highly overall correlated with 묘지수납구분High correlation
묘지수납구분 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
순번 is highly imbalanced (79.8%)Imbalance
시도구분 is highly imbalanced (76.6%)Imbalance
매장종류 is highly imbalanced (80.9%)Imbalance
감면구분 is highly imbalanced (87.8%)Imbalance
묘지수납구분 is highly imbalanced (78.9%)Imbalance
사용료 is highly imbalanced (98.6%)Imbalance
만료일자 has 9357 (93.6%) missing valuesMissing
개장여부 has 8400 (84.0%) missing valuesMissing
개장일자 has 8400 (84.0%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:43:18.237803
Analysis finished2023-12-10 16:43:19.612363
Duration1.37 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9801
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T01:43:19.837329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length15
Mean length15
Min length15

Characters and Unicode

Total characters150000
Distinct characters16
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9603 ?
Unique (%)96.0%

Sample

1st row07묘원 37블럭 0017호
2nd row14묘원 62블럭 0198호
3rd row06묘원 32블럭 0214호
4th row06묘원 32블럭 0279호
5th row03묘원 22블럭 0017호
ValueCountFrequency (%)
01묘원 1417
 
4.7%
11묘원 1023
 
3.4%
14묘원 979
 
3.3%
08묘원 893
 
3.0%
07묘원 857
 
2.9%
02묘원 739
 
2.5%
03묘원 623
 
2.1%
09묘원 581
 
1.9%
04묘원 572
 
1.9%
10묘원 570
 
1.9%
Other values (959) 21746
72.5%
2023-12-11T01:43:20.236050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 23425
15.6%
20000
13.3%
1 12255
8.2%
10000
 
6.7%
10000
 
6.7%
10000
 
6.7%
10000
 
6.7%
10000
 
6.7%
3 7611
 
5.1%
2 7492
 
5.0%
Other values (6) 29217
19.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 80000
53.3%
Other Letter 50000
33.3%
Space Separator 20000
 
13.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 23425
29.3%
1 12255
15.3%
3 7611
 
9.5%
2 7492
 
9.4%
4 7151
 
8.9%
5 6254
 
7.8%
6 4975
 
6.2%
7 3795
 
4.7%
8 3740
 
4.7%
9 3302
 
4.1%
Other Letter
ValueCountFrequency (%)
10000
20.0%
10000
20.0%
10000
20.0%
10000
20.0%
10000
20.0%
Space Separator
ValueCountFrequency (%)
20000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 100000
66.7%
Hangul 50000
33.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 23425
23.4%
20000
20.0%
1 12255
12.3%
3 7611
 
7.6%
2 7492
 
7.5%
4 7151
 
7.2%
5 6254
 
6.3%
6 4975
 
5.0%
7 3795
 
3.8%
8 3740
 
3.7%
Hangul
ValueCountFrequency (%)
10000
20.0%
10000
20.0%
10000
20.0%
10000
20.0%
10000
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100000
66.7%
Hangul 50000
33.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 23425
23.4%
20000
20.0%
1 12255
12.3%
3 7611
 
7.6%
2 7492
 
7.5%
4 7151
 
7.2%
5 6254
 
6.3%
6 4975
 
5.0%
7 3795
 
3.8%
8 3740
 
3.7%
Hangul
ValueCountFrequency (%)
10000
20.0%
10000
20.0%
10000
20.0%
10000
20.0%
10000
20.0%

순번
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
9442 
2
 
545
3
 
13

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 9442
94.4%
2 545
 
5.5%
3 13
 
0.1%

Length

2023-12-11T01:43:20.384454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:20.541175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 9442
94.4%
2 545
 
5.5%
3 13
 
0.1%
Distinct5403
Distinct (%)54.4%
Missing60
Missing (%)0.6%
Memory size156.2 KiB
2023-12-11T01:43:20.866381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters99400
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2989 ?
Unique (%)30.1%

Sample

1st row1975-07-21
2nd row1980-09-22
3rd row1974-03-10
4th row1982-02-18
5th row1972-09-02
ValueCountFrequency (%)
1901-01-01 55
 
0.6%
1927-08-27 17
 
0.2%
1980-05-06 9
 
0.1%
1979-11-16 8
 
0.1%
1979-03-21 8
 
0.1%
1979-10-24 7
 
0.1%
1977-03-12 7
 
0.1%
1978-12-14 7
 
0.1%
1977-11-25 7
 
0.1%
1978-11-12 7
 
0.1%
Other values (5393) 9808
98.7%
2023-12-11T01:43:21.389580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 19880
20.0%
1 19872
20.0%
0 14636
14.7%
9 12553
12.6%
7 8711
8.8%
2 7247
 
7.3%
8 4614
 
4.6%
6 3191
 
3.2%
3 3141
 
3.2%
5 2852
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 79520
80.0%
Dash Punctuation 19880
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 19872
25.0%
0 14636
18.4%
9 12553
15.8%
7 8711
11.0%
2 7247
 
9.1%
8 4614
 
5.8%
6 3191
 
4.0%
3 3141
 
3.9%
5 2852
 
3.6%
4 2703
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 19880
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 99400
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 19880
20.0%
1 19872
20.0%
0 14636
14.7%
9 12553
12.6%
7 8711
8.8%
2 7247
 
7.3%
8 4614
 
4.6%
6 3191
 
3.2%
3 3141
 
3.2%
5 2852
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 99400
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 19880
20.0%
1 19872
20.0%
0 14636
14.7%
9 12553
12.6%
7 8711
8.8%
2 7247
 
7.3%
8 4614
 
4.6%
6 3191
 
3.2%
3 3141
 
3.2%
5 2852
 
2.9%
Distinct5111
Distinct (%)51.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1955-01-14 00:00:00
Maximum2020-09-04 00:00:00
2023-12-11T01:43:21.577023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:21.736701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

만료일자
Date

MISSING 

Distinct615
Distinct (%)95.6%
Missing9357
Missing (%)93.6%
Memory size156.2 KiB
Minimum2023-05-29 00:00:00
Maximum2050-09-03 00:00:00
2023-12-11T01:43:21.919844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:22.092907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
신규
7876 
개장
1600 
재사용
 
524

Length

Max length3
Median length2
Mean length2.0524
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신규
2nd row신규
3rd row신규
4th row신규
5th row신규

Common Values

ValueCountFrequency (%)
신규 7876
78.8%
개장 1600
 
16.0%
재사용 524
 
5.2%

Length

2023-12-11T01:43:22.226180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:22.330188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신규 7876
78.8%
개장 1600
 
16.0%
재사용 524
 
5.2%

개장여부
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)0.1%
Missing8400
Missing (%)84.0%
Memory size97.7 KiB
True
1600 
(Missing)
8400 
ValueCountFrequency (%)
True 1600
 
16.0%
(Missing) 8400
84.0%
2023-12-11T01:43:22.437771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

개장일자
Date

MISSING 

Distinct1249
Distinct (%)78.1%
Missing8400
Missing (%)84.0%
Memory size156.2 KiB
Minimum1900-01-01 00:00:00
Maximum2020-10-25 00:00:00
2023-12-11T01:43:22.551560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:22.742546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

시도구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
자시
9433 
인접
 
313
타시
 
254

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row자시
2nd row자시
3rd row자시
4th row자시
5th row자시

Common Values

ValueCountFrequency (%)
자시 9433
94.3%
인접 313
 
3.1%
타시 254
 
2.5%

Length

2023-12-11T01:43:22.929333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:23.032003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자시 9433
94.3%
인접 313
 
3.1%
타시 254
 
2.5%

매장종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
시체
9358 
개장유골
 
377
부부합장
 
142
화장유골
 
92
유골합장
 
31

Length

Max length4
Median length2
Mean length2.1284
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row시체
2nd row시체
3rd row시체
4th row시체
5th row시체

Common Values

ValueCountFrequency (%)
시체 9358
93.6%
개장유골 377
 
3.8%
부부합장 142
 
1.4%
화장유골 92
 
0.9%
유골합장 31
 
0.3%

Length

2023-12-11T01:43:23.141421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:23.308319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
시체 9358
93.6%
개장유골 377
 
3.8%
부부합장 142
 
1.4%
화장유골 92
 
0.9%
유골합장 31
 
0.3%

감면구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반
9460 
기타면제
 
408
사전예매
 
89
생활수급
 
20
지역주민
 
13
Other values (3)
 
10

Length

Max length4
Median length2
Mean length2.108
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
일반 9460
94.6%
기타면제 408
 
4.1%
사전예매 89
 
0.9%
생활수급 20
 
0.2%
지역주민 13
 
0.1%
참전유공 5
 
0.1%
국가유공 4
 
< 0.1%
시설수용 1
 
< 0.1%

Length

2023-12-11T01:43:23.488747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:23.645080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 9460
94.6%
기타면제 408
 
4.1%
사전예매 89
 
0.9%
생활수급 20
 
0.2%
지역주민 13
 
0.1%
참전유공 5
 
< 0.1%
국가유공 4
 
< 0.1%
시설수용 1
 
< 0.1%

묘지수납구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영구(신규불가)
9387 
최초(15년)
 
611
<NA>
 
2

Length

Max length8
Median length8
Mean length7.9381
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영구(신규불가)
2nd row영구(신규불가)
3rd row영구(신규불가)
4th row영구(신규불가)
5th row영구(신규불가)

Common Values

ValueCountFrequency (%)
영구(신규불가) 9387
93.9%
최초(15년) 611
 
6.1%
<NA> 2
 
< 0.1%

Length

2023-12-11T01:43:23.803790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:23.939292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영구(신규불가 9387
93.9%
최초(15년 611
 
6.1%
na 2
 
< 0.1%

사용료
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
300000
9970 
600000
 
22
0
 
5
<NA>
 
2
24500
 
1

Length

Max length6
Median length6
Mean length5.997
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row300000
2nd row300000
3rd row300000
4th row300000
5th row300000

Common Values

ValueCountFrequency (%)
300000 9970
99.7%
600000 22
 
0.2%
0 5
 
0.1%
<NA> 2
 
< 0.1%
24500 1
 
< 0.1%

Length

2023-12-11T01:43:24.073963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:24.186182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
300000 9970
99.7%
600000 22
 
0.2%
0 5
 
< 0.1%
na 2
 
< 0.1%
24500 1
 
< 0.1%

Correlations

2023-12-11T01:43:24.286260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구분시도구분매장종류감면구분묘지수납구분사용료
순번1.0000.9330.0000.4710.4490.5350.111
구분0.9331.0000.0160.4530.4460.5230.115
시도구분0.0000.0161.0000.0530.0960.0320.094
매장종류0.4710.4530.0531.0000.2820.4500.154
감면구분0.4490.4460.0960.2821.0000.7640.344
묘지수납구분0.5350.5230.0320.4500.7641.0000.236
사용료0.1110.1150.0940.1540.3440.2361.000
2023-12-11T01:43:24.431911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도구분사용료구분매장종류순번묘지수납구분감면구분
시도구분1.0000.0880.0050.0400.0000.0540.060
사용료0.0881.0000.1090.1260.1040.1570.160
구분0.0050.1091.0000.3840.6840.7880.317
매장종류0.0400.1260.3841.0000.4030.5470.177
순번0.0000.1040.6840.4031.0000.8010.320
묘지수납구분0.0540.1570.7880.5470.8011.0000.589
감면구분0.0600.1600.3170.1770.3200.5891.000
2023-12-11T01:43:24.562841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구분시도구분매장종류감면구분묘지수납구분사용료
순번1.0000.6840.0000.4030.3200.8010.104
구분0.6841.0000.0050.3840.3170.7880.109
시도구분0.0000.0051.0000.0400.0600.0540.088
매장종류0.4030.3840.0401.0000.1770.5470.126
감면구분0.3200.3170.0600.1771.0000.5890.160
묘지수납구분0.8010.7880.0540.5470.5891.0000.157
사용료0.1040.1090.0880.1260.1600.1571.000

Missing values

2023-12-11T01:43:19.255184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:43:19.401485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T01:43:19.536699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

묘지정보순번사망일자매장일자만료일자구분개장여부개장일자시도구분매장종류감면구분묘지수납구분사용료
1371207묘원 37블럭 0017호11975-07-211975-07-23<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
2722214묘원 62블럭 0198호11980-09-221980-09-24<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
1159306묘원 32블럭 0214호11974-03-101974-03-11<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
1165806묘원 32블럭 0279호11982-02-181982-02-20<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
742803묘원 22블럭 0017호11972-09-021972-09-04<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
2568513묘원 59블럭 0112호11980-04-101980-04-12<NA>개장Y2015-01-27자시시체일반영구(신규불가)300000
1244206묘원 34블럭 0028호11974-06-131974-06-15<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
2468512묘원 57블럭 0541호11979-10-241979-10-26<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
1006705묘원 29블럭 0249호11979-02-201979-02-22<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
1706808묘원 43블럭 0387호11976-10-131976-10-15<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
묘지정보순번사망일자매장일자만료일자구분개장여부개장일자시도구분매장종류감면구분묘지수납구분사용료
2632413묘원 60블럭 0437호11980-06-201980-06-22<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
328501묘원 10블럭 0048호11970-08-311970-09-02<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
1269907묘원 35블럭 0074호12013-01-082013-01-102043-01-09신규<NA><NA>자시시체기타면제최초(15년)300000
1289107묘원 35블럭 0258호11974-10-171974-10-19<NA>개장Y1999-08-13자시시체일반영구(신규불가)300000
952704묘원 28블럭 0040호11973-05-141973-05-16<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
1160506묘원 32블럭 0229호11974-03-051974-03-07<NA>개장Y1996-12-30자시시체일반영구(신규불가)300000
1616208묘원 42블럭 0006호11978-07-031978-07-06<NA>개장Y2005-03-12자시시체일반영구(신규불가)300000
2191411묘원 53블럭 0667호11980-01-061980-01-08<NA>신규<NA><NA>자시시체일반영구(신규불가)300000
1062105묘원 30블럭 0395호22006-09-062006-09-082036-09-07재사용<NA><NA>자시시체일반최초(15년)300000
496802묘원 14블럭 0371호11972-08-191972-08-21<NA>개장Y2012-05-03자시시체일반영구(신규불가)300000