Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells49881
Missing cells (%)99.8%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory488.3 KiB
Average record size in memory50.0 B

Variable types

Text3
Numeric2

Dataset

Description경상북도 군위군에서 관리하는 경로당현황에 대한 데이터로 명칭, 소재지 주소, 건축연도, 면적, 전화번호의 항목을 제공합니다.
Author경상북도 군위군
URLhttps://www.data.go.kr/data/15033523/fileData.do

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates
명칭 has 9976 (99.8%) missing valuesMissing
주소 has 9976 (99.8%) missing valuesMissing
건축연도 has 9976 (99.8%) missing valuesMissing
면적 has 9976 (99.8%) missing valuesMissing
전화번호 has 9977 (99.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 06:05:41.896995
Analysis finished2023-12-12 06:05:43.136085
Duration1.24 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

명칭
Text

MISSING 

Distinct24
Distinct (%)100.0%
Missing9976
Missing (%)99.8%
Memory size156.2 KiB
2023-12-12T15:05:43.296764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length7
Mean length7.4166667
Min length6

Characters and Unicode

Total characters178
Distinct characters49
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)100.0%

Sample

1st row위성2리경로당
2nd row내의3리경로당
3rd row위성3리경로당
4th row대흥2리경로당
5th row송원2리경로당
ValueCountFrequency (%)
위성2리경로당 1
 
4.2%
내의3리경로당 1
 
4.2%
신덕1리경로당 1
 
4.2%
화북2리경로당 1
 
4.2%
수서1리경로당 1
 
4.2%
노행1리경로당 1
 
4.2%
매곡2리경로당 1
 
4.2%
하곡리(속골)경로당 1
 
4.2%
신화1리경로당 1
 
4.2%
학성리경로당 1
 
4.2%
Other values (14) 14
58.3%
2023-12-12T15:05:43.648310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25
14.0%
24
13.5%
24
13.5%
24
13.5%
1 9
 
5.1%
2 9
 
5.1%
4
 
2.2%
3 3
 
1.7%
3
 
1.7%
) 3
 
1.7%
Other values (39) 50
28.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 151
84.8%
Decimal Number 21
 
11.8%
Close Punctuation 3
 
1.7%
Open Punctuation 3
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
16.6%
24
15.9%
24
15.9%
24
15.9%
4
 
2.6%
3
 
2.0%
3
 
2.0%
2
 
1.3%
2
 
1.3%
2
 
1.3%
Other values (34) 38
25.2%
Decimal Number
ValueCountFrequency (%)
1 9
42.9%
2 9
42.9%
3 3
 
14.3%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 151
84.8%
Common 27
 
15.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
16.6%
24
15.9%
24
15.9%
24
15.9%
4
 
2.6%
3
 
2.0%
3
 
2.0%
2
 
1.3%
2
 
1.3%
2
 
1.3%
Other values (34) 38
25.2%
Common
ValueCountFrequency (%)
1 9
33.3%
2 9
33.3%
3 3
 
11.1%
) 3
 
11.1%
( 3
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 151
84.8%
ASCII 27
 
15.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
25
16.6%
24
15.9%
24
15.9%
24
15.9%
4
 
2.6%
3
 
2.0%
3
 
2.0%
2
 
1.3%
2
 
1.3%
2
 
1.3%
Other values (34) 38
25.2%
ASCII
ValueCountFrequency (%)
1 9
33.3%
2 9
33.3%
3 3
 
11.1%
) 3
 
11.1%
( 3
 
11.1%

주소
Text

MISSING 

Distinct24
Distinct (%)100.0%
Missing9976
Missing (%)99.8%
Memory size156.2 KiB
2023-12-12T15:05:43.926228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length23
Mean length20.541667
Min length18

Characters and Unicode

Total characters493
Distinct characters68
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)100.0%

Sample

1st row경상북도 군위군 소보면 대량길 53
2nd row경상북도 군위군 소보면 내의길 753
3rd row경상북도 군위군 소보면 위성2길 7
4th row경상북도 군위군 군위읍 대흥2길 19-5
5th row경상북도 군위군 소보면 송원2길 5
ValueCountFrequency (%)
경상북도 24
20.7%
군위군 24
20.7%
군위읍 7
 
6.0%
소보면 4
 
3.4%
삼국유사면 3
 
2.6%
의흥면 2
 
1.7%
산성면 2
 
1.7%
부계면 2
 
1.7%
5 2
 
1.7%
우보면 2
 
1.7%
Other values (43) 44
37.9%
2023-12-12T15:05:44.317510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
92
18.7%
55
 
11.2%
32
 
6.5%
25
 
5.1%
24
 
4.9%
24
 
4.9%
24
 
4.9%
18
 
3.7%
17
 
3.4%
2 16
 
3.2%
Other values (58) 166
33.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 323
65.5%
Space Separator 92
 
18.7%
Decimal Number 73
 
14.8%
Dash Punctuation 5
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
55
17.0%
32
 
9.9%
25
 
7.7%
24
 
7.4%
24
 
7.4%
24
 
7.4%
18
 
5.6%
17
 
5.3%
7
 
2.2%
6
 
1.9%
Other values (46) 91
28.2%
Decimal Number
ValueCountFrequency (%)
2 16
21.9%
1 13
17.8%
4 9
12.3%
5 8
11.0%
7 8
11.0%
8 6
 
8.2%
9 4
 
5.5%
3 4
 
5.5%
6 3
 
4.1%
0 2
 
2.7%
Space Separator
ValueCountFrequency (%)
92
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 323
65.5%
Common 170
34.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
55
17.0%
32
 
9.9%
25
 
7.7%
24
 
7.4%
24
 
7.4%
24
 
7.4%
18
 
5.6%
17
 
5.3%
7
 
2.2%
6
 
1.9%
Other values (46) 91
28.2%
Common
ValueCountFrequency (%)
92
54.1%
2 16
 
9.4%
1 13
 
7.6%
4 9
 
5.3%
5 8
 
4.7%
7 8
 
4.7%
8 6
 
3.5%
- 5
 
2.9%
9 4
 
2.4%
3 4
 
2.4%
Other values (2) 5
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 323
65.5%
ASCII 170
34.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
92
54.1%
2 16
 
9.4%
1 13
 
7.6%
4 9
 
5.3%
5 8
 
4.7%
7 8
 
4.7%
8 6
 
3.5%
- 5
 
2.9%
9 4
 
2.4%
3 4
 
2.4%
Other values (2) 5
 
2.9%
Hangul
ValueCountFrequency (%)
55
17.0%
32
 
9.9%
25
 
7.7%
24
 
7.4%
24
 
7.4%
24
 
7.4%
18
 
5.6%
17
 
5.3%
7
 
2.2%
6
 
1.9%
Other values (46) 91
28.2%

건축연도
Real number (ℝ)

MISSING 

Distinct12
Distinct (%)50.0%
Missing9976
Missing (%)99.8%
Infinite0
Infinite (%)0.0%
Mean2004.625
Minimum1995
Maximum2013
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T15:05:44.458596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1995
5-th percentile1996
Q12001.75
median2004.5
Q32009
95-th percentile2012.7
Maximum2013
Range18
Interquartile range (IQR)7.25

Descriptive statistics

Standard deviation5.0200684
Coefficient of variation (CV)0.0025042431
Kurtosis-0.43298478
Mean2004.625
Median Absolute Deviation (MAD)3.5
Skewness-0.18582853
Sum48111
Variance25.201087
MonotonicityNot monotonic
2023-12-12T15:05:44.593876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
2001 3
 
< 0.1%
2009 3
 
< 0.1%
2005 3
 
< 0.1%
2004 2
 
< 0.1%
2007 2
 
< 0.1%
2002 2
 
< 0.1%
1996 2
 
< 0.1%
2003 2
 
< 0.1%
2013 2
 
< 0.1%
2010 1
 
< 0.1%
Other values (2) 2
 
< 0.1%
(Missing) 9976
99.8%
ValueCountFrequency (%)
1995 1
 
< 0.1%
1996 2
< 0.1%
2001 3
< 0.1%
2002 2
< 0.1%
2003 2
< 0.1%
2004 2
< 0.1%
2005 3
< 0.1%
2007 2
< 0.1%
2009 3
< 0.1%
2010 1
 
< 0.1%
ValueCountFrequency (%)
2013 2
< 0.1%
2011 1
 
< 0.1%
2010 1
 
< 0.1%
2009 3
< 0.1%
2007 2
< 0.1%
2005 3
< 0.1%
2004 2
< 0.1%
2003 2
< 0.1%
2002 2
< 0.1%
2001 3
< 0.1%

면적
Real number (ℝ)

MISSING 

Distinct19
Distinct (%)79.2%
Missing9976
Missing (%)99.8%
Infinite0
Infinite (%)0.0%
Mean80.414167
Minimum42
Maximum117.17
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T15:05:44.738118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum42
5-th percentile60.8355
Q171.7
median76.275
Q390.4575
95-th percentile114.021
Maximum117.17
Range75.17
Interquartile range (IQR)18.7575

Descriptive statistics

Standard deviation16.99817
Coefficient of variation (CV)0.21138278
Kurtosis0.82776559
Mean80.414167
Median Absolute Deviation (MAD)6.09
Skewness0.39178242
Sum1929.94
Variance288.9378
MonotonicityNot monotonic
2023-12-12T15:05:44.897908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
71.7 6
 
0.1%
42.0 1
 
< 0.1%
96.56 1
 
< 0.1%
116.1 1
 
< 0.1%
70.65 1
 
< 0.1%
96.0 1
 
< 0.1%
90.01 1
 
< 0.1%
60.48 1
 
< 0.1%
74.44 1
 
< 0.1%
80.0 1
 
< 0.1%
Other values (9) 9
 
0.1%
(Missing) 9976
99.8%
ValueCountFrequency (%)
42.0 1
 
< 0.1%
60.48 1
 
< 0.1%
62.85 1
 
< 0.1%
70.65 1
 
< 0.1%
71.7 6
0.1%
74.44 1
 
< 0.1%
75.03 1
 
< 0.1%
77.52 1
 
< 0.1%
80.0 1
 
< 0.1%
80.88 1
 
< 0.1%
ValueCountFrequency (%)
117.17 1
< 0.1%
116.1 1
< 0.1%
102.24 1
< 0.1%
96.56 1
< 0.1%
96.0 1
< 0.1%
91.8 1
< 0.1%
90.01 1
< 0.1%
83.18 1
< 0.1%
82.83 1
< 0.1%
80.88 1
< 0.1%

전화번호
Text

MISSING 

Distinct23
Distinct (%)100.0%
Missing9977
Missing (%)99.8%
Memory size156.2 KiB
2023-12-12T15:05:45.133452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters276
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)100.0%

Sample

1st row054-383-7471
2nd row054-382-4332
3rd row054-382-9031
4th row054-382-3773
5th row054-382-6677
ValueCountFrequency (%)
054-383-7471 1
 
4.3%
054-383-5371 1
 
4.3%
054-382-7876 1
 
4.3%
054-383-4340 1
 
4.3%
054-383-0567 1
 
4.3%
054-382-3957 1
 
4.3%
054-382-9689 1
 
4.3%
054-382-0170 1
 
4.3%
054-383-5508 1
 
4.3%
054-382-6871 1
 
4.3%
Other values (13) 13
56.5%
2023-12-12T15:05:45.453204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 46
16.7%
3 46
16.7%
0 38
13.8%
5 33
12.0%
8 33
12.0%
4 28
10.1%
7 15
 
5.4%
2 14
 
5.1%
1 9
 
3.3%
6 9
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 230
83.3%
Dash Punctuation 46
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 46
20.0%
0 38
16.5%
5 33
14.3%
8 33
14.3%
4 28
12.2%
7 15
 
6.5%
2 14
 
6.1%
1 9
 
3.9%
6 9
 
3.9%
9 5
 
2.2%
Dash Punctuation
ValueCountFrequency (%)
- 46
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 276
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 46
16.7%
3 46
16.7%
0 38
13.8%
5 33
12.0%
8 33
12.0%
4 28
10.1%
7 15
 
5.4%
2 14
 
5.1%
1 9
 
3.3%
6 9
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 276
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 46
16.7%
3 46
16.7%
0 38
13.8%
5 33
12.0%
8 33
12.0%
4 28
10.1%
7 15
 
5.4%
2 14
 
5.1%
1 9
 
3.3%
6 9
 
3.3%

Interactions

2023-12-12T15:05:42.516073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:05:42.264194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:05:42.635164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:05:42.406989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:05:45.566567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
명칭주소건축연도면적전화번호
명칭1.0001.0001.0001.0001.000
주소1.0001.0001.0001.0001.000
건축연도1.0001.0001.0000.6171.000
면적1.0001.0000.6171.0001.000
전화번호1.0001.0001.0001.0001.000
2023-12-12T15:05:45.673271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
건축연도면적
건축연도1.0000.038
면적0.0381.000

Missing values

2023-12-12T15:05:42.802363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:05:42.906849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T15:05:43.057413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

명칭주소건축연도면적전화번호
88006<NA><NA><NA><NA><NA>
63084<NA><NA><NA><NA><NA>
20818<NA><NA><NA><NA><NA>
67427<NA><NA><NA><NA><NA>
12507<NA><NA><NA><NA><NA>
1855<NA><NA><NA><NA><NA>
81344<NA><NA><NA><NA><NA>
45831<NA><NA><NA><NA><NA>
66212<NA><NA><NA><NA><NA>
48725<NA><NA><NA><NA><NA>
명칭주소건축연도면적전화번호
72194<NA><NA><NA><NA><NA>
2206<NA><NA><NA><NA><NA>
92026<NA><NA><NA><NA><NA>
6601<NA><NA><NA><NA><NA>
57686<NA><NA><NA><NA><NA>
71670<NA><NA><NA><NA><NA>
64368<NA><NA><NA><NA><NA>
47701<NA><NA><NA><NA><NA>
1158<NA><NA><NA><NA><NA>
88860<NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

명칭주소건축연도면적전화번호# duplicates
0<NA><NA><NA><NA><NA>9976