Overview

Dataset statistics

Number of variables6
Number of observations31
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory55.3 B

Variable types

Categorical1
Text2
Numeric2
DateTime1

Alerts

집계년도 has constant value ""Constant
지정년도 is highly overall correlated with 사업비(천원)High correlation
사업비(천원) is highly overall correlated with 지정년도High correlation
시군명 has unique valuesUnique
사업비(천원) has unique valuesUnique

Reproduction

Analysis started2023-12-10 21:03:41.155309
Analysis finished2023-12-10 21:03:42.015950
Duration0.86 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

집계년도
Categorical

CONSTANT 

Distinct1
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size380.0 B
2023
31 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2023 31
100.0%

Length

2023-12-11T06:03:42.086092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:03:42.182045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 31
100.0%

시군명
Text

UNIQUE 

Distinct31
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size380.0 B
2023-12-11T06:03:42.380091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3.0967742
Min length3

Characters and Unicode

Total characters96
Distinct characters38
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)100.0%

Sample

1st row수원시
2nd row용인시
3rd row고양시
4th row성남시
5th row화성시
ValueCountFrequency (%)
수원시 1
 
3.2%
하남시 1
 
3.2%
가평군 1
 
3.2%
과천시 1
 
3.2%
동두천시 1
 
3.2%
여주시 1
 
3.2%
양평군 1
 
3.2%
포천시 1
 
3.2%
의왕시 1
 
3.2%
안성시 1
 
3.2%
Other values (21) 21
67.7%
2023-12-11T06:03:42.806210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29
30.2%
6
 
6.2%
5
 
5.2%
5
 
5.2%
4
 
4.2%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (28) 32
33.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 96
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
30.2%
6
 
6.2%
5
 
5.2%
5
 
5.2%
4
 
4.2%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (28) 32
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 96
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
30.2%
6
 
6.2%
5
 
5.2%
5
 
5.2%
4
 
4.2%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (28) 32
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 96
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
29
30.2%
6
 
6.2%
5
 
5.2%
5
 
5.2%
4
 
4.2%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (28) 32
33.3%
Distinct23
Distinct (%)74.2%
Missing0
Missing (%)0.0%
Memory size380.0 B
2023-12-11T06:03:43.066294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length9.7096774
Min length8

Characters and Unicode

Total characters301
Distinct characters20
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)51.6%

Sample

1st row44동(시 전체)
2nd row38읍면동(시 전체)
3rd row44동(시 전체)
4th row50동(시 전체)
5th row28읍면동(시 전체)
ValueCountFrequency (%)
전체 31
49.2%
14읍면동(시 3
 
4.8%
19동(시 2
 
3.2%
8동(시 2
 
3.2%
6동(시 2
 
3.2%
14동(시 2
 
3.2%
16읍면동(시 2
 
3.2%
44동(시 2
 
3.2%
50동(시 1
 
1.6%
25동(시 1
 
1.6%
Other values (15) 15
23.8%
2023-12-11T06:03:43.453598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32
10.6%
) 31
10.3%
31
10.3%
31
10.3%
( 31
10.3%
28
9.3%
28
9.3%
1 19
6.3%
15
 
5.0%
15
 
5.0%
Other values (10) 40
13.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 151
50.2%
Decimal Number 56
 
18.6%
Space Separator 32
 
10.6%
Close Punctuation 31
 
10.3%
Open Punctuation 31
 
10.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 19
33.9%
4 9
16.1%
2 6
 
10.7%
6 5
 
8.9%
8 4
 
7.1%
5 4
 
7.1%
3 3
 
5.4%
0 3
 
5.4%
9 2
 
3.6%
7 1
 
1.8%
Other Letter
ValueCountFrequency (%)
31
20.5%
31
20.5%
28
18.5%
28
18.5%
15
9.9%
15
9.9%
3
 
2.0%
Space Separator
ValueCountFrequency (%)
32
100.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 151
50.2%
Common 150
49.8%

Most frequent character per script

Common
ValueCountFrequency (%)
32
21.3%
) 31
20.7%
( 31
20.7%
1 19
12.7%
4 9
 
6.0%
2 6
 
4.0%
6 5
 
3.3%
8 4
 
2.7%
5 4
 
2.7%
3 3
 
2.0%
Other values (3) 6
 
4.0%
Hangul
ValueCountFrequency (%)
31
20.5%
31
20.5%
28
18.5%
28
18.5%
15
9.9%
15
9.9%
3
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 151
50.2%
ASCII 150
49.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
32
21.3%
) 31
20.7%
( 31
20.7%
1 19
12.7%
4 9
 
6.0%
2 6
 
4.0%
6 5
 
3.3%
8 4
 
2.7%
5 4
 
2.7%
3 3
 
2.0%
Other values (3) 6
 
4.0%
Hangul
ValueCountFrequency (%)
31
20.5%
31
20.5%
28
18.5%
28
18.5%
15
9.9%
15
9.9%
3
 
2.0%

지정년도
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)19.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2010.6129
Minimum2007
Maximum2012
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size411.0 B
2023-12-11T06:03:43.578224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2007
5-th percentile2007.5
Q12009.5
median2011
Q32012
95-th percentile2012
Maximum2012
Range5
Interquartile range (IQR)2.5

Descriptive statistics

Standard deviation1.6467629
Coefficient of variation (CV)0.00081903527
Kurtosis-0.46026412
Mean2010.6129
Median Absolute Deviation (MAD)1
Skewness-0.85954159
Sum62329
Variance2.711828
MonotonicityNot monotonic
2023-12-11T06:03:43.691526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2012 15
48.4%
2010 5
 
16.1%
2009 4
 
12.9%
2011 3
 
9.7%
2008 2
 
6.5%
2007 2
 
6.5%
ValueCountFrequency (%)
2007 2
 
6.5%
2008 2
 
6.5%
2009 4
 
12.9%
2010 5
 
16.1%
2011 3
 
9.7%
2012 15
48.4%
ValueCountFrequency (%)
2012 15
48.4%
2011 3
 
9.7%
2010 5
 
16.1%
2009 4
 
12.9%
2008 2
 
6.5%
2007 2
 
6.5%
Distinct23
Distinct (%)74.2%
Missing0
Missing (%)0.0%
Memory size380.0 B
Minimum2007-03-02 00:00:00
Maximum2012-11-05 00:00:00
2023-12-11T06:03:43.824386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:03:43.976062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)

사업비(천원)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct31
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean409732.9
Minimum119250
Maximum902600
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size411.0 B
2023-12-11T06:03:44.109015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum119250
5-th percentile235900
Q1290005
median347650
Q3475750
95-th percentile756885
Maximum902600
Range783350
Interquartile range (IQR)185745

Descriptive statistics

Standard deviation176888.14
Coefficient of variation (CV)0.43171572
Kurtosis1.1218472
Mean409732.9
Median Absolute Deviation (MAD)93700
Skewness1.1432587
Sum12701720
Variance3.1289413 × 1010
MonotonicityNot monotonic
2023-12-11T06:03:44.281119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
902600 1
 
3.2%
529850 1
 
3.2%
237000 1
 
3.2%
234800 1
 
3.2%
119250 1
 
3.2%
309330 1
 
3.2%
246800 1
 
3.2%
288720 1
 
3.2%
346850 1
 
3.2%
253950 1
 
3.2%
Other values (21) 21
67.7%
ValueCountFrequency (%)
119250 1
3.2%
234800 1
3.2%
237000 1
3.2%
246800 1
3.2%
253950 1
3.2%
284730 1
3.2%
287600 1
3.2%
288720 1
3.2%
291290 1
3.2%
299782 1
3.2%
ValueCountFrequency (%)
902600 1
3.2%
788120 1
3.2%
725650 1
3.2%
659200 1
3.2%
608800 1
3.2%
529850 1
3.2%
520950 1
3.2%
478450 1
3.2%
473050 1
3.2%
471440 1
3.2%

Interactions

2023-12-11T06:03:41.534372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:03:41.343094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:03:41.641274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:03:41.438509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:03:44.411455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명사업지역정보지정년도센터개소일사업비(천원)
시군명1.0001.0001.0001.0001.000
사업지역정보1.0001.0000.0000.8670.731
지정년도1.0000.0001.0001.0000.490
센터개소일1.0000.8671.0001.0000.923
사업비(천원)1.0000.7310.4900.9231.000
2023-12-11T06:03:44.535489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지정년도사업비(천원)
지정년도1.000-0.567
사업비(천원)-0.5671.000

Missing values

2023-12-11T06:03:41.816374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:03:41.964931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

집계년도시군명사업지역정보지정년도센터개소일사업비(천원)
02023수원시44동(시 전체)20092009-10-13902600
12023용인시38읍면동(시 전체)20112011-06-01529850
22023고양시44동(시 전체)20102010-04-01659200
32023성남시50동(시 전체)20092009-11-16608800
42023화성시28읍면동(시 전체)20102010-09-01725650
52023부천시10동(시 전체)20122012-07-01478450
62023남양주시16읍면동(시 전체)20082008-04-01520950
72023안산시25동(시 전체)20092009-09-12788120
82023평택시25읍면동(시 전체)20122012-08-01473050
92023안양시31동(시 전체)20082008-04-23347650
집계년도시군명사업지역정보지정년도센터개소일사업비(천원)
212023구리시8동(시 전체)20102010-12-27471440
222023안성시15읍면동(시 전체)20122012-08-01287600
232023의왕시6동(시 전체)20122012-07-01253950
242023포천시14읍면동(시 전체)20072007-03-02346850
252023양평군12읍면(군 전체)20122012-07-01288720
262023여주시12읍면동(시 전체)20122012-03-30246800
272023동두천시8동(시 전체)20112011-11-30309330
282023과천시7동 (시 전체)20122012-08-01119250
292023가평군6읍면(군 전체)20122012-11-05234800
302023연천군10읍면(군 전체)20122012-10-30237000