Overview

Dataset statistics

Number of variables: 9
Number of observations: 46
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.5 KiB
Average record size in memory: 78.9 B

Variable types

Categorical: 6
Text: 1
Numeric: 2

Dataset

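Description: Data for Namyangju City consisting of the tax item name, tax source type name, number of levies, and levied amount for each tax year. Tax items (세목명): tobacco consumption tax (담배소비세), education tax (교육세), urban planning tax (도시계획세), acquisition tax (취득세), leisure tax (레저세), property tax (재산세), automobile tax (자동차세), resident tax (주민세), local income tax (지방소득세), registration and license tax (등록면허세), regional resource facility tax (지역자원시설세), arrears (체납)
Author: Namyangju City, Gyeonggi-do (경기도 남양주시)
URL: https://www.data.go.kr/data/15102888/fileData.do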

Alerts

시도명 has constant value "경기도" (Constant)
시군구명 has constant value "남양주시" (Constant)
자치단체코드 has constant value "41360" (Constant)
과세년도 has constant value "2021" (Constant)
데이터기준일자 has constant value "2023-02-21" (Constant)
부과건수 is highly overall correlated with 부과금액 and 1 other field (High correlation)
부과금액 is highly overall correlated with 부과건수 (High correlation)
세목명 is highly overall correlated with 부과건수 (High correlation)
세원 유형명 has unique values (Unique)
부과건수 has 11 (23.9%) zeros (Zeros)
부과금액 has 11 (23.9%) zeros (Zeros)
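
The constant-value and zeros alerts above amount to simple per-column checks. A minimal sketch of both follows, using only the Python standard library. The helper names `constant_value` and `zero_share` are hypothetical, and the numeric list is a placeholder that matches the report only in its number of zeros (11 of 46), not in its actual values.

```python
def constant_value(values):
    """Return the single value if every entry is identical, else None."""
    distinct = set(values)
    return next(iter(distinct)) if len(distinct) == 1 else None


def zero_share(values):
    """Return (number of zeros, percentage of zeros rounded to one decimal)."""
    zeros = sum(1 for v in values if v == 0)
    return zeros, round(100 * zeros / len(values), 1)


# 시도명 as reported: the same value in all 46 rows.
sido = ["경기도"] * 46

# Placeholder numeric column: 11 zeros and 35 non-zero entries.
# Only the zero count is taken from the report; the non-zero values are made up.
levy_counts = [0] * 11 + [1] * 35

print(constant_value(sido))
print(zero_share(levy_counts))
```

A column for which `constant_value` returns something other than `None` carries no information across rows, which is why such columns are usually dropped before modelling.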

Reproduction

Analysis started: 2023-12-12 08:43:54.603745
Analysis finished: 2023-12-12 08:43:55.696079
Duration: 1.09 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시도명 (province name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

[Figure: histogram of lengths of the category]

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기도
2nd row: 경기도
3rd row: 경기도
4th row: 경기도
5th row: 경기도

Common Values

Value | Count | Frequency (%)
경기도 | 46 | 100.0%

시군구명 (city/county/district name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

[Figure: histogram of lengths of the category]

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 남양주시
2nd row: 남양주시
3rd row: 남양주시
4th row: 남양주시
5th row: 남양주시

Common Values

Value | Count | Frequency (%)
남양주시 | 46 | 100.0%

자치단체코드 (local government code)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

[Figure: histogram of lengths of the category]

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 41360
2nd row: 41360
3rd row: 41360
4th row: 41360
5th row: 41360

Common Values

Value | Count | Frequency (%)
41360 | 46 | 100.0%

과세년도 (tax year)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

[Figure: histogram of lengths of the category]

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021
2nd row: 2021
3rd row: 2021
4th row: 2021
5th row: 2021

Common Values

Value | Count | Frequency (%)
2021 | 46 | 100.0%

세목명 (tax item name)
Categorical

HIGH CORRELATION

Distinct: 13
Distinct (%): 28.3%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B

Length

Max length: 7
Median length: 3
Mean length: 3.7826087
Min length: 2

[Figure: histogram of lengths of the category]

Unique

Unique: 5
Unique (%): 10.9%

Sample

1st row: 담배소비세
2nd row: 교육세
3rd row: 도시계획세
4th row: 취득세
5th row: 취득세

Common Values

Value | Count | Frequency (%)
취득세 | 9 | 19.6%
자동차세 | 7 | 15.2%
주민세 | 7 | 15.2%
재산세 | 5 | 10.9%
레저세 | 4 | 8.7%
지방소득세 | 4 | 8.7%
지역자원시설세 | 3 | 6.5%
등록면허세 | 2 | 4.3%
담배소비세 | 1 | 2.2%
교육세 | 1 | 2.2%
Other values (3) | 3 | 6.5%
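
The distinct, unique, and length figures for 세목명 can be cross-checked from the value counts alone. The sketch below is a consistency check, not the profiler's own code. The ten named counts come from the Common Values table; the three single-occurrence values behind "Other values (3)" are taken from the sample rows of this report (도시계획세, 지방소비세, 체납), which is an assumption about what that bucket contains.

```python
from collections import Counter
from statistics import mean, median

# Value counts for 세목명: ten from the Common Values table, plus the three
# values assumed to make up "Other values (3)" (seen once each in the sample rows).
value_counts = Counter({
    "취득세": 9, "자동차세": 7, "주민세": 7, "재산세": 5, "레저세": 4,
    "지방소득세": 4, "지역자원시설세": 3, "등록면허세": 2,
    "담배소비세": 1, "교육세": 1,
    "도시계획세": 1, "지방소비세": 1, "체납": 1,
})

n_rows = sum(value_counts.values())                        # number of observations
distinct = len(value_counts)                               # different values
unique = sum(1 for c in value_counts.values() if c == 1)   # values seen exactly once

# One string length per row, so frequent values weigh more.
lengths = [len(value) for value in value_counts.elements()]

print(n_rows, distinct, unique)
print(round(100 * distinct / n_rows, 1), round(100 * unique / n_rows, 1))
print(max(lengths), median(lengths), round(mean(lengths), 7), min(lengths))
```

Note the difference between the two counts: "distinct" counts every different value, while "unique" counts only the values that occur in exactly one row.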

세원 유형명 (tax source type name)
Text

UNIQUE

Distinct: 46
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B

Length

Max length: 11
Median length: 8
Mean length: 6.0217391
Min length: 2

Characters and Unicode

Total characters: 277
Distinct characters: 73
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
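
As a rough illustration of these character properties, the sketch below tallies Unicode general categories for three values from this column's sample rows with the standard-library `unicodedata` module. It is not the profiler's implementation. Python's standard library exposes no script property, so the Hangul count uses the character name as a stand-in, which is an assumption that holds for these strings but is not a general script lookup.

```python
import unicodedata
from collections import Counter

# Three values of 세원 유형명 that appear in the report's sample rows.
samples = ["건축물", "주택(개별)", "주택(단독)"]
chars = [ch for value in samples for ch in value]

# General category per character: Lo = Other Letter, Ps = Open Punctuation,
# Pe = Close Punctuation, Nd = Decimal Number.
category_counts = Counter(unicodedata.category(ch) for ch in chars)

# Name-based proxy for the Hangul script (assumption, see lead-in).
hangul_chars = sum(
    1 for ch in chars if unicodedata.name(ch, "").startswith("HANGUL")
)

print(dict(category_counts))
print(hangul_chars, "of", len(chars), "characters are Hangul")
```

The four categories reported for this column (Other Letter, Open Punctuation, Close Punctuation, Decimal Number) correspond to the codes `Lo`, `Ps`, `Pe`, and `Nd`.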

Unique

Unique: 46
Unique (%): 100.0%

Sample

1st row: 담배소비세
2nd row: 교육세
3rd row: 도시계획세
4th row: 건축물
5th row: 주택(개별)

Value | Count | Frequency (%)
담배소비세 | 1 | 2.2%
주민세(양도소득 | 1 | 2.2%
지역자원시설세(특자 | 1 | 2.2%
승합 | 1 | 2.2%
기타승용 | 1 | 2.2%
승용 | 1 | 2.2%
주민세(사업소분 | 1 | 2.2%
주민세(개인분 | 1 | 2.2%
주민세(종업원분 | 1 | 2.2%
주민세(특별징수 | 1 | 2.2%
Other values (36) | 36 | 78.3%

Most occurring characters

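Value | Count | Frequency (%)
(not recovered) | 27 | 9.7%
) | 24 | 8.7%
( | 24 | 8.7%
(not recovered) | 14 | 5.1%
(not recovered) | 11 | 4.0%
(not recovered) | 10 | 3.6%
(not recovered) | 9 | 3.2%
(not recovered) | 7 | 2.5%
(not recovered) | 6 | 2.2%
(not recovered) | 5 | 1.8%
Other values (63) | 140 | 50.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 228 | 82.3%
Close Punctuation | 24 | 8.7%
Open Punctuation | 24 | 8.7%
Decimal Number | 1 | 0.4%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not recovered) | 27 | 11.8%
(not recovered) | 14 | 6.1%
(not recovered) | 11 | 4.8%
(not recovered) | 10 | 4.4%
(not recovered) | 9 | 3.9%
(not recovered) | 7 | 3.1%
(not recovered) | 6 | 2.6%
(not recovered) | 5 | 2.2%
(not recovered) | 5 | 2.2%
(not recovered) | 5 | 2.2%
Other values (60) | 129 | 56.6%

Close Punctuation
Value | Count | Frequency (%)
) | 24 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 24 | 100.0%

Decimal Number
Value | Count | Frequency (%)
3 | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 228 | 82.3%
Common | 49 | 17.7%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not recovered) | 27 | 11.8%
(not recovered) | 14 | 6.1%
(not recovered) | 11 | 4.8%
(not recovered) | 10 | 4.4%
(not recovered) | 9 | 3.9%
(not recovered) | 7 | 3.1%
(not recovered) | 6 | 2.6%
(not recovered) | 5 | 2.2%
(not recovered) | 5 | 2.2%
(not recovered) | 5 | 2.2%
Other values (60) | 129 | 56.6%

Common
Value | Count | Frequency (%)
) | 24 | 49.0%
( | 24 | 49.0%
3 | 1 | 2.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 228 | 82.3%
ASCII | 49 | 17.7%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not recovered) | 27 | 11.8%
(not recovered) | 14 | 6.1%
(not recovered) | 11 | 4.8%
(not recovered) | 10 | 4.4%
(not recovered) | 9 | 3.9%
(not recovered) | 7 | 3.1%
(not recovered) | 6 | 2.6%
(not recovered) | 5 | 2.2%
(not recovered) | 5 | 2.2%
(not recovered) | 5 | 2.2%
Other values (60) | 129 | 56.6%

ASCII
Value | Count | Frequency (%)
) | 24 | 49.0%
( | 24 | 49.0%
3 | 1 | 2.0%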

부과건수 (number of levies)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 36
Distinct (%): 78.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 92130.935
Minimum: 0
Maximum: 1422392
Zeros: 11
Zeros (%): 23.9%
Negative: 0
Negative (%): 0.0%
Memory size: 546.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 8
Median: 2946
Q3: 58479.25
95-th percentile: 411739.25
Maximum: 1422392
Range: 1422392
Interquartile range (IQR): 58471.25

Descriptive statistics

Standard deviation: 232902.36
Coefficient of variation (CV): 2.5279496
Kurtosis: 24.200437
Mean: 92130.935
Median Absolute Deviation (MAD): 2946
Skewness: 4.5260079
Sum: 4238023
Variance: 5.4243509 × 10^10
Monotonicity: Not monotonic
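
Several of the figures above are derived from others in the same block, so they can be sanity-checked with plain arithmetic. The sketch below recomputes the mean, coefficient of variation, IQR, range, and variance for 부과건수 from the reported sum, standard deviation, quartiles, and extremes. It assumes the usual definitions (CV = standard deviation / mean, IQR = Q3 - Q1, variance = standard deviation squared), and small differences in the last digits are expected because the inputs are already rounded.

```python
# Reported values for 부과건수, copied from the statistics above.
n = 46
total = 4238023              # Sum
std = 232902.36              # Standard deviation
q1, q3 = 8, 58479.25         # Q1 and Q3
minimum, maximum = 0, 1422392

mean = total / n             # arithmetic mean
cv = std / mean              # coefficient of variation
iqr = q3 - q1                # interquartile range
value_range = maximum - minimum
variance = std ** 2

print(f"mean={mean:.3f} cv={cv:.4f} iqr={iqr} range={value_range}")
print(f"variance={variance:.4e}")
```

A CV above 2 together with a median (2946) far below the mean is consistent with the strong right skew reported for this variable.
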
[Figure: histogram with fixed size bins (bins=36)]

Common values

Value | Count | Frequency (%)
0 | 11 | 23.9%
481 | 1 | 2.2%
158234 | 1 | 2.2%
415520 | 1 | 2.2%
43470 | 1 | 2.2%
270606 | 1 | 2.2%
2704 | 1 | 2.2%
149137 | 1 | 2.2%
8908 | 1 | 2.2%
13484 | 1 | 2.2%
Other values (26) | 26 | 56.5%

Minimum 10 values

Value | Count | Frequency (%)
0 | 11 | 23.9%
7 | 1 | 2.2%
11 | 1 | 2.2%
12 | 1 | 2.2%
88 | 1 | 2.2%
433 | 1 | 2.2%
475 | 1 | 2.2%
481 | 1 | 2.2%
643 | 1 | 2.2%
1520 | 1 | 2.2%

Maximum 10 values

Value | Count | Frequency (%)
1422392 | 1 | 2.2%
472746 | 1 | 2.2%
415520 | 1 | 2.2%
400397 | 1 | 2.2%
270606 | 1 | 2.2%
248702 | 1 | 2.2%
196838 | 1 | 2.2%
158234 | 1 | 2.2%
149137 | 1 | 2.2%
100123 | 1 | 2.2%

부과금액 (levied amount)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 36
Distinct (%): 78.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.7495979 × 10^10
Minimum: 0
Maximum: 1.96142 × 10^11
Zeros: 11
Zeros (%): 23.9%
Negative: 0
Negative (%): 0.0%
Memory size: 546.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 14725500
Median: 2.438527 × 10^9
Q3: 3.7804968 × 10^10
95-th percentile: 1.124055 × 10^11
Maximum: 1.96142 × 10^11
Range: 1.96142 × 10^11
Interquartile range (IQR): 3.7790242 × 10^10

Descriptive statistics

Standard deviation: 4.4832157 × 10^10
Coefficient of variation (CV): 1.6304987
Kurtosis: 5.0743279
Mean: 2.7495979 × 10^10
Median Absolute Deviation (MAD): 2.438527 × 10^9
Skewness: 2.2051142
Sum: 1.264815 × 10^12
Variance: 2.0099223 × 10^21
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=36)]

Common values

Value | Count | Frequency (%)
0 | 11 | 23.9%
40235500000 | 1 | 2.2%
26397740000 | 1 | 2.2%
60193977000 | 1 | 2.2%
3454402000 | 1 | 2.2%
2715378000 | 1 | 2.2%
4329367000 | 1 | 2.2%
30743173000 | 1 | 2.2%
38414851000 | 1 | 2.2%
62469553000 | 1 | 2.2%
Other values (26) | 26 | 56.5%

Minimum 10 values

Value | Count | Frequency (%)
0 | 11 | 23.9%
7613000 | 1 | 2.2%
36063000 | 1 | 2.2%
45645000 | 1 | 2.2%
51519000 | 1 | 2.2%
98710000 | 1 | 2.2%
128077000 | 1 | 2.2%
527353000 | 1 | 2.2%
1099869000 | 1 | 2.2%
1270720000 | 1 | 2.2%

Maximum 10 values

Value | Count | Frequency (%)
196142000000 | 1 | 2.2%
167467000000 | 1 | 2.2%
114386000000 | 1 | 2.2%
106464000000 | 1 | 2.2%
79484000000 | 1 | 2.2%
67828201000 | 1 | 2.2%
67116583000 | 1 | 2.2%
62469553000 | 1 | 2.2%
60193977000 | 1 | 2.2%
56577433000 | 1 | 2.2%

데이터기준일자 (data reference date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

[Figure: histogram of lengths of the category]

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-02-21
2nd row: 2023-02-21
3rd row: 2023-02-21
4th row: 2023-02-21
5th row: 2023-02-21

Common Values

Value | Count | Frequency (%)
2023-02-21 | 46 | 100.0%

Interactions


Correlations

Variable | 세목명 | 세원 유형명 | 부과건수 | 부과금액
세목명 | 1.000 | 1.000 | 0.863 | 0.636
세원 유형명 | 1.000 | 1.000 | 1.000 | 1.000
부과건수 | 0.863 | 1.000 | 1.000 | 0.646
부과금액 | 0.636 | 1.000 | 0.646 | 1.000

Variable | 부과건수 | 부과금액 | 세목명
부과건수 | 1.000 | 0.777 | 0.623
부과금액 | 0.777 | 1.000 | 0.319
세목명 | 0.623 | 0.319 | 1.000
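
The high-correlation alerts listed earlier in the report line up with the three-variable matrix above if pairs are flagged at an absolute correlation of 0.5 or more. That cut-off is an assumption made for this sketch and is not stated anywhere in the report. The code flags variable pairs from the matrix under that assumption.

```python
from itertools import combinations

# Three-variable correlation matrix, copied from the report.
labels = ["부과건수", "부과금액", "세목명"]
matrix = [
    [1.000, 0.777, 0.623],
    [0.777, 1.000, 0.319],
    [0.623, 0.319, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, not taken from the report

# Visit each unordered pair once (upper triangle) and keep the strong ones.
flagged = [
    (labels[i], labels[j], matrix[i][j])
    for i, j in combinations(range(len(labels)), 2)
    if abs(matrix[i][j]) >= THRESHOLD
]

for left, right, value in flagged:
    print(f"{left} <-> {right}: {value:.3f}")
```

Under this assumption 부과건수 is paired with both other variables, while 부과금액 and 세목명 (0.319) are not paired with each other, which matches the wording of the alerts.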

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

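Index | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 세원 유형명 | 부과건수 | 부과금액 | 데이터기준일자
0 | 경기도 | 남양주시 | 41360 | 2021 | 담배소비세 | 담배소비세 | 481 | 40235500000 | 2023-02-21
1 | 경기도 | 남양주시 | 41360 | 2021 | 교육세 | 교육세 | 1422392 | 106464000000 | 2023-02-21
2 | 경기도 | 남양주시 | 41360 | 2021 | 도시계획세 | 도시계획세 | 0 | 0 | 2023-02-21
3 | 경기도 | 남양주시 | 41360 | 2021 | 취득세 | 건축물 | 11030 | 114386000000 | 2023-02-21
4 | 경기도 | 남양주시 | 41360 | 2021 | 취득세 | 주택(개별) | 3188 | 21084235000 | 2023-02-21
5 | 경기도 | 남양주시 | 41360 | 2021 | 취득세 | 주택(단독) | 23176 | 167467000000 | 2023-02-21
6 | 경기도 | 남양주시 | 41360 | 2021 | 취득세 | 기타 | 433 | 2161676000 | 2023-02-21
7 | 경기도 | 남양주시 | 41360 | 2021 | 취득세 | 항공기 | 0 | 0 | 2023-02-21
8 | 경기도 | 남양주시 | 41360 | 2021 | 취득세 | 기계장비 | 643 | 1099869000 | 2023-02-21
9 | 경기도 | 남양주시 | 41360 | 2021 | 취득세 | 차량 | 54514 | 67116583000 | 2023-02-21

Index | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 세원 유형명 | 부과건수 | 부과금액 | 데이터기준일자
36 | 경기도 | 남양주시 | 41360 | 2021 | 지방소득세 | 지방소득세(법인소득) | 8908 | 38414851000 | 2023-02-21
37 | 경기도 | 남양주시 | 41360 | 2021 | 지방소득세 | 지방소득세(양도소득) | 13484 | 62469553000 | 2023-02-21
38 | 경기도 | 남양주시 | 41360 | 2021 | 지방소득세 | 지방소득세(종합소득) | 158234 | 26397740000 | 2023-02-21
39 | 경기도 | 남양주시 | 41360 | 2021 | 지방소비세 | 지방소비세 | 7 | 10152575000 | 2023-02-21
40 | 경기도 | 남양주시 | 41360 | 2021 | 등록면허세 | 등록면허세(면허) | 89768 | 1397689000 | 2023-02-21
41 | 경기도 | 남양주시 | 41360 | 2021 | 등록면허세 | 등록면허세(등록) | 196838 | 30153716000 | 2023-02-21
42 | 경기도 | 남양주시 | 41360 | 2021 | 지역자원시설세 | 지역자원시설세(소방) | 400397 | 16217363000 | 2023-02-21
43 | 경기도 | 남양주시 | 41360 | 2021 | 지역자원시설세 | 지역자원시설세(시설) | 11 | 98710000 | 2023-02-21
44 | 경기도 | 남양주시 | 41360 | 2021 | 지역자원시설세 | 지역자원시설세(특자) | 2179 | 1621296000 | 2023-02-21
45 | 경기도 | 남양주시 | 41360 | 2021 | 체납 | 체납 | 472746 | 56577433000 | 2023-02-21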