Overview

Dataset statistics

Number of variables3
Number of observations2284
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory58.1 KiB
Average record size in memory26.1 B

Variable types

Numeric2
Text1

Dataset

Description대구광역시_북구_교통유발부담금 부과현황_20141231
Author대구광역시 북구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15032292&dataSetDetailId=150322921a7719f1a5c48&provdMethod=FILE

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2024-04-19 06:13:44.291553
Analysis finished2024-04-19 06:13:44.995057
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct2284
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1142.5
Minimum1
Maximum2284
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size20.2 KiB
2024-04-19T15:13:45.074024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile115.15
Q1571.75
median1142.5
Q31713.25
95-th percentile2169.85
Maximum2284
Range2283
Interquartile range (IQR)1141.5

Descriptive statistics

Standard deviation659.47833
Coefficient of variation (CV)0.57722392
Kurtosis-1.2
Mean1142.5
Median Absolute Deviation (MAD)571
Skewness0
Sum2609470
Variance434911.67
MonotonicityStrictly increasing
2024-04-19T15:13:45.201501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
1527 1
 
< 0.1%
1521 1
 
< 0.1%
1522 1
 
< 0.1%
1523 1
 
< 0.1%
1524 1
 
< 0.1%
1525 1
 
< 0.1%
1526 1
 
< 0.1%
1528 1
 
< 0.1%
1519 1
 
< 0.1%
Other values (2274) 2274
99.6%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2284 1
< 0.1%
2283 1
< 0.1%
2282 1
< 0.1%
2281 1
< 0.1%
2280 1
< 0.1%
2279 1
< 0.1%
2278 1
< 0.1%
2277 1
< 0.1%
2276 1
< 0.1%
2275 1
< 0.1%
Distinct903
Distinct (%)39.5%
Missing0
Missing (%)0.0%
Memory size18.0 KiB
2024-04-19T15:13:45.491856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length44
Mean length22.57049
Min length19

Characters and Unicode

Total characters51551
Distinct characters106
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique792 ?
Unique (%)34.7%

Sample

1st row대구광역시 북구 고성동1가 104-9번지
2nd row대구광역시 북구 고성동2가 1-0번지
3rd row대구광역시 북구 고성동2가 33-1번지
4th row대구광역시 북구 고성동2가 57-2번지
5th row대구광역시 북구 고성동2가 59-1번지 외 3필지
ValueCountFrequency (%)
대구광역시 2284
24.2%
북구 2284
24.2%
산격동 1176
12.5%
동천동 245
 
2.6%
1629-0번지 238
 
2.5%
1621번지 180
 
1.9%
유통단지전자관 180
 
1.9%
1668-0번지 137
 
1.5%
침산동 136
 
1.4%
1667-0번지 134
 
1.4%
Other values (960) 2441
25.9%
2024-04-19T15:13:46.046462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9433
18.3%
4649
 
9.0%
2566
 
5.0%
2522
 
4.9%
2328
 
4.5%
2299
 
4.5%
2286
 
4.4%
1 2286
 
4.4%
2284
 
4.4%
2284
 
4.4%
Other values (96) 18614
36.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29287
56.8%
Decimal Number 10711
 
20.8%
Space Separator 9433
 
18.3%
Dash Punctuation 2093
 
4.1%
Other Punctuation 20
 
< 0.1%
Uppercase Letter 5
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4649
15.9%
2566
8.8%
2522
8.6%
2328
7.9%
2299
7.8%
2286
7.8%
2284
7.8%
2284
7.8%
2284
7.8%
1321
 
4.5%
Other values (78) 4464
15.2%
Decimal Number
ValueCountFrequency (%)
1 2286
21.3%
6 2018
18.8%
0 1539
14.4%
2 1328
12.4%
9 793
 
7.4%
5 618
 
5.8%
7 597
 
5.6%
3 597
 
5.6%
8 506
 
4.7%
4 429
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
40.0%
A 1
20.0%
H 1
20.0%
D 1
20.0%
Space Separator
ValueCountFrequency (%)
9433
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2093
100.0%
Other Punctuation
ValueCountFrequency (%)
, 20
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29287
56.8%
Common 22259
43.2%
Latin 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4649
15.9%
2566
8.8%
2522
8.6%
2328
7.9%
2299
7.8%
2286
7.8%
2284
7.8%
2284
7.8%
2284
7.8%
1321
 
4.5%
Other values (78) 4464
15.2%
Common
ValueCountFrequency (%)
9433
42.4%
1 2286
 
10.3%
- 2093
 
9.4%
6 2018
 
9.1%
0 1539
 
6.9%
2 1328
 
6.0%
9 793
 
3.6%
5 618
 
2.8%
7 597
 
2.7%
3 597
 
2.7%
Other values (4) 957
 
4.3%
Latin
ValueCountFrequency (%)
B 2
40.0%
A 1
20.0%
H 1
20.0%
D 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29287
56.8%
ASCII 22264
43.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9433
42.4%
1 2286
 
10.3%
- 2093
 
9.4%
6 2018
 
9.1%
0 1539
 
6.9%
2 1328
 
6.0%
9 793
 
3.6%
5 618
 
2.8%
7 597
 
2.7%
3 597
 
2.7%
Other values (8) 962
 
4.3%
Hangul
ValueCountFrequency (%)
4649
15.9%
2566
8.8%
2522
8.6%
2328
7.9%
2299
7.8%
2286
7.8%
2284
7.8%
2284
7.8%
2284
7.8%
1321
 
4.5%
Other values (78) 4464
15.2%

부과금액
Real number (ℝ)

Distinct1461
Distinct (%)64.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean984078.54
Minimum2960
Maximum1.425178 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size20.2 KiB
2024-04-19T15:13:46.195105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2960
5-th percentile39980
Q180715
median161900
Q3528025
95-th percentile3024276
Maximum1.425178 × 108
Range1.4251484 × 108
Interquartile range (IQR)447310

Descriptive statistics

Standard deviation5791187.3
Coefficient of variation (CV)5.8848833
Kurtosis297.92884
Mean984078.54
Median Absolute Deviation (MAD)104340
Skewness15.804273
Sum2.2476354 × 109
Variance3.3537851 × 1013
MonotonicityNot monotonic
2024-04-19T15:13:46.357502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
79970 134
 
5.9%
98920 111
 
4.9%
88440 86
 
3.8%
68520 79
 
3.5%
107360 60
 
2.6%
39980 56
 
2.5%
27050 41
 
1.8%
72410 31
 
1.4%
49730 27
 
1.2%
131900 17
 
0.7%
Other values (1451) 1642
71.9%
ValueCountFrequency (%)
2960 1
< 0.1%
4050 2
0.1%
4420 1
< 0.1%
5690 1
< 0.1%
5920 1
< 0.1%
6940 1
< 0.1%
7010 1
< 0.1%
7320 1
< 0.1%
7550 1
< 0.1%
8060 1
< 0.1%
ValueCountFrequency (%)
142517800 1
< 0.1%
110691640 1
< 0.1%
98092810 1
< 0.1%
97201080 1
< 0.1%
60289260 1
< 0.1%
54980570 1
< 0.1%
50928730 1
< 0.1%
50928720 1
< 0.1%
48586490 2
0.1%
41335600 1
< 0.1%

Interactions

2024-04-19T15:13:44.632216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:13:44.463524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:13:44.720180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:13:44.542614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-19T15:13:46.459802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번부과금액
연번1.0000.079
부과금액0.0791.000
2024-04-19T15:13:46.554435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번부과금액
연번1.0000.369
부과금액0.3691.000

Missing values

2024-04-19T15:13:44.855431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-19T15:13:44.958613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시설물주소부과금액
01대구광역시 북구 고성동1가 104-9번지521910
12대구광역시 북구 고성동2가 1-0번지183570
23대구광역시 북구 고성동2가 33-1번지514850
34대구광역시 북구 고성동2가 57-2번지252950
45대구광역시 북구 고성동2가 59-1번지 외 3필지742600
56대구광역시 북구 고성동2가 116-2번지235510
67대구광역시 북구 고성동3가 1-1번지6552760
78대구광역시 북구 고성동3가 -0번지 1-4, 1-5번지419350
89대구광역시 북구 고성동3가 2-0번지33670160
910대구광역시 북구 고성동3가 5-18번지152030
연번시설물주소부과금액
22742275대구광역시 북구 학정동 533번지10084210
22752276대구광역시 북구 국우동 1099-6번지232570
22762277대구광역시 북구 국우동 1109-4번지534580
22772278대구광역시 북구 학정동 456번지466050
22782279대구광역시 북구 대현동 281-14번지434090
22792280대구광역시 북구 칠성동2가 302-155번지48586490
22802281대구광역시 북구 칠성동2가 302-155번지48586490
22812282대구광역시 북구 칠성동2가 20-1번지 외1필지50928730
22822283대구광역시 북구 칠성동2가 20-1번지 외1필지50928720
22832284대구광역시 북구 동천동 968-0번지142517800