Overview

Dataset statistics

Number of variables5
Number of observations3001
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory129.1 KiB
Average record size in memory44.0 B

Variable types

Categorical1
Numeric3
Text1

Dataset

Description대전광역시 지역화폐인 온통대전의 월별 업종별 매출액 현황입니다. 매출액 단위는 (원)입니다. 기간은 2020년 5월부터 2021년 9월까지입니다. 업종은 상세업종이며, 표준산업분류체계와 다를 수 있습니다. 2021년 공공데이터 기업매칭지원사업으로 추진되었습니다.
Author대전광역시
URLhttps://www.data.go.kr/data/15097192/fileData.do

Alerts

is highly overall correlated with High correlation
is highly overall correlated with High correlation

Reproduction

Analysis started2023-12-12 17:31:58.248468
Analysis finished2023-12-12 17:31:59.783381
Duration1.53 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables


Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.6 KiB
2021
1588 
2020
1413 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2021 1588
52.9%
2020 1413
47.1%

Length

2023-12-13T02:31:59.843356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:31:59.958541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 1588
52.9%
2020 1413
47.1%


Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.6531156
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.5 KiB
2023-12-13T02:32:00.076249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q15
median7
Q39
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.0034366
Coefficient of variation (CV)0.45143309
Kurtosis-0.76913282
Mean6.6531156
Median Absolute Deviation (MAD)2
Skewness-0.11486304
Sum19966
Variance9.0206311
MonotonicityNot monotonic
2023-12-13T02:32:00.184815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
6 357
11.9%
7 355
11.8%
8 355
11.8%
5 351
11.7%
9 351
11.7%
11 179
6.0%
2 177
5.9%
10 176
5.9%
3 176
5.9%
12 175
5.8%
Other values (2) 349
11.6%
ValueCountFrequency (%)
1 174
5.8%
2 177
5.9%
3 176
5.9%
4 175
5.8%
5 351
11.7%
6 357
11.9%
7 355
11.8%
8 355
11.8%
9 351
11.7%
10 176
5.9%
ValueCountFrequency (%)
12 175
5.8%
11 179
6.0%
10 176
5.9%
9 351
11.7%
8 355
11.8%
7 355
11.8%
6 357
11.9%
5 351
11.7%
4 175
5.8%
3 176
5.9%

업종코드
Real number (ℝ)

Distinct186
Distinct (%)6.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5181.2569
Minimum1101
Maximum9906
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.5 KiB
2023-12-13T02:32:00.323234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1101
5-th percentile1201
Q13101
median5103
Q38103
95-th percentile9402
Maximum9906
Range8805
Interquartile range (IQR)5002

Descriptive statistics

Standard deviation2641.2438
Coefficient of variation (CV)0.50976892
Kurtosis-1.3009924
Mean5181.2569
Median Absolute Deviation (MAD)2502
Skewness0.070488285
Sum15548952
Variance6976168.6
MonotonicityNot monotonic
2023-12-13T02:32:00.455115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1101 17
 
0.6%
7103 17
 
0.6%
6502 17
 
0.6%
6504 17
 
0.6%
6601 17
 
0.6%
6602 17
 
0.6%
6603 17
 
0.6%
6701 17
 
0.6%
6702 17
 
0.6%
7101 17
 
0.6%
Other values (176) 2831
94.3%
ValueCountFrequency (%)
1101 17
0.6%
1104 17
0.6%
1106 17
0.6%
1107 17
0.6%
1108 16
0.5%
1110 17
0.6%
1111 17
0.6%
1112 17
0.6%
1201 17
0.6%
1202 17
0.6%
ValueCountFrequency (%)
9906 17
0.6%
9801 17
0.6%
9602 17
0.6%
9511 17
0.6%
9504 17
0.6%
9503 16
0.5%
9502 17
0.6%
9407 17
0.6%
9402 17
0.6%
9306 17
0.6%
Distinct186
Distinct (%)6.2%
Missing0
Missing (%)0.0%
Memory size23.6 KiB
2023-12-13T02:32:00.683812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length5.8617128
Min length2

Characters and Unicode

Total characters17591
Distinct characters244
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row양복점.한복점
2nd row아동.유아복
3rd row일반의류
4th row스포츠용품점
5th row모피.무스탕
ValueCountFrequency (%)
양복점.한복점 17
 
0.6%
입시학원,보습학원 17
 
0.6%
자동차정비 17
 
0.6%
타이어판매점 17
 
0.6%
주유소 17
 
0.6%
주유소(lpg 17
 
0.6%
기타연료판매 17
 
0.6%
주차장.폐차장 17
 
0.6%
세차장 17
 
0.6%
유치원 17
 
0.6%
Other values (177) 2848
94.4%
2023-12-13T02:32:01.116281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
818
 
4.7%
. 797
 
4.5%
579
 
3.3%
530
 
3.0%
459
 
2.6%
341
 
1.9%
320
 
1.8%
320
 
1.8%
309
 
1.8%
288
 
1.6%
Other values (234) 12830
72.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16021
91.1%
Other Punctuation 1018
 
5.8%
Close Punctuation 225
 
1.3%
Open Punctuation 225
 
1.3%
Uppercase Letter 85
 
0.5%
Space Separator 17
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
818
 
5.1%
579
 
3.6%
530
 
3.3%
459
 
2.9%
341
 
2.1%
320
 
2.0%
320
 
2.0%
309
 
1.9%
288
 
1.8%
255
 
1.6%
Other values (224) 11802
73.7%
Uppercase Letter
ValueCountFrequency (%)
P 34
40.0%
C 17
20.0%
L 17
20.0%
G 17
20.0%
Other Punctuation
ValueCountFrequency (%)
. 797
78.3%
, 187
 
18.4%
/ 34
 
3.3%
Close Punctuation
ValueCountFrequency (%)
) 225
100.0%
Open Punctuation
ValueCountFrequency (%)
( 225
100.0%
Space Separator
ValueCountFrequency (%)
17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16021
91.1%
Common 1485
 
8.4%
Latin 85
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
818
 
5.1%
579
 
3.6%
530
 
3.3%
459
 
2.9%
341
 
2.1%
320
 
2.0%
320
 
2.0%
309
 
1.9%
288
 
1.8%
255
 
1.6%
Other values (224) 11802
73.7%
Common
ValueCountFrequency (%)
. 797
53.7%
) 225
 
15.2%
( 225
 
15.2%
, 187
 
12.6%
/ 34
 
2.3%
17
 
1.1%
Latin
ValueCountFrequency (%)
P 34
40.0%
C 17
20.0%
L 17
20.0%
G 17
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16021
91.1%
ASCII 1570
 
8.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
818
 
5.1%
579
 
3.6%
530
 
3.3%
459
 
2.9%
341
 
2.1%
320
 
2.0%
320
 
2.0%
309
 
1.9%
288
 
1.8%
255
 
1.6%
Other values (224) 11802
73.7%
ASCII
ValueCountFrequency (%)
. 797
50.8%
) 225
 
14.3%
( 225
 
14.3%
, 187
 
11.9%
/ 34
 
2.2%
P 34
 
2.2%
C 17
 
1.1%
L 17
 
1.1%
17
 
1.1%
G 17
 
1.1%

매출액
Real number (ℝ)

Distinct2981
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.6055362 × 108
Minimum-1123500
Maximum3.1648561 × 1010
Zeros0
Zeros (%)0.0%
Negative1
Negative (%)< 0.1%
Memory size26.5 KiB
2023-12-13T02:32:01.257702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-1123500
5-th percentile815000
Q117408500
median1.1092715 × 108
Q34.723942 × 108
95-th percentile2.7896138 × 109
Maximum3.1648561 × 1010
Range3.1649685 × 1010
Interquartile range (IQR)4.549857 × 108

Descriptive statistics

Standard deviation1.9619622 × 109
Coefficient of variation (CV)2.9701785
Kurtosis86.108138
Mean6.6055362 × 108
Median Absolute Deviation (MAD)1.0823195 × 108
Skewness8.049883
Sum1.9823214 × 1012
Variance3.8492955 × 1018
MonotonicityNot monotonic
2023-12-13T02:32:01.415000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
300000 5
 
0.2%
100000 4
 
0.1%
200000 3
 
0.1%
800 3
 
0.1%
390000 3
 
0.1%
1460000 2
 
0.1%
2488000 2
 
0.1%
960000 2
 
0.1%
1600 2
 
0.1%
50000 2
 
0.1%
Other values (2971) 2973
99.1%
ValueCountFrequency (%)
-1123500 1
 
< 0.1%
800 3
0.1%
1000 1
 
< 0.1%
1200 1
 
< 0.1%
1600 2
0.1%
2000 1
 
< 0.1%
5600 1
 
< 0.1%
7200 1
 
< 0.1%
10000 1
 
< 0.1%
15000 1
 
< 0.1%
ValueCountFrequency (%)
31648561493 1
< 0.1%
27942186265 1
< 0.1%
26420437168 1
< 0.1%
25652085912 1
< 0.1%
23777724418 1
< 0.1%
22917198343 1
< 0.1%
21809803006 1
< 0.1%
20870897009 1
< 0.1%
19868929416 1
< 0.1%
19329639644 1
< 0.1%

Interactions

2023-12-13T02:31:59.334065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:31:58.533577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:31:58.787671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:31:59.431358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:31:58.620922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:31:58.881539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:31:59.534533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:31:58.699906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:31:58.974949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:32:01.512022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종코드매출액
1.0000.8040.0000.074
0.8041.0000.0000.000
업종코드0.0000.0001.0000.307
매출액0.0740.0000.3071.000
2023-12-13T02:32:01.617361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종코드매출액
1.0000.003-0.0280.637
업종코드0.0031.000-0.2640.000
매출액-0.028-0.2641.0000.057
0.6370.0000.0571.000

Missing values

2023-12-13T02:31:59.666074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:31:59.750565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종코드업종명매출액
0202051101양복점.한복점9781000
1202051104아동.유아복21652220
2202051106일반의류114344410
3202051107스포츠용품점35796121
4202051108모피.무스탕300000
5202051110내의류15813570
6202051111교복9023000
7202051112의류관련(기타)3062500
8202051201커튼.카펫.수건15395200
9202051202침구및수예품55046730
업종코드업종명매출액
2991202199306창고보관업3403900
2992202199402판촉물.인쇄.복사41288951
2993202199407기계.장비(기타)30495543
2994202199502법무전문서비스7810250
2995202199503세무전문서비스850000
2996202199504광고기획.대행업6972525
2997202199511기타전문서비스38237760
2998202199602기타서비스514324432
2999202199801각종단체회비9182850
3000202199906기부금12445000