Overview

Dataset statistics

Number of variables5
Number of observations198
Missing cells18
Missing cells (%)1.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.2 KiB
Average record size in memory42.7 B

Variable types

Numeric2
Text3

Alerts

(주)에이비엠그린텍 has 18 (9.1%) missing valuesMissing

Reproduction

Analysis started2023-12-10 06:41:39.368693
Analysis finished2023-12-10 06:41:41.027926
Duration1.66 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

20200506
Real number (ℝ)

Distinct13
Distinct (%)6.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20200521
Minimum20200506
Maximum20200526
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:41:41.114175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20200506
5-th percentile20200506
Q120200515
median20200526
Q320200526
95-th percentile20200526
Maximum20200526
Range20
Interquartile range (IQR)11

Descriptive statistics

Standard deviation7.0995009
Coefficient of variation (CV)3.5145138 × 10-7
Kurtosis-0.55298494
Mean20200521
Median Absolute Deviation (MAD)0
Skewness-0.98390718
Sum3.9997031 × 109
Variance50.402912
MonotonicityIncreasing
2023-12-10T15:41:41.276475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
20200526 112
56.6%
20200506 15
 
7.6%
20200508 12
 
6.1%
20200515 11
 
5.6%
20200519 8
 
4.0%
20200514 7
 
3.5%
20200520 7
 
3.5%
20200522 7
 
3.5%
20200512 6
 
3.0%
20200511 4
 
2.0%
Other values (3) 9
 
4.5%
ValueCountFrequency (%)
20200506 15
7.6%
20200508 12
6.1%
20200511 4
 
2.0%
20200512 6
 
3.0%
20200514 7
3.5%
20200515 11
5.6%
20200518 4
 
2.0%
20200519 8
4.0%
20200520 7
3.5%
20200521 3
 
1.5%
ValueCountFrequency (%)
20200526 112
56.6%
20200525 2
 
1.0%
20200522 7
 
3.5%
20200521 3
 
1.5%
20200520 7
 
3.5%
20200519 8
 
4.0%
20200518 4
 
2.0%
20200515 11
 
5.6%
20200514 7
 
3.5%
20200512 6
 
3.0%
Distinct127
Distinct (%)64.1%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:41:41.614095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length13
Mean length5.8585859
Min length2

Characters and Unicode

Total characters1160
Distinct characters202
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)47.5%

Sample

1st row내의판매
2nd row무역업(종합)
3rd row반도체가공기계제조
4th row부동산
5th row안과
ValueCountFrequency (%)
건설업(종합 13
 
6.6%
고용알선(종합 6
 
3.0%
건물건설(종합 6
 
3.0%
주민센터 5
 
2.5%
건축자재판매 4
 
2.0%
부동산 4
 
2.0%
가공식품도매(종합 4
 
2.0%
곱창.양구이 4
 
2.0%
검정고시학원 3
 
1.5%
화장품.향수 3
 
1.5%
Other values (117) 146
73.7%
2023-12-10T15:41:42.203027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
57
 
4.9%
) 57
 
4.9%
( 57
 
4.9%
50
 
4.3%
49
 
4.2%
31
 
2.7%
31
 
2.7%
28
 
2.4%
26
 
2.2%
. 22
 
1.9%
Other values (192) 752
64.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 998
86.0%
Close Punctuation 57
 
4.9%
Open Punctuation 57
 
4.9%
Uppercase Letter 23
 
2.0%
Other Punctuation 22
 
1.9%
Decimal Number 2
 
0.2%
Space Separator 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
5.7%
50
 
5.0%
49
 
4.9%
31
 
3.1%
31
 
3.1%
28
 
2.8%
26
 
2.6%
19
 
1.9%
18
 
1.8%
18
 
1.8%
Other values (177) 671
67.2%
Uppercase Letter
ValueCountFrequency (%)
C 6
26.1%
P 5
21.7%
V 3
13.0%
D 2
 
8.7%
G 2
 
8.7%
L 2
 
8.7%
R 1
 
4.3%
M 1
 
4.3%
T 1
 
4.3%
Decimal Number
ValueCountFrequency (%)
3 1
50.0%
4 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 57
100.0%
Open Punctuation
ValueCountFrequency (%)
( 57
100.0%
Other Punctuation
ValueCountFrequency (%)
. 22
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 998
86.0%
Common 139
 
12.0%
Latin 23
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
5.7%
50
 
5.0%
49
 
4.9%
31
 
3.1%
31
 
3.1%
28
 
2.8%
26
 
2.6%
19
 
1.9%
18
 
1.8%
18
 
1.8%
Other values (177) 671
67.2%
Latin
ValueCountFrequency (%)
C 6
26.1%
P 5
21.7%
V 3
13.0%
D 2
 
8.7%
G 2
 
8.7%
L 2
 
8.7%
R 1
 
4.3%
M 1
 
4.3%
T 1
 
4.3%
Common
ValueCountFrequency (%)
) 57
41.0%
( 57
41.0%
. 22
 
15.8%
1
 
0.7%
3 1
 
0.7%
4 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 998
86.0%
ASCII 162
 
14.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
57
 
5.7%
50
 
5.0%
49
 
4.9%
31
 
3.1%
31
 
3.1%
28
 
2.8%
26
 
2.6%
19
 
1.9%
18
 
1.8%
18
 
1.8%
Other values (177) 671
67.2%
ASCII
ValueCountFrequency (%)
) 57
35.2%
( 57
35.2%
. 22
 
13.6%
C 6
 
3.7%
P 5
 
3.1%
V 3
 
1.9%
D 2
 
1.2%
G 2
 
1.2%
L 2
 
1.2%
R 1
 
0.6%
Other values (5) 5
 
3.1%
Distinct179
Distinct (%)99.4%
Missing18
Missing (%)9.1%
Memory size1.7 KiB
2023-12-10T15:41:42.597349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length16
Mean length7.7388889
Min length2

Characters and Unicode

Total characters1393
Distinct characters342
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique178 ?
Unique (%)98.9%

Sample

1st row퍼스트올로이수점
2nd row블랙월인터내셔널
3rd row아이스팩코리아
4th row강남퍼스트안과
5th row스티유에이케이플라자수원점
ValueCountFrequency (%)
당진pvc상사 2
 
1.1%
가자조경자재지주목인조잔디마사토모래인공 1
 
0.6%
진미상회 1
 
0.6%
퍼스트올로이수점 1
 
0.6%
통영바다내음 1
 
0.6%
에스에이치종합건설(주 1
 
0.6%
진혜건설 1
 
0.6%
한신공영(주 1
 
0.6%
진기건설기계 1
 
0.6%
대명환경 1
 
0.6%
Other values (169) 169
93.9%
2023-12-10T15:41:43.130296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
53
 
3.8%
( 47
 
3.4%
) 47
 
3.4%
33
 
2.4%
31
 
2.2%
27
 
1.9%
21
 
1.5%
18
 
1.3%
17
 
1.2%
16
 
1.1%
Other values (332) 1083
77.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1256
90.2%
Open Punctuation 47
 
3.4%
Close Punctuation 47
 
3.4%
Uppercase Letter 32
 
2.3%
Other Punctuation 7
 
0.5%
Decimal Number 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
53
 
4.2%
33
 
2.6%
31
 
2.5%
27
 
2.1%
21
 
1.7%
18
 
1.4%
17
 
1.4%
16
 
1.3%
16
 
1.3%
15
 
1.2%
Other values (307) 1009
80.3%
Uppercase Letter
ValueCountFrequency (%)
C 4
12.5%
S 3
 
9.4%
P 3
 
9.4%
K 3
 
9.4%
R 2
 
6.2%
A 2
 
6.2%
V 2
 
6.2%
E 2
 
6.2%
N 2
 
6.2%
X 1
 
3.1%
Other values (8) 8
25.0%
Decimal Number
ValueCountFrequency (%)
1 2
50.0%
2 1
25.0%
3 1
25.0%
Other Punctuation
ValueCountFrequency (%)
. 5
71.4%
& 2
 
28.6%
Open Punctuation
ValueCountFrequency (%)
( 47
100.0%
Close Punctuation
ValueCountFrequency (%)
) 47
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1256
90.2%
Common 105
 
7.5%
Latin 32
 
2.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
53
 
4.2%
33
 
2.6%
31
 
2.5%
27
 
2.1%
21
 
1.7%
18
 
1.4%
17
 
1.4%
16
 
1.3%
16
 
1.3%
15
 
1.2%
Other values (307) 1009
80.3%
Latin
ValueCountFrequency (%)
C 4
12.5%
S 3
 
9.4%
P 3
 
9.4%
K 3
 
9.4%
R 2
 
6.2%
A 2
 
6.2%
V 2
 
6.2%
E 2
 
6.2%
N 2
 
6.2%
X 1
 
3.1%
Other values (8) 8
25.0%
Common
ValueCountFrequency (%)
( 47
44.8%
) 47
44.8%
. 5
 
4.8%
& 2
 
1.9%
1 2
 
1.9%
2 1
 
1.0%
3 1
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1256
90.2%
ASCII 137
 
9.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
53
 
4.2%
33
 
2.6%
31
 
2.5%
27
 
2.1%
21
 
1.7%
18
 
1.4%
17
 
1.4%
16
 
1.3%
16
 
1.3%
15
 
1.2%
Other values (307) 1009
80.3%
ASCII
ValueCountFrequency (%)
( 47
34.3%
) 47
34.3%
. 5
 
3.6%
C 4
 
2.9%
S 3
 
2.2%
P 3
 
2.2%
K 3
 
2.2%
& 2
 
1.5%
R 2
 
1.5%
A 2
 
1.5%
Other values (15) 19
13.9%
Distinct197
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:41:43.504447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length16.5
Mean length7.7121212
Min length2

Characters and Unicode

Total characters1527
Distinct characters363
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique196 ?
Unique (%)99.0%

Sample

1st row퍼스트올로이수점
2nd row블랙월인터네셔널
3rd row아이스팩코리아
4th row시장공인중개사사무소
5th row강남퍼스트안과
ValueCountFrequency (%)
당진pvc상사 2
 
1.0%
파란자원 1
 
0.5%
진기건설기계 1
 
0.5%
주)선진윈도우 1
 
0.5%
주)케이이엔지 1
 
0.5%
대성건설중기 1
 
0.5%
대유건설(주 1
 
0.5%
동우개발(주 1
 
0.5%
디아이종합건설(주 1
 
0.5%
명신종합건설(주 1
 
0.5%
Other values (187) 187
94.4%
2023-12-10T15:41:44.149516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54
 
3.5%
( 47
 
3.1%
) 47
 
3.1%
36
 
2.4%
32
 
2.1%
29
 
1.9%
24
 
1.6%
21
 
1.4%
21
 
1.4%
19
 
1.2%
Other values (353) 1197
78.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1385
90.7%
Open Punctuation 47
 
3.1%
Close Punctuation 47
 
3.1%
Uppercase Letter 36
 
2.4%
Other Punctuation 7
 
0.5%
Decimal Number 4
 
0.3%
Connector Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
3.9%
36
 
2.6%
32
 
2.3%
29
 
2.1%
24
 
1.7%
21
 
1.5%
21
 
1.5%
19
 
1.4%
16
 
1.2%
16
 
1.2%
Other values (327) 1117
80.6%
Uppercase Letter
ValueCountFrequency (%)
K 5
13.9%
C 4
11.1%
S 3
 
8.3%
A 3
 
8.3%
P 3
 
8.3%
T 2
 
5.6%
R 2
 
5.6%
E 2
 
5.6%
N 2
 
5.6%
V 2
 
5.6%
Other values (8) 8
22.2%
Decimal Number
ValueCountFrequency (%)
1 2
50.0%
2 1
25.0%
3 1
25.0%
Other Punctuation
ValueCountFrequency (%)
. 5
71.4%
& 2
 
28.6%
Open Punctuation
ValueCountFrequency (%)
( 47
100.0%
Close Punctuation
ValueCountFrequency (%)
) 47
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1385
90.7%
Common 106
 
6.9%
Latin 36
 
2.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
3.9%
36
 
2.6%
32
 
2.3%
29
 
2.1%
24
 
1.7%
21
 
1.5%
21
 
1.5%
19
 
1.4%
16
 
1.2%
16
 
1.2%
Other values (327) 1117
80.6%
Latin
ValueCountFrequency (%)
K 5
13.9%
C 4
11.1%
S 3
 
8.3%
A 3
 
8.3%
P 3
 
8.3%
T 2
 
5.6%
R 2
 
5.6%
E 2
 
5.6%
N 2
 
5.6%
V 2
 
5.6%
Other values (8) 8
22.2%
Common
ValueCountFrequency (%)
( 47
44.3%
) 47
44.3%
. 5
 
4.7%
& 2
 
1.9%
1 2
 
1.9%
2 1
 
0.9%
3 1
 
0.9%
_ 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1385
90.7%
ASCII 142
 
9.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
54
 
3.9%
36
 
2.6%
32
 
2.3%
29
 
2.1%
24
 
1.7%
21
 
1.5%
21
 
1.5%
19
 
1.4%
16
 
1.2%
16
 
1.2%
Other values (327) 1117
80.6%
ASCII
ValueCountFrequency (%)
( 47
33.1%
) 47
33.1%
K 5
 
3.5%
. 5
 
3.5%
C 4
 
2.8%
S 3
 
2.1%
A 3
 
2.1%
P 3
 
2.1%
& 2
 
1.4%
T 2
 
1.4%
Other values (16) 21
14.8%

420000
Real number (ℝ)

Distinct128
Distinct (%)64.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean682528.09
Minimum11300
Maximum980928
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:41:44.382835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11300
5-th percentile406681.95
Q1550225.5
median707802
Q3852426
95-th percentile962209
Maximum980928
Range969628
Interquartile range (IQR)302200.5

Descriptive statistics

Standard deviation209246.99
Coefficient of variation (CV)0.30657638
Kurtosis-0.51687085
Mean682528.09
Median Absolute Deviation (MAD)153411
Skewness-0.58856481
Sum1.3514056 × 108
Variance4.3784303 × 1010
MonotonicityNot monotonic
2023-12-10T15:41:44.605085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
420000 13
 
6.6%
839100 6
 
3.0%
422100 6
 
3.0%
962209 5
 
2.5%
572101 4
 
2.0%
803001 4
 
2.0%
452100 4
 
2.0%
852106 4
 
2.0%
892209 3
 
1.5%
550302 3
 
1.5%
Other values (118) 146
73.7%
ValueCountFrequency (%)
11300 1
 
0.5%
111299 1
 
0.5%
115101 1
 
0.5%
222306 1
 
0.5%
222315 1
 
0.5%
292200 2
 
1.0%
292912 1
 
0.5%
321002 1
 
0.5%
331213 1
 
0.5%
420000 13
6.6%
ValueCountFrequency (%)
980928 2
 
1.0%
980919 1
 
0.5%
965312 1
 
0.5%
964411 1
 
0.5%
963403 1
 
0.5%
962209 5
2.5%
962206 1
 
0.5%
902106 1
 
0.5%
902105 3
1.5%
901210 1
 
0.5%

Interactions

2023-12-10T15:41:40.438737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:41:40.101184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:41:40.621674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:41:40.275660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:41:44.742558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
20200506420000
202005061.0000.356
4200000.3561.000
2023-12-10T15:41:44.869447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
20200506420000
202005061.000-0.275
420000-0.2751.000

Missing values

2023-12-10T15:41:40.808225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:41:40.970484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

20200506건설업(종합)(주)에이비엠그린텍(주)에이비엠그린텍.1420000
020200506내의판매퍼스트올로이수점퍼스트올로이수점561304
120200506무역업(종합)블랙월인터내셔널블랙월인터네셔널491000
220200506반도체가공기계제조아이스팩코리아아이스팩코리아331213
320200506부동산<NA>시장공인중개사사무소803001
420200506안과강남퍼스트안과강남퍼스트안과861208
520200506영농조합<NA>화랑촌권역영농조합법인980919
620200506의류판매(종합)<NA>라뜰리에드레브561900
720200506의류판매(종합)스티유에이케이플라자수원점스티유AK플라자수원점561900
820200506자동차튜닝<NA>팀게러지592014
920200506족발.보쌈<NA>장충동왕족발북구점852104
20200506건설업(종합)(주)에이비엠그린텍(주)에이비엠그린텍.1420000
18820200526공기압축기제조.판매범한산업(주)범한산업(주)292912
18920200526공사.공단한국산업단지공단김해지사한국산업단지공단김해지사964411
19020200526공연기획및제작(주)투래빗스타(주)투래빗스타839957
19120200526공원생초국제조각공원생초국제조각공원893007
19220200526공인회계사박영인세무회계사무소박영인세무회계사무소831201
19320200526공작기계제조(종합)삼성공업사삼성공업사292200
19420200526공작기계제조(종합)이테크이테크292200
19520200526공해방지시설공사(종합)지엘엔텍(주)지엘엔텍(주)431100
19620200526과실.음료.향신작물생산(종합)홍천너래홍천너래11300
19720200526과일주스전문점뱌오쥬스뱌오쥬스854006