Overview

Dataset statistics

Number of variables18
Number of observations30
Missing cells1
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.5 KiB
Average record size in memory154.4 B

Variable types

Categorical10
Text4
Numeric3
DateTime1

Dataset

Description샘플 데이터
Author한국평가데이터㈜
URLhttps://bigdata-region.kr/#/dataset/cf8afba8-45ef-4dd2-ad9f-c8d634bac381

Alerts

기준년도 has constant value ""Constant
등록일자 has constant value ""Constant
작업자명 has constant value ""Constant
가공기업구분코드 is highly overall correlated with 가공기업구분High correlation
업종대분류코드 is highly overall correlated with 업종대분류명High correlation
가공기업구분 is highly overall correlated with 가공기업구분코드High correlation
업종대분류명 is highly overall correlated with 업종대분류코드High correlation
업력구간코드 is highly overall correlated with 평균액 and 2 other fieldsHigh correlation
평균액 is highly overall correlated with 업력구간코드 and 1 other fieldsHigh correlation
중위액 is highly overall correlated with 업력구간코드 and 1 other fieldsHigh correlation
업력구간 is highly overall correlated with 업력구간코드High correlation
총기업수 is highly imbalanced (73.5%)Imbalance
시군구명 has 1 (3.3%) missing valuesMissing
행정동명 has unique valuesUnique
평균액 has unique valuesUnique
중위액 has unique valuesUnique
업력구간코드 has 1 (3.3%) zerosZeros

Reproduction

Analysis started2023-12-10 14:14:58.549998
Analysis finished2023-12-10 14:15:02.703790
Duration4.15 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년도
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2016
30 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2016
2nd row2016
3rd row2016
4th row2016
5th row2016

Common Values

ValueCountFrequency (%)
2016 30
100.0%

Length

2023-12-10T23:15:02.812372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:15:02.990665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2016 30
100.0%

시도명
Categorical

Distinct10
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
경기
10 
서울
충북
부산
대구
Other values (5)

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique4 ?
Unique (%)13.3%

Sample

1st row충북
2nd row인천
3rd row경기
4th row서울
5th row대구

Common Values

ValueCountFrequency (%)
경기 10
33.3%
서울 6
20.0%
충북 3
 
10.0%
부산 3
 
10.0%
대구 2
 
6.7%
충남 2
 
6.7%
인천 1
 
3.3%
제주 1
 
3.3%
세종 1
 
3.3%
전남 1
 
3.3%

Length

2023-12-10T23:15:03.200650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:15:03.436266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기 10
33.3%
서울 6
20.0%
충북 3
 
10.0%
부산 3
 
10.0%
대구 2
 
6.7%
충남 2
 
6.7%
인천 1
 
3.3%
제주 1
 
3.3%
세종 1
 
3.3%
전남 1
 
3.3%

시군구명
Text

MISSING 

Distinct24
Distinct (%)82.8%
Missing1
Missing (%)3.3%
Memory size372.0 B
2023-12-10T23:15:03.796361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3.7586207
Min length2

Characters and Unicode

Total characters109
Distinct characters39
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)69.0%

Sample

1st row음성군
2nd row남동구
3rd row하남시
4th row강서구
5th row수성구
ValueCountFrequency (%)
강남구 3
 
8.8%
시흥시 2
 
5.9%
수성구 2
 
5.9%
화성시 2
 
5.9%
성남시 1
 
2.9%
음성군 1
 
2.9%
제천시 1
 
2.9%
포천시 1
 
2.9%
흥덕구 1
 
2.9%
청주시 1
 
2.9%
Other values (19) 19
55.9%
2023-12-10T23:15:04.300563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
15.6%
16
 
14.7%
7
 
6.4%
7
 
6.4%
5
 
4.6%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
Other values (29) 41
37.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 104
95.4%
Space Separator 5
 
4.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
16.3%
16
15.4%
7
 
6.7%
7
 
6.7%
4
 
3.8%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
Other values (28) 38
36.5%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 104
95.4%
Common 5
 
4.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
16.3%
16
15.4%
7
 
6.7%
7
 
6.7%
4
 
3.8%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
Other values (28) 38
36.5%
Common
ValueCountFrequency (%)
5
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 104
95.4%
ASCII 5
 
4.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
17
16.3%
16
15.4%
7
 
6.7%
7
 
6.7%
4
 
3.8%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
Other values (28) 38
36.5%
ASCII
ValueCountFrequency (%)
5
100.0%

행정동명
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:15:04.650542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length4.5
Mean length3.4666667
Min length2

Characters and Unicode

Total characters104
Distinct characters52
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row감곡면
2nd row논현2동
3rd row초이동
4th row가양1동
5th row고산2동
ValueCountFrequency (%)
감곡면 1
 
3.3%
논현2동 1
 
3.3%
창수면 1
 
3.3%
오송읍 1
 
3.3%
재송1동 1
 
3.3%
동탄1동 1
 
3.3%
군내면 1
 
3.3%
명동 1
 
3.3%
조치원읍 1
 
3.3%
논현1동 1
 
3.3%
Other values (20) 20
66.7%
2023-12-10T23:15:05.249004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
21.2%
1 8
 
7.7%
6
 
5.8%
4
 
3.8%
2 4
 
3.8%
3
 
2.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (42) 49
47.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 91
87.5%
Decimal Number 13
 
12.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
24.2%
6
 
6.6%
4
 
4.4%
3
 
3.3%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (39) 44
48.4%
Decimal Number
ValueCountFrequency (%)
1 8
61.5%
2 4
30.8%
3 1
 
7.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 91
87.5%
Common 13
 
12.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
24.2%
6
 
6.6%
4
 
4.4%
3
 
3.3%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (39) 44
48.4%
Common
ValueCountFrequency (%)
1 8
61.5%
2 4
30.8%
3 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 91
87.5%
ASCII 13
 
12.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
24.2%
6
 
6.6%
4
 
4.4%
3
 
3.3%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (39) 44
48.4%
ASCII
ValueCountFrequency (%)
1 8
61.5%
2 4
30.8%
3 1
 
7.7%

업종대분류코드
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
C
12 
L
G
J
A
Other values (4)

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique4 ?
Unique (%)13.3%

Sample

1st rowC
2nd rowG
3rd rowC
4th rowJ
5th rowJ

Common Values

ValueCountFrequency (%)
C 12
40.0%
L 5
16.7%
G 4
 
13.3%
J 3
 
10.0%
A 2
 
6.7%
H 1
 
3.3%
N 1
 
3.3%
R 1
 
3.3%
E 1
 
3.3%

Length

2023-12-10T23:15:05.472922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:15:05.686029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
c 12
40.0%
l 5
16.7%
g 4
 
13.3%
j 3
 
10.0%
a 2
 
6.7%
h 1
 
3.3%
n 1
 
3.3%
r 1
 
3.3%
e 1
 
3.3%
Distinct18
Distinct (%)60.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:15:05.949106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters90
Distinct characters19
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)36.7%

Sample

1st rowC30
2nd rowG46
3rd rowC23
4th rowJ58
5th rowJ58
ValueCountFrequency (%)
l68 5
16.7%
g46 3
 
10.0%
c29 3
 
10.0%
c30 2
 
6.7%
a01 2
 
6.7%
j58 2
 
6.7%
c23 2
 
6.7%
c28 1
 
3.3%
h52 1
 
3.3%
n74 1
 
3.3%
Other values (8) 8
26.7%
2023-12-10T23:15:06.429283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 13
14.4%
C 12
13.3%
6 10
11.1%
8 9
10.0%
3 6
 
6.7%
L 5
 
5.6%
4 5
 
5.6%
0 5
 
5.6%
G 4
 
4.4%
9 4
 
4.4%
Other values (9) 17
18.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 60
66.7%
Uppercase Letter 30
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 13
21.7%
6 10
16.7%
8 9
15.0%
3 6
10.0%
4 5
 
8.3%
0 5
 
8.3%
9 4
 
6.7%
5 4
 
6.7%
1 3
 
5.0%
7 1
 
1.7%
Uppercase Letter
ValueCountFrequency (%)
C 12
40.0%
L 5
16.7%
G 4
 
13.3%
J 3
 
10.0%
A 2
 
6.7%
E 1
 
3.3%
R 1
 
3.3%
N 1
 
3.3%
H 1
 
3.3%

Most occurring scripts

ValueCountFrequency (%)
Common 60
66.7%
Latin 30
33.3%

Most frequent character per script

Common
ValueCountFrequency (%)
2 13
21.7%
6 10
16.7%
8 9
15.0%
3 6
10.0%
4 5
 
8.3%
0 5
 
8.3%
9 4
 
6.7%
5 4
 
6.7%
1 3
 
5.0%
7 1
 
1.7%
Latin
ValueCountFrequency (%)
C 12
40.0%
L 5
16.7%
G 4
 
13.3%
J 3
 
10.0%
A 2
 
6.7%
E 1
 
3.3%
R 1
 
3.3%
N 1
 
3.3%
H 1
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 90
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 13
14.4%
C 12
13.3%
6 10
11.1%
8 9
10.0%
3 6
 
6.7%
L 5
 
5.6%
4 5
 
5.6%
0 5
 
5.6%
G 4
 
4.4%
9 4
 
4.4%
Other values (9) 17
18.9%

업종대분류명
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
제조업
12 
부동산업
도매 및 소매업
정보통신업
농업; 임업 및 어업
Other values (4)

Length

Max length24
Median length23
Mean length6.6333333
Min length3

Unique

Unique4 ?
Unique (%)13.3%

Sample

1st row제조업
2nd row도매 및 소매업
3rd row제조업
4th row정보통신업
5th row정보통신업

Common Values

ValueCountFrequency (%)
제조업 12
40.0%
부동산업 5
16.7%
도매 및 소매업 4
 
13.3%
정보통신업 3
 
10.0%
농업; 임업 및 어업 2
 
6.7%
운수 및 창고업 1
 
3.3%
사업시설 관리; 사업 지원 및 임대 서비스업 1
 
3.3%
예술; 스포츠 및 여가관련 서비스업 1
 
3.3%
수도; 하수 및 폐기물 처리; 원료 재생업 1
 
3.3%

Length

2023-12-10T23:15:06.729426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:15:06.999207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제조업 12
19.4%
10
16.1%
부동산업 5
 
8.1%
도매 4
 
6.5%
소매업 4
 
6.5%
정보통신업 3
 
4.8%
임업 2
 
3.2%
어업 2
 
3.2%
농업 2
 
3.2%
서비스업 2
 
3.2%
Other values (16) 16
25.8%
Distinct18
Distinct (%)60.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:15:07.413143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length15
Mean length11.366667
Min length2

Characters and Unicode

Total characters341
Distinct characters87
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)36.7%

Sample

1st row자동차 및 트레일러 제조업
2nd row도매 및 상품 중개업
3rd row비금속 광물제품 제조업
4th row출판업
5th row출판업
ValueCountFrequency (%)
17
 
16.8%
제조업 12
 
11.9%
부동산업 5
 
5.0%
기계 3
 
3.0%
장비 3
 
3.0%
도매 3
 
3.0%
상품 3
 
3.0%
중개업 3
 
3.0%
자동차 3
 
3.0%
서비스업 3
 
3.0%
Other values (38) 46
45.5%
2023-12-10T23:15:07.972024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
71
20.8%
31
 
9.1%
17
 
5.0%
17
 
5.0%
13
 
3.8%
10
 
2.9%
10
 
2.9%
8
 
2.3%
8
 
2.3%
; 7
 
2.1%
Other values (77) 149
43.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 263
77.1%
Space Separator 71
 
20.8%
Other Punctuation 7
 
2.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
11.8%
17
 
6.5%
17
 
6.5%
13
 
4.9%
10
 
3.8%
10
 
3.8%
8
 
3.0%
8
 
3.0%
7
 
2.7%
6
 
2.3%
Other values (75) 136
51.7%
Space Separator
ValueCountFrequency (%)
71
100.0%
Other Punctuation
ValueCountFrequency (%)
; 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 263
77.1%
Common 78
 
22.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
11.8%
17
 
6.5%
17
 
6.5%
13
 
4.9%
10
 
3.8%
10
 
3.8%
8
 
3.0%
8
 
3.0%
7
 
2.7%
6
 
2.3%
Other values (75) 136
51.7%
Common
ValueCountFrequency (%)
71
91.0%
; 7
 
9.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 263
77.1%
ASCII 78
 
22.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
71
91.0%
; 7
 
9.0%
Hangul
ValueCountFrequency (%)
31
 
11.8%
17
 
6.5%
17
 
6.5%
13
 
4.9%
10
 
3.8%
10
 
3.8%
8
 
3.0%
8
 
3.0%
7
 
2.7%
6
 
2.3%
Other values (75) 136
51.7%

가공기업구분코드
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
3
16 
4
11 
1
2
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)3.3%

Sample

1st row3
2nd row3
3rd row3
4th row4
5th row4

Common Values

ValueCountFrequency (%)
3 16
53.3%
4 11
36.7%
1 2
 
6.7%
2 1
 
3.3%

Length

2023-12-10T23:15:08.207035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:15:08.361774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 16
53.3%
4 11
36.7%
1 2
 
6.7%
2 1
 
3.3%

가공기업구분
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
중기업
16 
소기업
11 
대기업
중견기업
 
1

Length

Max length4
Median length3
Mean length3.0333333
Min length3

Unique

Unique1 ?
Unique (%)3.3%

Sample

1st row중기업
2nd row중기업
3rd row중기업
4th row소기업
5th row소기업

Common Values

ValueCountFrequency (%)
중기업 16
53.3%
소기업 11
36.7%
대기업 2
 
6.7%
중견기업 1
 
3.3%

Length

2023-12-10T23:15:08.521033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:15:08.685448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중기업 16
53.3%
소기업 11
36.7%
대기업 2
 
6.7%
중견기업 1
 
3.3%

업력구간코드
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct7
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.066667
Minimum0
Maximum30
Zeros1
Zeros (%)3.3%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:15:08.814062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1.45
Q15
median10
Q310
95-th percentile30
Maximum30
Range30
Interquartile range (IQR)5

Descriptive statistics

Standard deviation9.4938577
Coefficient of variation (CV)0.8578787
Kurtosis0.33816616
Mean11.066667
Median Absolute Deviation (MAD)5
Skewness1.2287401
Sum332
Variance90.133333
MonotonicityNot monotonic
2023-12-10T23:15:08.956705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
10 12
40.0%
5 7
23.3%
30 5
16.7%
2 3
 
10.0%
20 1
 
3.3%
0 1
 
3.3%
1 1
 
3.3%
ValueCountFrequency (%)
0 1
 
3.3%
1 1
 
3.3%
2 3
 
10.0%
5 7
23.3%
10 12
40.0%
20 1
 
3.3%
30 5
16.7%
ValueCountFrequency (%)
30 5
16.7%
20 1
 
3.3%
10 12
40.0%
5 7
23.3%
2 3
 
10.0%
1 1
 
3.3%
0 1
 
3.3%

업력구간
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
10년 이상 20년 미만
12 
5년 이상 10년 미만
30년 이상 40년 미만
2년 이상 5년 미만
20년 이상 30년 미만
 
1
Other values (2)

Length

Max length13
Median length13
Mean length12.233333
Min length5

Unique

Unique3 ?
Unique (%)10.0%

Sample

1st row5년 이상 10년 미만
2nd row10년 이상 20년 미만
3rd row10년 이상 20년 미만
4th row10년 이상 20년 미만
5th row2년 이상 5년 미만

Common Values

ValueCountFrequency (%)
10년 이상 20년 미만 12
40.0%
5년 이상 10년 미만 7
23.3%
30년 이상 40년 미만 5
16.7%
2년 이상 5년 미만 3
 
10.0%
20년 이상 30년 미만 1
 
3.3%
1년 미만 1
 
3.3%
1년 이상 2년 미만 1
 
3.3%

Length

2023-12-10T23:15:09.188246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:15:09.382550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미만 30
25.4%
이상 29
24.6%
10년 19
16.1%
20년 13
11.0%
5년 10
 
8.5%
30년 6
 
5.1%
40년 5
 
4.2%
2년 4
 
3.4%
1년 2
 
1.7%

항목명
Categorical

Distinct7
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
부채총계
자산총계
매출액
자본총계
자본금
Other values (2)

Length

Max length5
Median length4
Mean length3.8
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부채총계
2nd row부채총계
3rd row자본총계
4th row자산총계
5th row자본총계

Common Values

ValueCountFrequency (%)
부채총계 7
23.3%
자산총계 5
16.7%
매출액 5
16.7%
자본총계 4
13.3%
자본금 4
13.3%
당기순이익 3
10.0%
영업이익 2
 
6.7%

Length

2023-12-10T23:15:09.616276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:15:09.858155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부채총계 7
23.3%
자산총계 5
16.7%
매출액 5
16.7%
자본총계 4
13.3%
자본금 4
13.3%
당기순이익 3
10.0%
영업이익 2
 
6.7%

총기업수
Categorical

IMBALANCE 

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
1
28 
2
 
1
3
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique2 ?
Unique (%)6.7%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 28
93.3%
2 1
 
3.3%
3 1
 
3.3%

Length

2023-12-10T23:15:10.040964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:15:10.215664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 28
93.3%
2 1
 
3.3%
3 1
 
3.3%

평균액
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13892227
Minimum-7868896
Maximum51356752
Zeros0
Zeros (%)0.0%
Negative2
Negative (%)6.7%
Memory size402.0 B
2023-12-10T23:15:10.478328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-7868896
5-th percentile-975734.1
Q11824809.2
median11910492
Q320527926
95-th percentile38323339
Maximum51356752
Range59225648
Interquartile range (IQR)18703117

Descriptive statistics

Standard deviation13889112
Coefficient of variation (CV)0.99977576
Kurtosis0.65575717
Mean13892227
Median Absolute Deviation (MAD)9941054
Skewness0.91722446
Sum4.1676681 × 108
Variance1.9290743 × 1014
MonotonicityNot monotonic
2023-12-10T23:15:10.675881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
22861778 1
 
3.3%
21432932 1
 
3.3%
8423357 1
 
3.3%
17589788 1
 
3.3%
2646765 1
 
3.3%
51356752 1
 
3.3%
43123739 1
 
3.3%
1463 1
 
3.3%
7603888 1
 
3.3%
16655435 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
-7868896 1
3.3%
-1775259 1
3.3%
1463 1
3.3%
300000 1
3.3%
458009 1
3.3%
1350000 1
3.3%
1430000 1
3.3%
1550824 1
3.3%
2646765 1
3.3%
5514331 1
3.3%
ValueCountFrequency (%)
51356752 1
3.3%
43123739 1
3.3%
32456184 1
3.3%
30840801 1
3.3%
30768704 1
3.3%
26317084 1
3.3%
22861778 1
3.3%
21432932 1
3.3%
17812909 1
3.3%
17589788 1
3.3%

중위액
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13951458
Minimum-7868896
Maximum51356752
Zeros0
Zeros (%)0.0%
Negative1
Negative (%)3.3%
Memory size402.0 B
2023-12-10T23:15:11.007223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-7868896
5-th percentile1562
Q11824809.2
median11910492
Q320527926
95-th percentile38323339
Maximum51356752
Range59225648
Interquartile range (IQR)18703117

Descriptive statistics

Standard deviation13823627
Coefficient of variation (CV)0.99083741
Kurtosis0.68671792
Mean13951458
Median Absolute Deviation (MAD)9941054
Skewness0.93285331
Sum4.1854375 × 108
Variance1.9109266 × 1014
MonotonicityNot monotonic
2023-12-10T23:15:11.350421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
22861778 1
 
3.3%
21432932 1
 
3.3%
8423357 1
 
3.3%
17589788 1
 
3.3%
2646765 1
 
3.3%
51356752 1
 
3.3%
43123739 1
 
3.3%
1463 1
 
3.3%
7603888 1
 
3.3%
16655435 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
-7868896 1
3.3%
1463 1
3.3%
1683 1
3.3%
300000 1
3.3%
458009 1
3.3%
1350000 1
3.3%
1430000 1
3.3%
1550824 1
3.3%
2646765 1
3.3%
5514331 1
3.3%
ValueCountFrequency (%)
51356752 1
3.3%
43123739 1
3.3%
32456184 1
3.3%
30840801 1
3.3%
30768704 1
3.3%
26317084 1
3.3%
22861778 1
3.3%
21432932 1
3.3%
17812909 1
3.3%
17589788 1
3.3%

등록일자
Date

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
Minimum2021-10-19 00:00:00
Maximum2021-10-19 00:00:00
2023-12-10T23:15:11.944156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:15:12.102174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

작업자명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
KEDSYS
30 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKEDSYS
2nd rowKEDSYS
3rd rowKEDSYS
4th rowKEDSYS
5th rowKEDSYS

Common Values

ValueCountFrequency (%)
KEDSYS 30
100.0%

Length

2023-12-10T23:15:12.398142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:15:12.612987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kedsys 30
100.0%

Interactions

2023-12-10T23:15:01.155509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:15:00.190832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:15:00.658757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:15:01.351634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:15:00.357290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:15:00.811040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:15:01.600460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:15:00.509364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:15:00.960187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:15:12.769413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도명시군구명행정동명업종대분류코드업종중분류코드업종대분류명업종중분류명가공기업구분코드가공기업구분업력구간코드업력구간항목명총기업수평균액중위액
시도명1.0001.0001.0000.6300.0000.6300.0000.0000.0000.5820.6760.0000.0000.0000.000
시군구명1.0001.0001.0000.8730.9270.8730.9270.6130.6130.4880.1200.4490.0000.7900.790
행정동명1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
업종대분류코드0.6300.8731.0001.0001.0001.0001.0000.0000.0000.7340.6390.0000.0000.0000.000
업종중분류코드0.0000.9271.0001.0001.0001.0001.0000.7500.7500.7610.0000.0000.0000.8290.829
업종대분류명0.6300.8731.0001.0001.0001.0001.0000.0000.0000.7340.6390.0000.0000.0000.000
업종중분류명0.0000.9271.0001.0001.0001.0001.0000.7500.7500.7610.0000.0000.0000.8290.829
가공기업구분코드0.0000.6131.0000.0000.7500.0000.7501.0001.0000.3580.1680.4030.0000.0640.064
가공기업구분0.0000.6131.0000.0000.7500.0000.7501.0001.0000.3580.1680.4030.0000.0640.064
업력구간코드0.5820.4881.0000.7340.7610.7340.7610.3580.3581.0001.0000.1920.0000.4270.427
업력구간0.6760.1201.0000.6390.0000.6390.0000.1680.1681.0001.0000.5180.0000.0000.000
항목명0.0000.4491.0000.0000.0000.0000.0000.4030.4030.1920.5181.0000.3930.3000.300
총기업수0.0000.0001.0000.0000.0000.0000.0000.0000.0000.0000.0000.3931.0000.0000.000
평균액0.0000.7901.0000.0000.8290.0000.8290.0640.0640.4270.0000.3000.0001.0001.000
중위액0.0000.7901.0000.0000.8290.0000.8290.0640.0640.4270.0000.3000.0001.0001.000
2023-12-10T23:15:13.271377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
가공기업구분코드총기업수업종대분류코드시도명업력구간가공기업구분항목명업종대분류명
가공기업구분코드1.0000.0000.0000.0000.0661.0000.2580.000
총기업수0.0001.0000.0000.0000.0000.0000.2530.000
업종대분류코드0.0000.0001.0000.3210.3760.0000.0001.000
시도명0.0000.0000.3211.0000.3840.0000.0000.321
업력구간0.0660.0000.3760.3841.0000.0660.1800.376
가공기업구분1.0000.0000.0000.0000.0661.0000.2580.000
항목명0.2580.2530.0000.0000.1800.2581.0000.000
업종대분류명0.0000.0001.0000.3210.3760.0000.0001.000
2023-12-10T23:15:13.520753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업력구간코드평균액중위액시도명업종대분류코드업종대분류명가공기업구분코드가공기업구분업력구간항목명총기업수
업력구간코드1.0000.5180.5210.2270.4870.4870.2850.2850.9590.0780.000
평균액0.5181.0001.0000.0000.0000.0000.0000.0000.3390.1910.000
중위액0.5211.0001.0000.0000.0000.0000.0000.0000.3390.1910.000
시도명0.2270.0000.0001.0000.3210.3210.0000.0000.3840.0000.000
업종대분류코드0.4870.0000.0000.3211.0001.0000.0000.0000.3760.0000.000
업종대분류명0.4870.0000.0000.3211.0001.0000.0000.0000.3760.0000.000
가공기업구분코드0.2850.0000.0000.0000.0000.0001.0001.0000.0660.2580.000
가공기업구분0.2850.0000.0000.0000.0000.0001.0001.0000.0660.2580.000
업력구간0.9590.3390.3390.3840.3760.3760.0660.0661.0000.1800.000
항목명0.0780.1910.1910.0000.0000.0000.2580.2580.1801.0000.253
총기업수0.0000.0000.0000.0000.0000.0000.0000.0000.0000.2531.000

Missing values

2023-12-10T23:15:01.936100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:15:02.533651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준년도시도명시군구명행정동명업종대분류코드업종중분류코드업종대분류명업종중분류명가공기업구분코드가공기업구분업력구간코드업력구간항목명총기업수평균액중위액등록일자작업자명
02016충북음성군감곡면CC30제조업자동차 및 트레일러 제조업3중기업55년 이상 10년 미만부채총계122861778228617782021-10-19KEDSYS
12016인천남동구논현2동GG46도매 및 소매업도매 및 상품 중개업3중기업1010년 이상 20년 미만부채총계130768704307687042021-10-19KEDSYS
22016경기하남시초이동CC23제조업비금속 광물제품 제조업3중기업1010년 이상 20년 미만자본총계110028348100283482021-10-19KEDSYS
32016서울강서구가양1동JJ58정보통신업출판업4소기업1010년 이상 20년 미만자산총계112736007127360072021-10-19KEDSYS
42016대구수성구고산2동JJ58정보통신업출판업4소기업22년 이상 5년 미만자본총계14580094580092021-10-19KEDSYS
52016충남천안시 서북구성거읍HH52운수 및 창고업창고 및 운송관련 서비스업3중기업2020년 이상 30년 미만자본금1135000013500002021-10-19KEDSYS
62016경기화성시봉담읍LL68부동산업부동산업4소기업55년 이상 10년 미만자산총계117382707173827072021-10-19KEDSYS
72016부산사하구신평1동CC30제조업자동차 및 트레일러 제조업3중기업1010년 이상 20년 미만영업이익111959832119598322021-10-19KEDSYS
82016경기수원시 장안구파장동CC29제조업기타 기계 및 장비 제조업3중기업1010년 이상 20년 미만매출액132456184324561842021-10-19KEDSYS
92016부산동래구수민동NN74사업시설 관리; 사업 지원 및 임대 서비스업사업시설 관리 및 조경 서비스업3중기업1010년 이상 20년 미만매출액126317084263170842021-10-19KEDSYS
기준년도시도명시군구명행정동명업종대분류코드업종중분류코드업종대분류명업종중분류명가공기업구분코드가공기업구분업력구간코드업력구간항목명총기업수평균액중위액등록일자작업자명
202016서울강남구삼성2동LL68부동산업부동산업4소기업55년 이상 10년 미만당기순이익3-177525916832021-10-19KEDSYS
212016서울강남구논현1동GG46도매 및 소매업도매 및 상품 중개업1대기업3030년 이상 40년 미만자본금1155082415508242021-10-19KEDSYS
222016세종<NA>조치원읍CC23제조업비금속 광물제품 제조업3중기업3030년 이상 40년 미만자산총계116655435166554352021-10-19KEDSYS
232016서울중구명동LL68부동산업부동산업1대기업1010년 이상 20년 미만영업이익1760388876038882021-10-19KEDSYS
242016전남진도군군내면AA01농업; 임업 및 어업농업4소기업11년 이상 2년 미만부채총계1146314632021-10-19KEDSYS
252016경기화성시동탄1동CC26제조업전자부품; 컴퓨터; 영상; 음향 및 통신장비 제조업3중기업3030년 이상 40년 미만부채총계143123739431237392021-10-19KEDSYS
262016부산해운대구재송1동GG45도매 및 소매업자동차 및 부품 판매업3중기업3030년 이상 40년 미만자산총계151356752513567522021-10-19KEDSYS
272016충북청주시 흥덕구오송읍JJ62정보통신업컴퓨터 프로그래밍; 시스템 통합 및 관리업3중기업1010년 이상 20년 미만부채총계1264676526467652021-10-19KEDSYS
282016경기포천시창수면EE38수도; 하수 및 폐기물 처리; 원료 재생업폐기물 수집; 운반; 처리 및 원료 재생업3중기업55년 이상 10년 미만자산총계117589788175897882021-10-19KEDSYS
292016경기남양주시진접읍CC22제조업고무 및 플라스틱제품 제조업3중기업1010년 이상 20년 미만자본총계1842335784233572021-10-19KEDSYS