Overview

Dataset statistics

Number of variables6
Number of observations1199
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory57.5 KiB
Average record size in memory49.1 B

Variable types

Numeric1
Categorical3
Text2

Dataset

Description본 데이터는 충남 건설기계 사업자 현황에 대한 데이터로 영업상태 상호 사업유형 등록종별 주소(영업상태 상호 사업유형 등록종별 주소) 등의 항목을 제공합니다
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=43&beforeMenuCd=DOM_000000201001001000&publicdatapk=15114206

Alerts

상태 has constant value ""Constant
등록종별 is highly overall correlated with 사업유형High correlation
사업유형 is highly overall correlated with 등록종별High correlation
순번 has unique valuesUnique

Reproduction

Analysis started2024-01-09 20:28:40.491480
Analysis finished2024-01-09 20:28:41.142344
Duration0.65 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct1199
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean600
Minimum1
Maximum1199
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.7 KiB
2024-01-10T05:28:41.218716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile60.9
Q1300.5
median600
Q3899.5
95-th percentile1139.1
Maximum1199
Range1198
Interquartile range (IQR)599

Descriptive statistics

Standard deviation346.26579
Coefficient of variation (CV)0.57710966
Kurtosis-1.2
Mean600
Median Absolute Deviation (MAD)300
Skewness0
Sum719400
Variance119900
MonotonicityStrictly increasing
2024-01-10T05:28:41.344633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
807 1
 
0.1%
805 1
 
0.1%
804 1
 
0.1%
803 1
 
0.1%
802 1
 
0.1%
801 1
 
0.1%
800 1
 
0.1%
799 1
 
0.1%
798 1
 
0.1%
Other values (1189) 1189
99.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1199 1
0.1%
1198 1
0.1%
1197 1
0.1%
1196 1
0.1%
1195 1
0.1%
1194 1
0.1%
1193 1
0.1%
1192 1
0.1%
1191 1
0.1%
1190 1
0.1%

상태
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.5 KiB
영업
1199 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row영업
3rd row영업
4th row영업
5th row영업

Common Values

ValueCountFrequency (%)
영업 1199
100.0%

Length

2024-01-10T05:28:41.461152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:28:41.544056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업 1199
100.0%
Distinct1067
Distinct (%)89.0%
Missing0
Missing (%)0.0%
Memory size9.5 KiB
2024-01-10T05:28:41.712501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length6.9933278
Min length3

Characters and Unicode

Total characters8385
Distinct characters350
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique982 ?
Unique (%)81.9%

Sample

1st row(주)혜인
2nd row대양건기(주)
3rd row신창현대서비스
4th row애향자동차정비공업(주)
5th row놀뫼자동차공업사
ValueCountFrequency (%)
주식회사 53
 
4.1%
일광건설중기 15
 
1.2%
쌍용중기 7
 
0.5%
태양중기 4
 
0.3%
우림중기 4
 
0.3%
개미중기 4
 
0.3%
우리중기 4
 
0.3%
주)혜인 4
 
0.3%
논산지점 3
 
0.2%
대성중기 3
 
0.2%
Other values (1075) 1184
92.1%
2024-01-10T05:28:42.026404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
615
 
7.3%
468
 
5.6%
440
 
5.2%
( 387
 
4.6%
) 387
 
4.6%
287
 
3.4%
233
 
2.8%
222
 
2.6%
194
 
2.3%
178
 
2.1%
Other values (340) 4974
59.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7467
89.1%
Open Punctuation 387
 
4.6%
Close Punctuation 387
 
4.6%
Space Separator 86
 
1.0%
Uppercase Letter 23
 
0.3%
Decimal Number 22
 
0.3%
Other Punctuation 6
 
0.1%
Other Symbol 5
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
615
 
8.2%
468
 
6.3%
440
 
5.9%
287
 
3.8%
233
 
3.1%
222
 
3.0%
194
 
2.6%
178
 
2.4%
175
 
2.3%
139
 
1.9%
Other values (311) 4516
60.5%
Uppercase Letter
ValueCountFrequency (%)
S 4
17.4%
C 4
17.4%
E 3
13.0%
J 2
8.7%
D 2
8.7%
P 1
 
4.3%
H 1
 
4.3%
M 1
 
4.3%
I 1
 
4.3%
G 1
 
4.3%
Other values (3) 3
13.0%
Decimal Number
ValueCountFrequency (%)
1 12
54.5%
5 4
 
18.2%
0 2
 
9.1%
3 1
 
4.5%
4 1
 
4.5%
2 1
 
4.5%
8 1
 
4.5%
Other Punctuation
ValueCountFrequency (%)
. 2
33.3%
· 2
33.3%
& 1
16.7%
, 1
16.7%
Open Punctuation
ValueCountFrequency (%)
( 387
100.0%
Close Punctuation
ValueCountFrequency (%)
) 387
100.0%
Space Separator
ValueCountFrequency (%)
86
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7472
89.1%
Common 890
 
10.6%
Latin 23
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
615
 
8.2%
468
 
6.3%
440
 
5.9%
287
 
3.8%
233
 
3.1%
222
 
3.0%
194
 
2.6%
178
 
2.4%
175
 
2.3%
139
 
1.9%
Other values (312) 4521
60.5%
Common
ValueCountFrequency (%)
( 387
43.5%
) 387
43.5%
86
 
9.7%
1 12
 
1.3%
5 4
 
0.4%
. 2
 
0.2%
· 2
 
0.2%
0 2
 
0.2%
- 2
 
0.2%
3 1
 
0.1%
Other values (5) 5
 
0.6%
Latin
ValueCountFrequency (%)
S 4
17.4%
C 4
17.4%
E 3
13.0%
J 2
8.7%
D 2
8.7%
P 1
 
4.3%
H 1
 
4.3%
M 1
 
4.3%
I 1
 
4.3%
G 1
 
4.3%
Other values (3) 3
13.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7467
89.1%
ASCII 911
 
10.9%
None 7
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
615
 
8.2%
468
 
6.3%
440
 
5.9%
287
 
3.8%
233
 
3.1%
222
 
3.0%
194
 
2.6%
178
 
2.4%
175
 
2.3%
139
 
1.9%
Other values (311) 4516
60.5%
ASCII
ValueCountFrequency (%)
( 387
42.5%
) 387
42.5%
86
 
9.4%
1 12
 
1.3%
5 4
 
0.4%
S 4
 
0.4%
C 4
 
0.4%
E 3
 
0.3%
. 2
 
0.2%
0 2
 
0.2%
Other values (17) 20
 
2.2%
None
ValueCountFrequency (%)
5
71.4%
· 2
 
28.6%

사업유형
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size9.5 KiB
대여업
771 
정비업
247 
매매업
133 
해체재활용업
 
36
등록번호제작자
 
12

Length

Max length7
Median length3
Mean length3.1301084
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정비업
2nd row정비업
3rd row정비업
4th row정비업
5th row정비업

Common Values

ValueCountFrequency (%)
대여업 771
64.3%
정비업 247
 
20.6%
매매업 133
 
11.1%
해체재활용업 36
 
3.0%
등록번호제작자 12
 
1.0%

Length

2024-01-10T05:28:42.170961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:28:42.281413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대여업 771
64.3%
정비업 247
 
20.6%
매매업 133
 
11.1%
해체재활용업 36
 
3.0%
등록번호제작자 12
 
1.0%

등록종별
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size9.5 KiB
개별
499 
일반
272 
<NA>
181 
종합(덤프 및 믹서트럭)
96 
부분(일반)
81 
Other values (9)
70 

Length

Max length19
Median length2
Mean length3.7898249
Min length2

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row종합(전기종)
2nd row종합(전기종)
3rd row종합(덤프 및 믹서트럭)
4th row종합(덤프 및 믹서트럭)
5th row종합(덤프 및 믹서트럭)

Common Values

ValueCountFrequency (%)
개별 499
41.6%
일반 272
22.7%
<NA> 181
 
15.1%
종합(덤프 및 믹서트럭) 96
 
8.0%
부분(일반) 81
 
6.8%
전문(유압) 24
 
2.0%
종합(지게차) 19
 
1.6%
부분(타이어식은 종합) 8
 
0.7%
종합(전기종) 7
 
0.6%
종합(굴착기) 4
 
0.3%
Other values (4) 8
 
0.7%

Length

2024-01-10T05:28:42.392791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
개별 499
35.5%
일반 272
19.4%
na 181
 
12.9%
종합(덤프 96
 
6.8%
96
 
6.8%
믹서트럭 96
 
6.8%
부분(일반 81
 
5.8%
전문(유압 24
 
1.7%
종합(지게차 19
 
1.4%
종합 11
 
0.8%
Other values (8) 30
 
2.1%

주소
Text

Distinct886
Distinct (%)73.9%
Missing0
Missing (%)0.0%
Memory size9.5 KiB
2024-01-10T05:28:42.638328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length46
Mean length23.386989
Min length16

Characters and Unicode

Total characters28041
Distinct characters331
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique790 ?
Unique (%)65.9%

Sample

1st row충청남도 천안시 서북구 2공단5로 23(차암동)
2nd row충청남도 공주시 월송동현로 254-10(동현동)
3rd row충청남도 공주시 우금티로 616(금학동)
4th row충청남도 서산시 음암면 중앙로 548
5th row충청남도 논산시 해월로 258(반월동)
ValueCountFrequency (%)
충청남도 1199
 
19.3%
예산군 222
 
3.6%
삽교읍 183
 
3.0%
도청대로 182
 
2.9%
1182 182
 
2.9%
서산시 166
 
2.7%
당진시 139
 
2.2%
천안시 132
 
2.1%
아산시 105
 
1.7%
보령시 69
 
1.1%
Other values (1632) 3624
58.4%
2024-01-10T05:28:43.053982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5013
 
17.9%
1410
 
5.0%
1390
 
5.0%
1348
 
4.8%
1 1245
 
4.4%
1241
 
4.4%
795
 
2.8%
759
 
2.7%
758
 
2.7%
2 752
 
2.7%
Other values (321) 13330
47.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17441
62.2%
Space Separator 5013
 
17.9%
Decimal Number 4591
 
16.4%
Close Punctuation 279
 
1.0%
Open Punctuation 279
 
1.0%
Dash Punctuation 255
 
0.9%
Other Punctuation 171
 
0.6%
Uppercase Letter 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1410
 
8.1%
1390
 
8.0%
1348
 
7.7%
1241
 
7.1%
795
 
4.6%
759
 
4.4%
758
 
4.3%
493
 
2.8%
472
 
2.7%
470
 
2.7%
Other values (298) 8305
47.6%
Decimal Number
ValueCountFrequency (%)
1 1245
27.1%
2 752
16.4%
3 450
 
9.8%
8 425
 
9.3%
4 332
 
7.2%
5 331
 
7.2%
0 320
 
7.0%
6 269
 
5.9%
7 238
 
5.2%
9 229
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
A 6
50.0%
B 2
 
16.7%
V 1
 
8.3%
P 1
 
8.3%
D 1
 
8.3%
S 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
, 166
97.1%
/ 3
 
1.8%
@ 2
 
1.2%
Space Separator
ValueCountFrequency (%)
5013
100.0%
Close Punctuation
ValueCountFrequency (%)
) 279
100.0%
Open Punctuation
ValueCountFrequency (%)
( 279
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 255
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17441
62.2%
Common 10588
37.8%
Latin 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1410
 
8.1%
1390
 
8.0%
1348
 
7.7%
1241
 
7.1%
795
 
4.6%
759
 
4.4%
758
 
4.3%
493
 
2.8%
472
 
2.7%
470
 
2.7%
Other values (298) 8305
47.6%
Common
ValueCountFrequency (%)
5013
47.3%
1 1245
 
11.8%
2 752
 
7.1%
3 450
 
4.3%
8 425
 
4.0%
4 332
 
3.1%
5 331
 
3.1%
0 320
 
3.0%
) 279
 
2.6%
( 279
 
2.6%
Other values (7) 1162
 
11.0%
Latin
ValueCountFrequency (%)
A 6
50.0%
B 2
 
16.7%
V 1
 
8.3%
P 1
 
8.3%
D 1
 
8.3%
S 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 17441
62.2%
ASCII 10600
37.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5013
47.3%
1 1245
 
11.7%
2 752
 
7.1%
3 450
 
4.2%
8 425
 
4.0%
4 332
 
3.1%
5 331
 
3.1%
0 320
 
3.0%
) 279
 
2.6%
( 279
 
2.6%
Other values (13) 1174
 
11.1%
Hangul
ValueCountFrequency (%)
1410
 
8.1%
1390
 
8.0%
1348
 
7.7%
1241
 
7.1%
795
 
4.6%
759
 
4.4%
758
 
4.3%
493
 
2.8%
472
 
2.7%
470
 
2.7%
Other values (298) 8305
47.6%

Interactions

2024-01-10T05:28:40.899183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:28:43.145427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업유형등록종별
순번1.0000.5560.464
사업유형0.5561.0001.000
등록종별0.4641.0001.000
2024-01-10T05:28:43.224832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록종별사업유형
등록종별1.0000.995
사업유형0.9951.000
2024-01-10T05:28:43.308577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업유형등록종별
순번1.0000.2620.212
사업유형0.2621.0000.995
등록종별0.2120.9951.000

Missing values

2024-01-10T05:28:40.995125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:28:41.097013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번상태상호(명칭)사업유형등록종별주소
01영업(주)혜인정비업종합(전기종)충청남도 천안시 서북구 2공단5로 23(차암동)
12영업대양건기(주)정비업종합(전기종)충청남도 공주시 월송동현로 254-10(동현동)
23영업신창현대서비스정비업종합(덤프 및 믹서트럭)충청남도 공주시 우금티로 616(금학동)
34영업애향자동차정비공업(주)정비업종합(덤프 및 믹서트럭)충청남도 서산시 음암면 중앙로 548
45영업놀뫼자동차공업사정비업종합(덤프 및 믹서트럭)충청남도 논산시 해월로 258(반월동)
56영업(주)칠갑정비정비업종합(덤프 및 믹서트럭)충청남도 청양군 청양읍 칠갑산로 343
67영업합덕자동차공업사정비업종합(덤프 및 믹서트럭)충청남도 당진시 합덕읍 덕평로 462-9
78영업동신자동차공업(주)정비업종합(덤프 및 믹서트럭)충청남도 보령시 보령남로 144(명천동)
89영업신례원자동차정비공장정비업종합(덤프 및 믹서트럭)충청남도 예산군 예산읍 충서로 1298
910영업(주)홍성자동차정비업종합(덤프 및 믹서트럭)충청남도 홍성군 홍성읍 대교리 46외 2필지
순번상태상호(명칭)사업유형등록종별주소
11891190영업(주)충남중기산업매매업<NA>충청남도 당진시 합덕읍 예덕로 113
11901191영업(주)정도기업매매업<NA>충청남도 당진시 서해로 6163-18(시곡동)
11911192영업대지중기(주)대여업일반충청남도 계룡시 엄사면 유동리 238번지
11921193영업논산건설기계대여업개별충청남도 논산시 연산면 선비로434번길 4
11931194영업폐지된사업자(계룡시)대여업일반충청남도 계룡시 엄사면 유동리 238번지
11941195영업대조종합건기(주) - 계룡지점대여업일반충청남도 계룡시 서금암5길 9, 101호(금암동, 신성2차미소지움 상가)
11951196영업계룡기업사등록번호제작자<NA>충청남도 계룡시 금암동 277번지 10호
11961197영업오케이퓨처(주)정비업전문(유압)충청남도 계룡시 두마면 왕대리 154번지
11971198영업신화중기정비공업사정비업부분(일반)충청남도 계룡시 두마면 왕대리 243번지
11981199영업오케이퓨쳐 주식회사정비업부분(일반)충청남도 계룡시 두마면 제1산단로 25-57