Overview

Dataset statistics

Number of variables13
Number of observations112
Missing cells78
Missing cells (%)5.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.7 KiB
Average record size in memory107.1 B

Variable types

Numeric1
Text5
Categorical6
DateTime1

Dataset

Description전북특별자치도 행정소송.처분 현황(업체명, 대표자, 지역, 주소, 처리일자, 업종면허, 등록번호, 위반내용, 처분근거, 처분내용 등)
Author전북특별자치도
URLhttps://www.data.go.kr/data/3081476/fileData.do

Alerts

지역 has constant value ""Constant
처분(원) is highly overall correlated with 번호 and 2 other fieldsHigh correlation
처분근거 is highly overall correlated with 번호 and 2 other fieldsHigh correlation
처분내용 is highly overall correlated with 위반내용 and 2 other fieldsHigh correlation
위반내용 is highly overall correlated with 번호 and 3 other fieldsHigh correlation
번호 is highly overall correlated with 위반내용 and 2 other fieldsHigh correlation
처분(원) is highly imbalanced (67.0%)Imbalance
처분_말소기간 has 78 (69.6%) missing valuesMissing

Reproduction

Analysis started2024-03-14 15:19:30.675413
Analysis finished2024-03-14 15:19:33.237375
Duration2.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION 

Distinct108
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54.517857
Minimum1
Maximum108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-03-15T00:19:33.377084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.55
Q126.75
median54.5
Q382.25
95-th percentile102.45
Maximum108
Range107
Interquartile range (IQR)55.5

Descriptive statistics

Standard deviation31.660355
Coefficient of variation (CV)0.58073366
Kurtosis-1.2391611
Mean54.517857
Median Absolute Deviation (MAD)28
Skewness-0.0027805664
Sum6106
Variance1002.3781
MonotonicityIncreasing
2024-03-15T00:19:33.637961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
25 2
 
1.8%
7 2
 
1.8%
89 2
 
1.8%
99 2
 
1.8%
1 1
 
0.9%
71 1
 
0.9%
80 1
 
0.9%
79 1
 
0.9%
78 1
 
0.9%
77 1
 
0.9%
Other values (98) 98
87.5%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 2
1.8%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
108 1
0.9%
107 1
0.9%
106 1
0.9%
105 1
0.9%
104 1
0.9%
103 1
0.9%
102 1
0.9%
101 1
0.9%
100 1
0.9%
99 2
1.8%
Distinct84
Distinct (%)75.0%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2024-03-15T00:19:34.504253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length8.4285714
Min length5

Characters and Unicode

Total characters944
Distinct characters95
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique69 ?
Unique (%)61.6%

Sample

1st row(유)신세계건설
2nd row(주)대흥종합건설
3rd row수봉종합건설(주)
4th row(유)세종종합건설
5th row(유)강산토건
ValueCountFrequency (%)
미가에이엔씨플레닝(주 6
 
5.4%
주)삼목토건 5
 
4.5%
한림종합건설(주 5
 
4.5%
유)에스에스지건설 4
 
3.6%
주)제이비종합건설 3
 
2.7%
주)리뉴종합건설 2
 
1.8%
유)다일건설 2
 
1.8%
정재건설(주 2
 
1.8%
주)에스지종합건설 2
 
1.8%
주)삼부종합건설 2
 
1.8%
Other values (74) 79
70.5%
2024-03-15T00:19:35.599535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 112
 
11.9%
( 112
 
11.9%
95
 
10.1%
85
 
9.0%
72
 
7.6%
56
 
5.9%
55
 
5.8%
39
 
4.1%
17
 
1.8%
13
 
1.4%
Other values (85) 288
30.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 720
76.3%
Close Punctuation 112
 
11.9%
Open Punctuation 112
 
11.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
95
 
13.2%
85
 
11.8%
72
 
10.0%
56
 
7.8%
55
 
7.6%
39
 
5.4%
17
 
2.4%
13
 
1.8%
13
 
1.8%
11
 
1.5%
Other values (83) 264
36.7%
Close Punctuation
ValueCountFrequency (%)
) 112
100.0%
Open Punctuation
ValueCountFrequency (%)
( 112
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 720
76.3%
Common 224
 
23.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
95
 
13.2%
85
 
11.8%
72
 
10.0%
56
 
7.8%
55
 
7.6%
39
 
5.4%
17
 
2.4%
13
 
1.8%
13
 
1.8%
11
 
1.5%
Other values (83) 264
36.7%
Common
ValueCountFrequency (%)
) 112
50.0%
( 112
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 720
76.3%
ASCII 224
 
23.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 112
50.0%
( 112
50.0%
Hangul
ValueCountFrequency (%)
95
 
13.2%
85
 
11.8%
72
 
10.0%
56
 
7.8%
55
 
7.6%
39
 
5.4%
17
 
2.4%
13
 
1.8%
13
 
1.8%
11
 
1.5%
Other values (83) 264
36.7%
Distinct75
Distinct (%)67.0%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2024-03-15T00:19:36.501493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3.0357143
Min length3

Characters and Unicode

Total characters340
Distinct characters76
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)50.0%

Sample

1st row김*용
2nd row김*봉
3rd row황*주
4th row육*식
5th row이*순
ValueCountFrequency (%)
이*호 7
 
6.2%
이*열 6
 
5.4%
백*민 6
 
5.4%
윤*민 4
 
3.6%
윤*호 3
 
2.7%
한*수 3
 
2.7%
이*진 3
 
2.7%
소*수 2
 
1.8%
김*옥 2
 
1.8%
신*곤 2
 
1.8%
Other values (65) 74
66.1%
2024-03-15T00:19:37.745355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 113
33.2%
23
 
6.8%
23
 
6.8%
12
 
3.5%
11
 
3.2%
9
 
2.6%
7
 
2.1%
7
 
2.1%
6
 
1.8%
6
 
1.8%
Other values (66) 123
36.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 226
66.5%
Other Punctuation 114
33.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
10.2%
23
 
10.2%
12
 
5.3%
11
 
4.9%
9
 
4.0%
7
 
3.1%
7
 
3.1%
6
 
2.7%
6
 
2.7%
6
 
2.7%
Other values (64) 116
51.3%
Other Punctuation
ValueCountFrequency (%)
* 113
99.1%
, 1
 
0.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 226
66.5%
Common 114
33.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
10.2%
23
 
10.2%
12
 
5.3%
11
 
4.9%
9
 
4.0%
7
 
3.1%
7
 
3.1%
6
 
2.7%
6
 
2.7%
6
 
2.7%
Other values (64) 116
51.3%
Common
ValueCountFrequency (%)
* 113
99.1%
, 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 226
66.5%
ASCII 114
33.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 113
99.1%
, 1
 
0.9%
Hangul
ValueCountFrequency (%)
23
 
10.2%
23
 
10.2%
12
 
5.3%
11
 
4.9%
9
 
4.0%
7
 
3.1%
7
 
3.1%
6
 
2.7%
6
 
2.7%
6
 
2.7%
Other values (64) 116
51.3%

지역
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
전북
112 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전북
2nd row전북
3rd row전북
4th row전북
5th row전북

Common Values

ValueCountFrequency (%)
전북 112
100.0%

Length

2024-03-15T00:19:37.966988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:19:38.124237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전북 112
100.0%

주소
Text

Distinct84
Distinct (%)75.0%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2024-03-15T00:19:39.007532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length47
Mean length37.482143
Min length25

Characters and Unicode

Total characters4198
Distinct characters179
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique69 ?
Unique (%)61.6%

Sample

1st row(560-810) 전라북도 전주시 완산구 안행8길 52-10 (삼천동1가)
2nd row(560-816) 전라북도 전주시 완산구 장승배기로 341 (서서학동)
3rd row(561-211) 전라북도 전주시 덕진구 동부대로 879, 남촌빌딩 (호성동1가)
4th row(595-805) 전북 순창군 순창읍 순화리 483-1 제2호
5th row(580-060) 전라북도 정읍시 정읍사로 527 (시기동)
ValueCountFrequency (%)
전라북도 108
 
13.8%
전주시 50
 
6.4%
완산구 37
 
4.7%
2층 15
 
1.9%
군산시 14
 
1.8%
덕진구 13
 
1.7%
12
 
1.5%
익산시 10
 
1.3%
정읍시 9
 
1.2%
완주군 7
 
0.9%
Other values (321) 505
64.7%
2024-03-15T00:19:40.254299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
668
 
15.9%
( 190
 
4.5%
) 190
 
4.5%
5 186
 
4.4%
173
 
4.1%
1 157
 
3.7%
0 146
 
3.5%
- 140
 
3.3%
6 131
 
3.1%
2 127
 
3.0%
Other values (169) 2090
49.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1854
44.2%
Decimal Number 1114
26.5%
Space Separator 668
 
15.9%
Open Punctuation 190
 
4.5%
Close Punctuation 190
 
4.5%
Dash Punctuation 140
 
3.3%
Other Punctuation 42
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
173
 
9.3%
118
 
6.4%
112
 
6.0%
108
 
5.8%
102
 
5.5%
98
 
5.3%
96
 
5.2%
67
 
3.6%
61
 
3.3%
53
 
2.9%
Other values (154) 866
46.7%
Decimal Number
ValueCountFrequency (%)
5 186
16.7%
1 157
14.1%
0 146
13.1%
6 131
11.8%
2 127
11.4%
8 108
9.7%
3 80
7.2%
7 65
 
5.8%
9 59
 
5.3%
4 55
 
4.9%
Space Separator
ValueCountFrequency (%)
668
100.0%
Open Punctuation
ValueCountFrequency (%)
( 190
100.0%
Close Punctuation
ValueCountFrequency (%)
) 190
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 140
100.0%
Other Punctuation
ValueCountFrequency (%)
, 42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2344
55.8%
Hangul 1854
44.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
173
 
9.3%
118
 
6.4%
112
 
6.0%
108
 
5.8%
102
 
5.5%
98
 
5.3%
96
 
5.2%
67
 
3.6%
61
 
3.3%
53
 
2.9%
Other values (154) 866
46.7%
Common
ValueCountFrequency (%)
668
28.5%
( 190
 
8.1%
) 190
 
8.1%
5 186
 
7.9%
1 157
 
6.7%
0 146
 
6.2%
- 140
 
6.0%
6 131
 
5.6%
2 127
 
5.4%
8 108
 
4.6%
Other values (5) 301
12.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2344
55.8%
Hangul 1854
44.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
668
28.5%
( 190
 
8.1%
) 190
 
8.1%
5 186
 
7.9%
1 157
 
6.7%
0 146
 
6.2%
- 140
 
6.0%
6 131
 
5.6%
2 127
 
5.4%
8 108
 
4.6%
Other values (5) 301
12.8%
Hangul
ValueCountFrequency (%)
173
 
9.3%
118
 
6.4%
112
 
6.0%
108
 
5.8%
102
 
5.5%
98
 
5.3%
96
 
5.2%
67
 
3.6%
61
 
3.3%
53
 
2.9%
Other values (154) 866
46.7%
Distinct22
Distinct (%)19.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
Minimum2014-01-08 00:00:00
Maximum2014-08-07 00:00:00
2024-03-15T00:19:40.816690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:19:41.195425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)

업종면허
Categorical

Distinct4
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
건축공사업
55 
토목건축공사업
39 
토목공사업
17 
조경공사업
 
1

Length

Max length7
Median length5
Mean length5.6964286
Min length5

Unique

Unique1 ?
Unique (%)0.9%

Sample

1st row건축공사업
2nd row건축공사업
3rd row토목건축공사업
4th row토목건축공사업
5th row토목공사업

Common Values

ValueCountFrequency (%)
건축공사업 55
49.1%
토목건축공사업 39
34.8%
토목공사업 17
 
15.2%
조경공사업 1
 
0.9%

Length

2024-03-15T00:19:41.640651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:19:41.934520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건축공사업 55
49.1%
토목건축공사업 39
34.8%
토목공사업 17
 
15.2%
조경공사업 1
 
0.9%
Distinct84
Distinct (%)75.0%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2024-03-15T00:19:42.828715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length7
Mean length6.6428571
Min length3

Characters and Unicode

Total characters744
Distinct characters18
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique69 ?
Unique (%)61.6%

Sample

1st row14­0328
2nd row14­0193
3rd row전북제14­0150
4th row14­0175
5th row14­0350
ValueCountFrequency (%)
01­1387 6
 
5.4%
01­0071 5
 
4.5%
14­0200 5
 
4.5%
14­0165 4
 
3.6%
14­0457 3
 
2.7%
01­2984 2
 
1.8%
2783 2
 
1.8%
855 2
 
1.8%
제150558호 2
 
1.8%
1534 2
 
1.8%
Other values (74) 79
70.5%
2024-03-15T00:19:44.207114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 160
21.5%
1 153
20.6%
­ 95
12.8%
4 88
11.8%
5 53
 
7.1%
2 44
 
5.9%
3 34
 
4.6%
7 32
 
4.3%
8 29
 
3.9%
9 25
 
3.4%
Other values (8) 31
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 638
85.8%
Format 95
 
12.8%
Other Letter 11
 
1.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 160
25.1%
1 153
24.0%
4 88
13.8%
5 53
 
8.3%
2 44
 
6.9%
3 34
 
5.3%
7 32
 
5.0%
8 29
 
4.5%
9 25
 
3.9%
6 20
 
3.1%
Other Letter
ValueCountFrequency (%)
3
27.3%
2
18.2%
2
18.2%
1
 
9.1%
1
 
9.1%
1
 
9.1%
1
 
9.1%
Format
ValueCountFrequency (%)
­ 95
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 733
98.5%
Hangul 11
 
1.5%

Most frequent character per script

Common
ValueCountFrequency (%)
0 160
21.8%
1 153
20.9%
­ 95
13.0%
4 88
12.0%
5 53
 
7.2%
2 44
 
6.0%
3 34
 
4.6%
7 32
 
4.4%
8 29
 
4.0%
9 25
 
3.4%
Hangul
ValueCountFrequency (%)
3
27.3%
2
18.2%
2
18.2%
1
 
9.1%
1
 
9.1%
1
 
9.1%
1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 638
85.8%
None 95
 
12.8%
Hangul 11
 
1.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 160
25.1%
1 153
24.0%
4 88
13.8%
5 53
 
8.3%
2 44
 
6.9%
3 34
 
5.3%
7 32
 
5.0%
8 29
 
4.5%
9 25
 
3.9%
6 20
 
3.1%
None
ValueCountFrequency (%)
­ 95
100.0%
Hangul
ValueCountFrequency (%)
3
27.3%
2
18.2%
2
18.2%
1
 
9.1%
1
 
9.1%
1
 
9.1%
1
 
9.1%

위반내용
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
건설공사대장 미통보
52 
건설업 등록기준 미달(자본금)
10 
건설업 등록기준 미달(보증가능금액)
10 
하도급(재하도급 포함) 통지의무 불이행
영업정지후 처분종료일까지 등록기준 미달사항 미 보완
Other values (9)
24 

Length

Max length37
Median length28
Mean length15.151786
Min length10

Unique

Unique4 ?
Unique (%)3.6%

Sample

1st row건설업 등록기준 미달(자본금)
2nd row주기적 등록사항신고 불이행
3rd row건설업 등록기준 미달(보증가능금액)
4th row주기적 등록사항신고 불이행
5th row건설업 등록기준 미달(보증가능금액)

Common Values

ValueCountFrequency (%)
건설공사대장 미통보 52
46.4%
건설업 등록기준 미달(자본금) 10
 
8.9%
건설업 등록기준 미달(보증가능금액) 10
 
8.9%
하도급(재하도급 포함) 통지의무 불이행 9
 
8.0%
영업정지후 처분종료일까지 등록기준 미달사항 미 보완 7
 
6.2%
주기적 등록사항신고 불이행 6
 
5.4%
건설공사대장 미통보 시정명령 불이행 6
 
5.4%
건설업체의 폐업사실확인 3
 
2.7%
등록기준 미달로 영업정지처분 후 3년 이내에 동일한 등록기준에 미달 3
 
2.7%
직접시공계획서 미통보 2
 
1.8%
Other values (4) 4
 
3.6%

Length

2024-03-15T00:19:44.442501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
미통보 60
17.9%
건설공사대장 58
17.3%
등록기준 31
 
9.2%
불이행 22
 
6.5%
건설업 21
 
6.2%
미달(자본금 10
 
3.0%
미달(보증가능금액 10
 
3.0%
하도급(재하도급 9
 
2.7%
포함 9
 
2.7%
통지의무 9
 
2.7%
Other values (28) 97
28.9%

처분근거
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
건설산업기본법 제81조제3호
52 
건설산업기본법 제83조제3호
21 
건설산업기본법 제99조제5호
건설산업기본법 제83조제3호의2
건설산업기본법 제81조제2호
Other values (6)
17 

Length

Max length18
Median length15
Mean length15.3125
Min length15

Unique

Unique1 ?
Unique (%)0.9%

Sample

1st row건설산업기본법 제83조제3호
2nd row건설산업기본법 제81조제2호
3rd row건설산업기본법 제83조제3호
4th row건설산업기본법 제81조제2호
5th row건설산업기본법 제83조제3호

Common Values

ValueCountFrequency (%)
건설산업기본법 제81조제3호 52
46.4%
건설산업기본법 제83조제3호 21
18.8%
건설산업기본법 제99조제5호 9
 
8.0%
건설산업기본법 제83조제3호의2 7
 
6.2%
건설산업기본법 제81조제2호 6
 
5.4%
건설산업기본법 제99조제11호 6
 
5.4%
건설산업기본법 제83조제12호 3
 
2.7%
건설산업기본법 제83조제3호의3 3
 
2.7%
건설산업기본법 제82조제2항제3호 2
 
1.8%
건설산업기본법 제99조제4호 2
 
1.8%

Length

2024-03-15T00:19:44.656937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
건설산업기본법 112
50.0%
제81조제3호 52
23.2%
제83조제3호 21
 
9.4%
제99조제5호 9
 
4.0%
제83조제3호의2 7
 
3.1%
제81조제2호 6
 
2.7%
제99조제11호 6
 
2.7%
제83조제12호 3
 
1.3%
제83조제3호의3 3
 
1.3%
제82조제2항제3호 2
 
0.9%
Other values (2) 3
 
1.3%

처분내용
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
시정명령
59 
등록말소
21 
과태료
17 
영업정지
13 
과징금
 
2

Length

Max length4
Median length4
Mean length3.8303571
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업정지
2nd row시정명령
3rd row영업정지
4th row시정명령
5th row영업정지

Common Values

ValueCountFrequency (%)
시정명령 59
52.7%
등록말소 21
 
18.8%
과태료 17
 
15.2%
영업정지 13
 
11.6%
과징금 2
 
1.8%

Length

2024-03-15T00:19:44.866647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:19:45.049411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
시정명령 59
52.7%
등록말소 21
 
18.8%
과태료 17
 
15.2%
영업정지 13
 
11.6%
과징금 2
 
1.8%

처분_말소기간
Text

MISSING 

Distinct21
Distinct (%)61.8%
Missing78
Missing (%)69.6%
Memory size1.0 KiB
2024-03-15T00:19:45.719405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length10
Mean length14.235294
Min length10

Characters and Unicode

Total characters484
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)41.2%

Sample

1st row2014-08-08~2015-01-07
2nd row2014-08-15~2015-01-14
3rd row2014-08-05~2015-02-04
4th row2014-08-05
5th row2014-08-05
ValueCountFrequency (%)
2014-04-01 6
17.6%
2014-03-05 4
 
11.8%
2014-08-05 3
 
8.8%
2014-01-20 2
 
5.9%
2014-03-10~2014-10-09 2
 
5.9%
2014-06-25 2
 
5.9%
2014-05-15~2014-10-14 2
 
5.9%
2014-03-10 1
 
2.9%
2014-02-20~2014-07-19 1
 
2.9%
2014-03-05~2014-06-04 1
 
2.9%
Other values (10) 10
29.4%
2024-03-15T00:19:46.790321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 126
26.0%
- 94
19.4%
1 78
16.1%
2 58
12.0%
4 58
12.0%
5 20
 
4.1%
~ 13
 
2.7%
3 10
 
2.1%
7 8
 
1.7%
8 7
 
1.4%
Other values (3) 12
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 376
77.7%
Dash Punctuation 94
 
19.4%
Math Symbol 13
 
2.7%
Space Separator 1
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 126
33.5%
1 78
20.7%
2 58
15.4%
4 58
15.4%
5 20
 
5.3%
3 10
 
2.7%
7 8
 
2.1%
8 7
 
1.9%
9 6
 
1.6%
6 5
 
1.3%
Dash Punctuation
ValueCountFrequency (%)
- 94
100.0%
Math Symbol
ValueCountFrequency (%)
~ 13
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 484
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 126
26.0%
- 94
19.4%
1 78
16.1%
2 58
12.0%
4 58
12.0%
5 20
 
4.1%
~ 13
 
2.7%
3 10
 
2.1%
7 8
 
1.7%
8 7
 
1.4%
Other values (3) 12
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 484
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 126
26.0%
- 94
19.4%
1 78
16.1%
2 58
12.0%
4 58
12.0%
5 20
 
4.1%
~ 13
 
2.7%
3 10
 
2.1%
7 8
 
1.7%
8 7
 
1.4%
Other values (3) 12
 
2.5%

처분(원)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
<NA>
93 
500000
15 
35493000
 
1
800000
 
1
400000
 
1

Length

Max length8
Median length4
Mean length4.375
Min length4

Unique

Unique4 ?
Unique (%)3.6%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 93
83.0%
500000 15
 
13.4%
35493000 1
 
0.9%
800000 1
 
0.9%
400000 1
 
0.9%
84324000 1
 
0.9%

Length

2024-03-15T00:19:47.231233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:19:47.584864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 93
83.0%
500000 15
 
13.4%
35493000 1
 
0.9%
800000 1
 
0.9%
400000 1
 
0.9%
84324000 1
 
0.9%

Interactions

2024-03-15T00:19:32.378861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T00:19:47.838580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업체명대표자주소처리일자업종면허등록번호위반내용처분근거처분내용처분_말소기간처분(원)
번호1.0000.9320.9090.9320.9160.3600.9240.8320.8150.8280.9940.609
업체명0.9321.0001.0001.0000.9800.9991.0000.8670.8970.8610.9881.000
대표자0.9091.0001.0001.0000.9420.9651.0000.8680.8760.8470.9821.000
주소0.9321.0001.0001.0000.9800.9991.0000.8670.8970.8610.9881.000
처리일자0.9160.9800.9420.9801.0000.4090.9800.9690.9930.9290.9970.944
업종면허0.3600.9990.9650.9990.4091.0000.9990.3000.3730.2030.5830.000
등록번호0.9241.0001.0001.0000.9800.9991.0000.8690.8970.8650.9901.000
위반내용0.8320.8670.8680.8670.9690.3000.8691.0000.9920.9740.9090.944
처분근거0.8150.8970.8760.8970.9930.3730.8970.9921.0000.9620.9310.582
처분내용0.8280.8610.8470.8610.9290.2030.8650.9740.9621.0001.0001.000
처분_말소기간0.9940.9880.9820.9880.9970.5830.9900.9090.9311.0001.000NaN
처분(원)0.6091.0001.0001.0000.9440.0001.0000.9440.5821.000NaN1.000
2024-03-15T00:19:48.178698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종면허처분(원)처분근거처분내용위반내용
업종면허1.0000.0000.2240.1650.162
처분(원)0.0001.0000.4810.9070.660
처분근거0.2240.4811.0000.8850.951
처분내용0.1650.9070.8851.0000.887
위반내용0.1620.6600.9510.8871.000
2024-03-15T00:19:48.448926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업종면허위반내용처분근거처분내용처분(원)
번호1.0000.2150.5200.5210.4760.509
업종면허0.2151.0000.1620.2240.1650.000
위반내용0.5200.1621.0000.9510.8870.660
처분근거0.5210.2240.9511.0000.8850.481
처분내용0.4760.1650.8870.8851.0000.907
처분(원)0.5090.0000.6600.4810.9071.000

Missing values

2024-03-15T00:19:32.722697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T00:19:33.121528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호업체명대표자지역주소처리일자업종면허등록번호위반내용처분근거처분내용처분_말소기간처분(원)
01(유)신세계건설김*용전북(560-810) 전라북도 전주시 완산구 안행8길 52-10 (삼천동1가)2014-08-07건축공사업14­0328건설업 등록기준 미달(자본금)건설산업기본법 제83조제3호영업정지2014-08-08~2015-01-07<NA>
12(주)대흥종합건설김*봉전북(560-816) 전라북도 전주시 완산구 장승배기로 341 (서서학동)2014-08-07건축공사업14­0193주기적 등록사항신고 불이행건설산업기본법 제81조제2호시정명령<NA><NA>
23수봉종합건설(주)황*주전북(561-211) 전라북도 전주시 덕진구 동부대로 879, 남촌빌딩 (호성동1가)2014-07-25토목건축공사업전북제14­0150건설업 등록기준 미달(보증가능금액)건설산업기본법 제83조제3호영업정지2014-08-15~2015-01-14<NA>
34(유)세종종합건설육*식전북(595-805) 전북 순창군 순창읍 순화리 483-1 제2호2014-07-17토목건축공사업14­0175주기적 등록사항신고 불이행건설산업기본법 제81조제2호시정명령<NA><NA>
45(유)강산토건이*순전북(580-060) 전라북도 정읍시 정읍사로 527 (시기동)2014-07-16토목공사업14­0350건설업 등록기준 미달(보증가능금액)건설산업기본법 제83조제3호영업정지2014-08-05~2015-02-04<NA>
56(유)강산토건이*순전북(580-060) 전라북도 정읍시 정읍사로 527 (시기동)2014-07-16토목공사업14­0350주기적 등록사항신고 불이행건설산업기본법 제81조제2호시정명령<NA><NA>
67(유)다일건설이*희전북(580-804) 전라북도 정읍시 수성5로 27-11, 2층 (수성동)2014-07-16토목공사업14­0525건설업 등록기준 미달(보증가능금액)건설산업기본법 제83조제12호등록말소2014-08-05<NA>
77(유)다일건설이*희전북(580-804) 전라북도 정읍시 수성5로 27-11, 2층 (수성동)2014-07-16토목공사업14­0525건설업체의 폐업사실확인건설산업기본법 제83조제3호등록말소2014-08-05<NA>
88진실종합건설(주)김*록전북(597-803) 전라북도 장수군 장수읍 준비길 92014-07-16토목건축공사업토건­04­0013건설업체의 폐업사실확인건설산업기본법 제83조제12호등록말소2014-08-05<NA>
99(유)보현종합건설김*성전북(570-767) 전라북도 익산시 영등동 800 우남샘물타운 5층2014-07-09토목공사업05­0097건설업 등록기준 미달(자본금)건설산업기본법 제83조제3호등록말소2014-07-25<NA>
번호업체명대표자지역주소처리일자업종면허등록번호위반내용처분근거처분내용처분_말소기간처분(원)
10299(유)티오피종합건설김*선전북(576-150) 전라북도 김제시 하정로 15 (흥사동)2014-01-13건축공사업14­0127등록기준 미달로 영업정지처분 후 3년 이내에 동일한 등록기준에 미달건설산업기본법 제83조제3호의3등록말소2014-01-20<NA>
103100(주)삼목토건이*호전북(567-911) 전라북도 진안군 부귀면 전진로 1846 , 2층2014-01-08토목건축공사업01­0071하도급(재하도급 포함) 통지의무 불이행건설산업기본법 제99조제5호과태료<NA>500000
104101(주)삼목토건이*호전북(567-911) 전라북도 진안군 부귀면 전진로 1846 , 2층2014-01-08토목건축공사업01­0071하도급(재하도급 포함) 통지의무 불이행건설산업기본법 제99조제5호과태료<NA>500000
105102(주)삼목토건이*호전북(567-911) 전라북도 진안군 부귀면 전진로 1846 , 2층2014-01-08토목건축공사업01­0071하도급(재하도급 포함) 통지의무 불이행건설산업기본법 제99조제5호과태료<NA>500000
106103(주)삼목토건이*호전북(567-911) 전라북도 진안군 부귀면 전진로 1846 , 2층2014-01-08토목건축공사업01­0071하도급(재하도급 포함) 통지의무 불이행건설산업기본법 제99조제5호과태료<NA>500000
107104(주)성창종합건설전*범전북(561-360) 전라북도 전주시 덕진구 장동유통로 29-9, 2층 (장동)2014-01-08건축공사업14­0403하도급(재하도급 포함) 통지의무 불이행건설산업기본법 제99조제5호과태료<NA>500000
108105한림종합건설(주)이*열전북(565-861) 전라북도 완주군 고산면 남봉신기길 62014-01-08토목건축공사업14­0200하도급(재하도급 포함) 통지의무 불이행건설산업기본법 제99조제5호과태료<NA>500000
109106한림종합건설(주)이*열전북(565-861) 전라북도 완주군 고산면 남봉신기길 62014-01-08토목건축공사업14­0200하도급(재하도급 포함) 통지의무 불이행건설산업기본법 제99조제5호과태료<NA>500000
110107한림종합건설(주)이*열전북(565-861) 전라북도 완주군 고산면 남봉신기길 62014-01-08토목건축공사업14­0200하도급(재하도급 포함) 통지의무 불이행건설산업기본법 제99조제5호과태료<NA>500000
111108한림종합건설(주)이*열전북(565-861) 전라북도 완주군 고산면 남봉신기길 62014-01-08토목건축공사업14­0200하도급(재하도급 포함) 통지의무 불이행건설산업기본법 제99조제5호과태료<NA>500000