Overview

Dataset statistics

Number of variables5
Number of observations41
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory43.2 B

Variable types

Text3
DateTime1
Categorical1

Dataset

Description경기도 부천시 관내의 계량기업체 현황에 대한 데이터로 인허가번호, 업체명, 전화번호, 처리일자, 처리구분 등의 자료를 제공합니다.
Author경기도 부천시
URLhttps://www.data.go.kr/data/3079359/fileData.do

Alerts

처리구분 has constant value ""Constant
인허가번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:02:50.031171
Analysis finished2023-12-12 14:02:50.537836
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

인허가번호
Text

UNIQUE 

Distinct41
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-12T23:02:50.722065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length23
Mean length23
Min length23

Characters and Unicode

Total characters943
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)100.0%

Sample

1st row2022-3860555-06-5-00003
2nd row2022-3860555-06-5-00002
3rd row2022-3860555-06-5-00001
4th row2021-3860555-06-5-00001
5th row2020-3860555-06-5-00001
ValueCountFrequency (%)
2022-3860555-06-5-00003 1
 
2.4%
2011-3860212-06-5-00001 1
 
2.4%
2010-3860212-06-5-00002 1
 
2.4%
2010-3860212-06-5-00001 1
 
2.4%
2009-3860212-06-5-00002 1
 
2.4%
2009-3860212-06-5-00001 1
 
2.4%
2009-3860126-06-5-00002 1
 
2.4%
2009-3860126-06-5-00001 1
 
2.4%
2008-3860126-06-5-00005 1
 
2.4%
2008-3860126-06-5-00004 1
 
2.4%
Other values (31) 31
75.6%
2023-12-12T23:02:51.112941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 297
31.5%
- 164
17.4%
6 94
 
10.0%
2 87
 
9.2%
1 76
 
8.1%
5 63
 
6.7%
3 62
 
6.6%
8 51
 
5.4%
9 22
 
2.3%
4 19
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 779
82.6%
Dash Punctuation 164
 
17.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 297
38.1%
6 94
 
12.1%
2 87
 
11.2%
1 76
 
9.8%
5 63
 
8.1%
3 62
 
8.0%
8 51
 
6.5%
9 22
 
2.8%
4 19
 
2.4%
7 8
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
- 164
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 943
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 297
31.5%
- 164
17.4%
6 94
 
10.0%
2 87
 
9.2%
1 76
 
8.1%
5 63
 
6.7%
3 62
 
6.6%
8 51
 
5.4%
9 22
 
2.3%
4 19
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 943
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 297
31.5%
- 164
17.4%
6 94
 
10.0%
2 87
 
9.2%
1 76
 
8.1%
5 63
 
6.7%
3 62
 
6.6%
8 51
 
5.4%
9 22
 
2.3%
4 19
 
2.0%
Distinct33
Distinct (%)80.5%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-12T23:02:51.330543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length8.5121951
Min length4

Characters and Unicode

Total characters349
Distinct characters94
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)61.0%

Sample

1st row에스알지엠(SRGM)
2nd row주식회사 선광시스템
3rd row주식회사 선광시스템
4th row주식회사 근화
5th row케이씨계량기
ValueCountFrequency (%)
주식회사 7
 
14.3%
주)세화씨엔엠 2
 
4.1%
태광인더스트리 2
 
4.1%
주)한국기술산전 2
 
4.1%
주)트라이에스 2
 
4.1%
주)명성하이텍 2
 
4.1%
주)이시다매뉴팩쳐링코리아 2
 
4.1%
선광시스템 2
 
4.1%
한국다쓰노(주 2
 
4.1%
카스산업계기 1
 
2.0%
Other values (25) 25
51.0%
2023-12-12T23:02:51.677816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29
 
8.3%
( 23
 
6.6%
) 23
 
6.6%
13
 
3.7%
13
 
3.7%
9
 
2.6%
9
 
2.6%
8
 
2.3%
8
 
2.3%
8
 
2.3%
Other values (84) 206
59.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 291
83.4%
Open Punctuation 23
 
6.6%
Close Punctuation 23
 
6.6%
Space Separator 8
 
2.3%
Uppercase Letter 4
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
10.0%
13
 
4.5%
13
 
4.5%
9
 
3.1%
9
 
3.1%
8
 
2.7%
8
 
2.7%
8
 
2.7%
7
 
2.4%
7
 
2.4%
Other values (77) 180
61.9%
Uppercase Letter
ValueCountFrequency (%)
S 1
25.0%
R 1
25.0%
G 1
25.0%
M 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Space Separator
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 291
83.4%
Common 54
 
15.5%
Latin 4
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
10.0%
13
 
4.5%
13
 
4.5%
9
 
3.1%
9
 
3.1%
8
 
2.7%
8
 
2.7%
8
 
2.7%
7
 
2.4%
7
 
2.4%
Other values (77) 180
61.9%
Latin
ValueCountFrequency (%)
S 1
25.0%
R 1
25.0%
G 1
25.0%
M 1
25.0%
Common
ValueCountFrequency (%)
( 23
42.6%
) 23
42.6%
8
 
14.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 291
83.4%
ASCII 58
 
16.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
29
 
10.0%
13
 
4.5%
13
 
4.5%
9
 
3.1%
9
 
3.1%
8
 
2.7%
8
 
2.7%
8
 
2.7%
7
 
2.4%
7
 
2.4%
Other values (77) 180
61.9%
ASCII
ValueCountFrequency (%)
( 23
39.7%
) 23
39.7%
8
 
13.8%
S 1
 
1.7%
R 1
 
1.7%
G 1
 
1.7%
M 1
 
1.7%
Distinct32
Distinct (%)78.0%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-12T23:02:51.882263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.097561
Min length11

Characters and Unicode

Total characters496
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)56.1%

Sample

1st row032-671-1126
2nd row02-858-8412
3rd row02-858-8412
4th row032-681-8239
5th row0504-0314-667
ValueCountFrequency (%)
032-668-1551 2
 
4.9%
032-663-0144 2
 
4.9%
032-624-3050 2
 
4.9%
032-670-7355 2
 
4.9%
032-321-1246 2
 
4.9%
032-624-0060 2
 
4.9%
032-684-2959 2
 
4.9%
032-672-9111 2
 
4.9%
02-858-8412 2
 
4.9%
032-0675-3405 1
 
2.4%
Other values (22) 22
53.7%
2023-12-12T23:02:52.311864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 82
16.5%
0 79
15.9%
2 67
13.5%
3 63
12.7%
6 49
9.9%
1 38
7.7%
5 33
6.7%
4 29
 
5.8%
8 22
 
4.4%
7 22
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 414
83.5%
Dash Punctuation 82
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 79
19.1%
2 67
16.2%
3 63
15.2%
6 49
11.8%
1 38
9.2%
5 33
8.0%
4 29
 
7.0%
8 22
 
5.3%
7 22
 
5.3%
9 12
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 82
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 496
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 82
16.5%
0 79
15.9%
2 67
13.5%
3 63
12.7%
6 49
9.9%
1 38
7.7%
5 33
6.7%
4 29
 
5.8%
8 22
 
4.4%
7 22
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 496
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 82
16.5%
0 79
15.9%
2 67
13.5%
3 63
12.7%
6 49
9.9%
1 38
7.7%
5 33
6.7%
4 29
 
5.8%
8 22
 
4.4%
7 22
 
4.4%
Distinct33
Distinct (%)80.5%
Missing0
Missing (%)0.0%
Memory size460.0 B
Minimum1978-09-11 00:00:00
Maximum2022-03-04 00:00:00
2023-12-12T23:02:52.442134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:02:52.585675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)

처리구분
Categorical

CONSTANT 

Distinct1
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size460.0 B
영업중
41 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업중
2nd row영업중
3rd row영업중
4th row영업중
5th row영업중

Common Values

ValueCountFrequency (%)
영업중 41
100.0%

Length

2023-12-12T23:02:52.714883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:02:52.803162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업중 41
100.0%

Correlations

2023-12-12T23:02:52.877733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가번호업체명전화번호처리일자
인허가번호1.0001.0001.0001.000
업체명1.0001.0000.9980.996
전화번호1.0000.9981.0000.993
처리일자1.0000.9960.9931.000

Missing values

2023-12-12T23:02:50.342673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:02:50.488104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

인허가번호업체명전화번호처리일자처리구분
02022-3860555-06-5-00003에스알지엠(SRGM)032-671-11262022-03-04영업중
12022-3860555-06-5-00002주식회사 선광시스템02-858-84122022-03-04영업중
22022-3860555-06-5-00001주식회사 선광시스템02-858-84122022-03-04영업중
32021-3860555-06-5-00001주식회사 근화032-681-82392021-04-22영업중
42020-3860555-06-5-00001케이씨계량기0504-0314-6672020-01-13영업중
52017-3860431-06-5-20071태광인더스트리 주식회사032-684-29592007-04-10영업중
62017-3860431-06-5-00002(주)제일에스코032-683-34192017-07-19영업중
72017-3860431-06-5-00001시앤시인스트루먼트(주)032-327-33442017-06-09영업중
82016-3860431-06-5-00001(주)세화하이테크032-624-38002016-08-04영업중
92016-3860334-06-5-00003동은정공 주식회사032-671-96812016-04-18영업중
인허가번호업체명전화번호처리일자처리구분
312008-3860126-06-5-00003(주)세화씨엔엠032-624-00602008-04-16영업중
322008-3860126-06-5-00002(주)세화씨엔엠032-624-00602008-04-16영업중
332007-3860431-06-5-20072태광인더스트리 주식회사032-684-29592007-04-10영업중
342005-3860126-06-5-00001(주)모텍스032-673-50052005-01-21영업중
351998-3860291-06-5-00004경인이시다코리아032-668-14522007-06-01영업중
361998-3860291-06-5-00001카스산업계기032-679-01411998-02-28영업중
371993-3860000-06-5-00007종합컴퓨터계량증명업소032-0675-34051993-02-02영업중
381985-3860126-06-5-85205삼성계기032-624-28551985-07-24영업중
391983-3860000-06-5-00004정일계량증명업소032-0346-85281983-11-14영업중
401978-3860000-06-5-00001부천계량증명업소032-0672-67231978-09-11영업중