Overview

Dataset statistics

Number of variables: 6
Number of observations: 72
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.6 KiB
Average record size in memory: 51.8 B

Variable types

Numeric: 2
Text: 1
Categorical: 2
DateTime: 1

Dataset

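Description: Registration status of specialized safety inspection agencies located in the 14 cities and counties of Jeonbuk State (registration number, company name, registration field, registration date, etc.). Registration fields: bridges, tunnels, hydraulic structures, buildings.
Author: Jeonbuk State (전북특별자치도)
URL: https://www.data.go.kr/data/3081445/fileData.do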

Alerts

연번 is highly overall correlated with 등록번호 (High correlation)
등록번호 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)
등록번호 has unique values (Unique)

Reproduction

Analysis started: 2024-03-14 15:03:29.335854
Analysis finished: 2024-03-14 15:03:31.250287
Duration: 1.91 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 72
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 36.5
Minimum: 1
Maximum: 72
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 776.0 B

Quantile statistics

Minimum: 1
5-th percentile: 4.55
Q1: 18.75
Median: 36.5
Q3: 54.25
95-th percentile: 68.45
Maximum: 72
Range: 71
Interquartile range (IQR): 35.5

Descriptive statistics

Standard deviation: 20.92845
Coefficient of variation (CV): 0.57338218
Kurtosis: -1.2
Mean: 36.5
Median Absolute Deviation (MAD): 18
Skewness: 0
Sum: 2628
Variance: 438
Monotonicity: Strictly increasing
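
The figures reported for 연번 are what one would expect if the column holds the integers 1 through 72. As a minimal sketch, assuming a sample (n - 1) variance and linearly interpolated quantiles (the reported numbers are consistent with both, but the report does not state its formulas), they can be reproduced with the Python standard library. All variable names are illustrative.

```python
import math
import statistics

# Assumption: 연번 is the sequence 1..72 (Minimum 1, Maximum 72, 72 distinct values,
# strictly increasing).
values = list(range(1, 73))

mean = statistics.mean(values)            # arithmetic mean
variance = statistics.variance(values)    # sample variance (n - 1 denominator)
std_dev = math.sqrt(variance)
cv = std_dev / mean                       # coefficient of variation

# method="inclusive" interpolates linearly between the observed minimum and maximum.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentiles[0], twentiles[-1]

# Median absolute deviation: the median distance from the median.
mad = statistics.median(abs(v - median) for v in values)

print(sum(values), mean, variance, round(std_dev, 5), round(cv, 8))
print(p5, q1, median, q3, p95, q3 - q1, mad)
```

The interquartile range is simply `q3 - q1`, and the coefficient of variation is the standard deviation divided by the mean.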
Histogram with fixed size bins (bins=50)

Common values

Value              Count  Frequency (%)
1                  1      1.4%
38                 1      1.4%
54                 1      1.4%
53                 1      1.4%
52                 1      1.4%
51                 1      1.4%
50                 1      1.4%
49                 1      1.4%
48                 1      1.4%
47                 1      1.4%
Other values (62)  62     86.1%

Minimum 10 values

Value  Count  Frequency (%)
1      1      1.4%
2      1      1.4%
3      1      1.4%
4      1      1.4%
5      1      1.4%
6      1      1.4%
7      1      1.4%
8      1      1.4%
9      1      1.4%
10     1      1.4%

Maximum 10 values

Value  Count  Frequency (%)
72     1      1.4%
71     1      1.4%
70     1      1.4%
69     1      1.4%
68     1      1.4%
67     1      1.4%
66     1      1.4%
65     1      1.4%
64     1      1.4%
63     1      1.4%

등록번호 (registration number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 72
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 56.611111
Minimum: 1
Maximum: 242
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 776.0 B

Quantile statistics

Minimum: 1
5-th percentile: 6.55
Q1: 31.75
Median: 55.5
Q3: 77.25
95-th percentile: 92.45
Maximum: 242
Range: 241
Interquartile range (IQR): 45.5

Descriptive statistics

Standard deviation: 38.54522
Coefficient of variation (CV): 0.68087729
Kurtosis: 7.9239117
Mean: 56.611111
Median Absolute Deviation (MAD): 23
Skewness: 1.9632684
Sum: 4076
Variance: 1485.734
Monotonicity: Not monotonic
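
The positive skewness and high kurtosis of 등록번호 suggest a few unusually large values. One common way to check this is Tukey's 1.5 × IQR fence. That rule is not part of the report; it is used here only as an illustration. The sketch below applies it to the reported quartiles and to the ten smallest and ten largest values the report lists for this variable.

```python
# Quartiles as reported for 등록번호.
q1, q3 = 31.75, 77.25
iqr = q3 - q1  # 45.5, matching the reported interquartile range

# Tukey's rule of thumb (an assumption of this sketch, not the report's method):
# flag values lying more than 1.5 * IQR outside the quartiles.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The ten smallest and ten largest values listed in the report for 등록번호.
smallest = [1, 2, 4, 6, 7, 8, 11, 12, 14, 15]
largest = [242, 191, 94, 93, 92, 91, 90, 89, 88, 87]

outliers = sorted(v for v in smallest + largest if v < lower_fence or v > upper_fence)
print(lower_fence, upper_fence, outliers)
```

Because the tenth-largest and tenth-smallest values both fall inside the fences, checking only the listed extremes is enough to find every value the rule would flag.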
Histogram with fixed size bins (bins=50)

Common values

Value              Count  Frequency (%)
191                1      1.4%
55                 1      1.4%
75                 1      1.4%
74                 1      1.4%
73                 1      1.4%
72                 1      1.4%
69                 1      1.4%
68                 1      1.4%
67                 1      1.4%
66                 1      1.4%
Other values (62)  62     86.1%

Minimum 10 values

Value  Count  Frequency (%)
1      1      1.4%
2      1      1.4%
4      1      1.4%
6      1      1.4%
7      1      1.4%
8      1      1.4%
11     1      1.4%
12     1      1.4%
14     1      1.4%
15     1      1.4%

Maximum 10 values

Value  Count  Frequency (%)
242    1      1.4%
191    1      1.4%
94     1      1.4%
93     1      1.4%
92     1      1.4%
91     1      1.4%
90     1      1.4%
89     1      1.4%
88     1      1.4%
87     1      1.4%

업체명 (company name)
Text

Distinct: 71
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 704.0 B

Length

Max length: 31
Median length: 21
Mean length: 10.069444
Min length: 3

Characters and Unicode

Total characters: 725
Distinct characters: 120
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
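
As a rough illustration of how such character properties can be read, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own implementation, and the four names are taken from this variable's sample rows. The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `So` Other Symbol, `Ps` Open Punctuation, `Pe` Close Punctuation, `Po` Other Punctuation, `Zs` Space Separator, and `Nd` Decimal Number.

```python
import unicodedata
from collections import Counter

# Company names copied from the sample rows of 업체명.
names = [
    "㈜건설방재기술연구원",
    "기술사건축사사무소 미문건설기술연구원㈜",
    "㈜건설품질시험원",
    "(유)센이엔지건축안전진단[구: (유)센이엔지건축사사무소]",
]


def category_counts(texts):
    """Count characters by Unicode general category (e.g. 'Lo', 'So', 'Ps')."""
    return Counter(unicodedata.category(ch) for text in texts for ch in text)


counts = category_counts(names)
for code, count in counts.most_common():
    print(code, count)
```

Hangul syllables fall under `Lo`, while the enclosed company mark ㈜ is classified as a symbol (`So`) rather than a letter.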

Unique

Unique: 70
Unique (%): 97.2%

Sample

1st row: ㈜건설방재기술연구원
2nd row: 기술사건축사사무소 미문건설기술연구원㈜
3rd row: ㈜건설품질시험원
4th row: ㈜건설품질시험원
5th row: (유)센이엔지건축안전진단[구: (유)센이엔지건축사사무소]

Value    Count    Frequency (%)
주식회사    13    13.7%
유한회사    3    3.2%
㈜한아    2    2.1%
㈜건설품질시험원    2    2.1%
태안특수건설㈜    1    1.1%
선인건축사사무소    1    1.1%
㈜혜원이엔지    1    1.1%
채움기술    1    1.1%
㈜남지건설이앤씨    1    1.1%
서현이앤씨    1    1.1%
Other values (69)    69    72.6%

Most occurring characters

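Value    Count    Frequency (%)
(not recovered)    40    5.5%
(not recovered)    39    5.4%
(not recovered)    33    4.6%
(not recovered)    24    3.3%
)    23    3.2%
(not recovered)    23    3.2%
(    23    3.2%
(not recovered)    23    3.2%
(not recovered)    22    3.0%
(not recovered)    22    3.0%
Other values (110)    453    62.5%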

Most occurring categories

Value    Count    Frequency (%)
Other Letter    592    81.7%
Other Symbol    40    5.5%
Close Punctuation    29    4.0%
Open Punctuation    29    4.0%
Space Separator    23    3.2%
Other Punctuation    8    1.1%
Decimal Number    4    0.6%

Most frequent character per category

Other Letter
Value    Count    Frequency (%)
(not recovered)    39    6.6%
(not recovered)    33    5.6%
(not recovered)    24    4.1%
(not recovered)    23    3.9%
(not recovered)    22    3.7%
(not recovered)    22    3.7%
(not recovered)    22    3.7%
(not recovered)    21    3.5%
(not recovered)    18    3.0%
(not recovered)    18    3.0%
Other values (101)    350    59.1%

Close Punctuation
Value    Count    Frequency (%)
)    23    79.3%
]    6    20.7%

Open Punctuation
Value    Count    Frequency (%)
(    23    79.3%
[    6    20.7%

Decimal Number
Value    Count    Frequency (%)
0    2    50.0%
1    2    50.0%

Other Symbol
Value    Count    Frequency (%)
(not recovered)    40    100.0%

Space Separator
Value    Count    Frequency (%)
(space)    23    100.0%

Other Punctuation
Value    Count    Frequency (%)
:    8    100.0%

Most occurring scripts

Value    Count    Frequency (%)
Hangul    632    87.2%
Common    93    12.8%

Most frequent character per script

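Hangul
Value    Count    Frequency (%)
(not recovered)    40    6.3%
(not recovered)    39    6.2%
(not recovered)    33    5.2%
(not recovered)    24    3.8%
(not recovered)    23    3.6%
(not recovered)    22    3.5%
(not recovered)    22    3.5%
(not recovered)    22    3.5%
(not recovered)    21    3.3%
(not recovered)    18    2.8%
Other values (102)    368    58.2%

Common
Value    Count    Frequency (%)
)    23    24.7%
(space)    23    24.7%
(    23    24.7%
:    8    8.6%
[    6    6.5%
]    6    6.5%
0    2    2.2%
1    2    2.2%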

Most occurring blocks

Value    Count    Frequency (%)
Hangul    592    81.7%
ASCII    93    12.8%
None    40    5.5%

Most frequent character per block

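None
Value    Count    Frequency (%)
(not recovered)    40    100.0%

Hangul
Value    Count    Frequency (%)
(not recovered)    39    6.6%
(not recovered)    33    5.6%
(not recovered)    24    4.1%
(not recovered)    23    3.9%
(not recovered)    22    3.7%
(not recovered)    22    3.7%
(not recovered)    22    3.7%
(not recovered)    21    3.5%
(not recovered)    18    3.0%
(not recovered)    18    3.0%
Other values (101)    350    59.1%

ASCII
Value    Count    Frequency (%)
)    23    24.7%
(space)    23    24.7%
(    23    24.7%
:    8    8.6%
[    6    6.5%
]    6    6.5%
0    2    2.2%
1    2    2.2%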

등록분야 (registration field)
Categorical

Distinct: 9
Distinct (%): 12.5%
Missing: 0
Missing (%): 0.0%
Memory size: 704.0 B

교량/터널,수리: 22
교량/터널: 20
건축: 17
교량/터널,수리,건축
교량/터널,건축
Other values (4)

Length

Max length: 12
Median length: 11
Mean length: 6.0416667
Min length: 2

Unique

Unique: 3
Unique (%): 4.2%

Sample

1st row: 교량/터널,수리,건축
2nd row: 건축
3rd row: 교량/터널,수리
4th row: 교량/터널,수리
5th row: 건축

Common Values

Value    Count    Frequency (%)
교량/터널,수리    22    30.6%
교량/터널    20    27.8%
건축    17    23.6%
교량/터널,수리,건축    4    5.6%
교량/터널,건축    4    5.6%
교량/터널, 수리    2    2.8%
교량/터널,항만    1    1.4%
교량/터널,수리,항만    1    1.4%
교량/터널,수리,건축    1    1.4%
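
Some of the nine distinct values appear to differ only in whitespace: "교량/터널, 수리" has a space after the comma, and "교량/터널,수리,건축" is listed twice. The sketch below shows one way such variants could be merged before counting. The counts are copied from the table above. The trailing space on the last key is an assumption, since the report does not show what distinguishes the two entries, and `normalize` is a hypothetical helper.

```python
from collections import Counter

# Value counts copied from the Common Values table.
# Assumption: the duplicate-looking last entry differs by a trailing space.
raw_counts = {
    "교량/터널,수리": 22,
    "교량/터널": 20,
    "건축": 17,
    "교량/터널,수리,건축": 4,
    "교량/터널,건축": 4,
    "교량/터널, 수리": 2,
    "교량/터널,항만": 1,
    "교량/터널,수리,항만": 1,
    "교량/터널,수리,건축 ": 1,
}


def normalize(label):
    """Strip whitespace around each comma-separated field."""
    return ",".join(part.strip() for part in label.split(","))


merged = Counter()
for label, count in raw_counts.items():
    merged[normalize(label)] += count

for label, count in merged.most_common():
    print(label, count)
```

Under that assumption the nine raw values collapse to seven, with the total number of records unchanged.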

Length

Histogram of lengths of the category

Common Values (Plot)

Value    Count    Frequency (%)
교량/터널,수리    22    29.7%
교량/터널    22    29.7%
건축    17    23.0%
교량/터널,수리,건축    5    6.8%
교량/터널,건축    4    5.4%
수리    2    2.7%
교량/터널,항만    1    1.4%
교량/터널,수리,항만    1    1.4%

대표자 (representative)
Categorical

Distinct: 25
Distinct (%): 34.7%
Missing: 0
Missing (%): 0.0%
Memory size: 704.0 B

이**: 18
김**: 13
박**
최**
강**
Other values (20): 27

Length

Max length: 8
Median length: 3
Mean length: 3.1111111
Min length: 2

Unique

Unique: 13
Unique (%): 18.1%

Sample

1st row: 백**
2nd row: 전**
3rd row: 이**
4th row: 이**
5th row: 박**

Common Values

Value    Count    Frequency (%)
이**    18    25.0%
김**    13    18.1%
박**    7    9.7%
최**    4    5.6%
강**    3    4.2%
정**    2    2.8%
조**    2    2.8%
장**    2    2.8%
백**    2    2.8%
송**    2    2.8%
Other values (15)    17    23.6%

Length

Histogram of lengths of the category
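
Value    Count    Frequency (%)
(not recovered)    19    25.7%
(not recovered)    15    20.3%
(not recovered)    7    9.5%
(not recovered)    5    6.8%
(not recovered)    3    4.1%
(not recovered)    3    4.1%
(not recovered)    2    2.7%
(not recovered)    2    2.7%
(not recovered)    2    2.7%
(not recovered)    2    2.7%
Other values (12)    14    18.9%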

등록일자 (registration date)
DateTime

Distinct: 71
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 704.0 B
Minimum: 1998-11-09 00:00:00
Maximum: 2020-01-23 00:00:00
Histogram with fixed size bins (bins=50)

Interactions

[Figure: interaction plots]

Correlations

            연번     등록번호   업체명    등록분야   대표자    등록일자
연번        1.000    0.866    1.000    0.441    0.400    1.000
등록번호    0.866    1.000    1.000    0.578    0.584    1.000
업체명      1.000    1.000    1.000    1.000    1.000    1.000
등록분야    0.441    0.578    1.000    1.000    0.000    1.000
대표자      0.400    0.584    1.000    0.000    1.000    1.000
등록일자    1.000    1.000    1.000    1.000    1.000    1.000

            대표자    등록분야
대표자      1.000    0.000
등록분야    0.000    1.000

            연번     등록번호   등록분야   대표자
연번        1.000    0.838    0.212    0.111
등록번호    0.838    1.000    0.322    0.251
등록분야    0.212    0.322    1.000    0.000
대표자      0.111    0.251    0.000    1.000
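
The report does not state which coefficient each matrix uses. As an illustration only, the sketch below computes Spearman's rank correlation between 연번 and 등록번호, which is one plausible choice for two numeric columns. It uses the ten first rows and ten last rows listed in the Sample section. Since those are only part of the 72 records, the results are not expected to match the 0.866 or 0.838 reported for the full data.

```python
def ranks(values):
    """Return 1-based ranks; assumes no ties, which holds for these unique columns."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman(x, y):
    """Spearman's rho for tie-free data: 1 - 6 * sum(d^2) / (n * (n^2 - 1))."""
    n = len(x)
    d_squared = sum((rx - ry) ** 2 for rx, ry in zip(ranks(x), ranks(y)))
    return 1 - 6 * d_squared / (n * (n * n - 1))


# First ten rows of the Sample section (연번, 등록번호).
serial_head = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
registration_head = [191, 242, 1, 2, 4, 6, 7, 8, 11, 12]

# Last ten rows of the Sample section.
serial_tail = list(range(63, 73))
registration_tail = list(range(85, 95))

print(spearman(serial_head, registration_head))
print(spearman(serial_tail, registration_tail))
```

In the first rows, the two large registration numbers (191 and 242) sit at the top of the list and pull the coefficient down sharply, whereas the last rows are perfectly ordered.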

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index    연번    등록번호    업체명    등록분야    대표자    등록일자
0    1    191    ㈜건설방재기술연구원    교량/터널,수리,건축    백**    1998-11-09
1    2    242    기술사건축사사무소 미문건설기술연구원㈜    건축    전**    2000-04-08
2    3    1    ㈜건설품질시험원    교량/터널,수리    이**    2003-04-03
3    4    2    ㈜건설품질시험원    교량/터널,수리    이**    2003-04-03
4    5    4    (유)센이엔지건축안전진단[구: (유)센이엔지건축사사무소]    건축    박**    2005-02-23
5    6    6    ㈜대한건설연구원    교량/터널,수리    조**    2005-06-22
6    7    7    ㈜대들보구조안전기술단    교량/터널,건축    박**    2006-08-09
7    8    8    ㈜한국건설기술공사    교량/터널,수리    장**    2007-03-09
8    9    11    ㈜세종건설기술    교량/터널,수리    이**    2009-05-11
9    10    12    (유)쎈구조엔지니어링    건축    김**    2009-05-28

Last rows

Index    연번    등록번호    업체명    등록분야    대표자    등록일자
62    63    85    1010건축사사무소    건축    최**    2019-03-11
63    64    86    (유)영화이엔지    교량/터널    윤**    2019-05-07
64    65    87    주식회사 신화기술    건축    김**    2019-06-10
65    66    88    유한회사 금강기술    건축    김**    2019-06-17
66    67    89    유한회사 라온건설기술사사무소    건축    이**    2019-07-18
67    68    90    주식회사 온길    교량/터널    이**    2019-07-29
68    69    91    선인건축사사무소    건축    서**    2019-08-26
69    70    92    태안특수건설㈜    교량/터널    이**    2019-11-28
70    71    93    (유)장원종합건축사사무소    건축    박**    2019-12-03
71    72    94    (유)큰길이엔지    교량/터널    송**    2020-01-23