Overview

Dataset statistics

Number of variables: 14
Number of observations: 2620
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 289.2 KiB
Average record size in memory: 113.1 B

Variable types

Numeric: 1
Text: 5
DateTime: 1
Categorical: 7

Dataset

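Description: Data on the status of pre-use inspections of information and communications construction work in Yangju-si, Gyeonggi-do, including fields such as the processing date (처리일자) and the site address (현장주소).
URL: https://www.data.go.kr/data/3073858/fileData.do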

Alerts

관리기관명 has constant value "양주시 정보통신과" (Constant)
데이터기준일자 has constant value "2023-07-21" (Constant)
공사의 종류 is highly overall correlated with 수수료 and 2 other fields (High correlation)
판정 is highly overall correlated with 연번 and 3 other fields (High correlation)
수수료 is highly overall correlated with 공사의 종류 and 3 other fields (High correlation)
재검사여부 is highly overall correlated with 연번 and 3 other fields (High correlation)
연번 is highly overall correlated with 재검사여부 and 2 other fields (High correlation)
관리기관 전화번호 is highly overall correlated with 연번 and 1 other field (High correlation)
공사의 종류 is highly imbalanced (72.9%) (Imbalance)
재검사여부 is highly imbalanced (54.3%) (Imbalance)
판정 is highly imbalanced (54.3%) (Imbalance)
연번 has unique values (Unique)
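
The Imbalance percentages above appear consistent with a score of one minus the normalized Shannon entropy of the class counts. The sketch below illustrates that reading; it is an assumption about how the figure is derived, not the profiler's own code. `imbalance_score` is a hypothetical helper, and the counts are copied from the Common Values tables later in this report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k): H is the Shannon entropy (bits) of the class
    frequencies, k the number of classes. 0 means balanced, 1 means one class."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single-class column is flagged as Constant instead
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Class counts read off the Common Values tables of this report.
retest_counts = [2368, 252]  # 재검사여부 (판정 has the same two counts)
work_type_counts = [2164, 252, 131, 27, 27, 9, 3, 2, 2, 1, 1, 1]  # 공사의 종류

retest_score = imbalance_score(retest_counts)
work_type_score = imbalance_score(work_type_counts)
print(f"{retest_score:.1%}", f"{work_type_score:.1%}")
```

Under this assumption the two scores should land close to the 54.3% and 72.9% reported in the alerts.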

Reproduction

Analysis started: 2023-12-12 22:30:54.767087
Analysis finished: 2023-12-12 22:30:55.951808
Duration: 1.18 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 2620
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1310.5
Minimum: 1
Maximum: 2620
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 23.2 KiB

Quantile statistics

Minimum: 1
5th percentile: 131.95
Q1: 655.75
Median: 1310.5
Q3: 1965.25
95th percentile: 2489.05
Maximum: 2620
Range: 2619
Interquartile range (IQR): 1309.5
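
These quantiles can be reproduced with linear interpolation between neighbouring order statistics, assuming 연번 is simply the consecutive integers 1 to 2620 (which the minimum, maximum, and distinct count suggest). `quantile` below is a hypothetical helper written for illustration, not a function from the profiling library.

```python
import math


def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest order statistics."""
    pos = q * (len(sorted_values) - 1)
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


# Assumption: 연번 is the gap-free sequence 1..2620.
values = list(range(1, 2621))

p5, q1, median, q3, p95 = (quantile(values, q) for q in (0.05, 0.25, 0.5, 0.75, 0.95))
iqr = q3 - q1
value_range = values[-1] - values[0]
print(round(p5, 2), q1, median, q3, round(p95, 2), iqr, value_range)
```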

Descriptive statistics

Standard deviation: 756.47318
Coefficient of variation (CV): 0.57724012
Kurtosis: -1.2
Mean: 1310.5
Median Absolute Deviation (MAD): 655
Skewness: 0
Sum: 3433510
Variance: 572251.67
Monotonicity: Strictly increasing
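
As a cross-check, most of the descriptive statistics above follow directly from the sequence 1 to 2620 using only Python's standard library. This is a sketch under the assumption that 연번 is that gap-free sequence. The variance here is the sample variance (n - 1 denominator), which is the choice that matches the reported value.

```python
import statistics

# Assumption: 연번 is the gap-free sequence 1..2620.
values = list(range(1, 2621))

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = statistics.stdev(values)          # sample standard deviation
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation
total = sum(values)
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(mean, round(variance, 2), round(std, 5), round(cv, 8), mad, total, strictly_increasing)
```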
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
1762 | 1 | < 0.1%
1744 | 1 | < 0.1%
1745 | 1 | < 0.1%
1746 | 1 | < 0.1%
1747 | 1 | < 0.1%
1748 | 1 | < 0.1%
1749 | 1 | < 0.1%
1750 | 1 | < 0.1%
1751 | 1 | < 0.1%
Other values (2610) | 2610 | 99.6%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
2620 | 1 | < 0.1%
2619 | 1 | < 0.1%
2618 | 1 | < 0.1%
2617 | 1 | < 0.1%
2616 | 1 | < 0.1%
2615 | 1 | < 0.1%
2614 | 1 | < 0.1%
2613 | 1 | < 0.1%
2612 | 1 | < 0.1%
2611 | 1 | < 0.1%

필증관리번호
Text

Distinct: 2352
Distinct (%): 89.8%
Missing: 0
Missing (%): 0.0%
Memory size: 20.6 KiB

Length

Max length: 12
Median length: 10
Mean length: 9.8832061
Min length: 7

Characters and Unicode

Total characters: 25894
Distinct characters: 21
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2338
Unique (%): 89.2%
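
The report distinguishes Distinct (how many different values occur) from Unique (how many values occur exactly once). A minimal sketch of that distinction follows; `distinct_and_unique` is a hypothetical helper and the short sample list is made up for illustration, not taken from the dataset.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Illustrative sample only: one value repeated twice, one three times, one once.
sample = [
    "양주시-2017-489", "양주시-2017-489",
    "양주시-2018-133", "양주시-2018-133", "양주시-2018-133",
    "양주시-2017-492",
]
distinct, unique = distinct_and_unique(sample)

# The reported percentages are the counts divided by the number of observations.
distinct_pct = round(2352 / 2620 * 100, 1)
unique_pct = round(2338 / 2620 * 100, 1)
print(distinct, unique, distinct_pct, unique_pct)
```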

Sample

1st row: 양주시-2016-149
2nd row: 양주시-2017-363
3rd row: 양주시-2017-489
4th row: 양주시-2017-492
5th row: 양주시-2017-493

Value | Count | Frequency (%)
데이터 | 252 | 8.8%
미집계 | 252 | 8.8%
양주시-2018-133 | 3 | 0.1%
양주시-2018-150 | 3 | 0.1%
양주시-2018-070 | 3 | 0.1%
양주시-2017-160 | 3 | 0.1%
양주시-2016-156 | 2 | 0.1%
양주시-2018-152 | 2 | 0.1%
양주시-2017-489 | 2 | 0.1%
양주시-2017-474 | 2 | 0.1%
Other values (2343) | 2348 | 81.8%

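Most occurring characters

Value | Count | Frequency (%)
0 | 8082 | 31.2%
2 | 5552 | 21.4%
- | 2593 | 10.0%
1 | 2282 | 8.8%
3 | 1088 | 4.2%
9 | 886 | 3.4%
4 | 771 | 3.0%
8 | 659 | 2.5%
5 | 564 | 2.2%
6 | 491 | 1.9%
Other values (11) | 2926 | 11.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 20862 | 80.6%
Dash Punctuation | 2593 | 10.0%
Other Letter | 2187 | 8.4%
Space Separator | 252 | 1.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 8082 | 38.7%
2 | 5552 | 26.6%
1 | 2282 | 10.9%
3 | 1088 | 5.2%
9 | 886 | 4.2%
4 | 771 | 3.7%
8 | 659 | 3.2%
5 | 564 | 2.7%
6 | 491 | 2.4%
7 | 487 | 2.3%

Other Letter
Value | Count | Frequency (%)
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 225 | 10.3%
(not preserved) | 225 | 10.3%
(not preserved) | 225 | 10.3%

Dash Punctuation
Value | Count | Frequency (%)
- | 2593 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 252 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 23707 | 91.6%
Hangul | 2187 | 8.4%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 8082 | 34.1%
2 | 5552 | 23.4%
- | 2593 | 10.9%
1 | 2282 | 9.6%
3 | 1088 | 4.6%
9 | 886 | 3.7%
4 | 771 | 3.3%
8 | 659 | 2.8%
5 | 564 | 2.4%
6 | 491 | 2.1%
Other values (2) | 739 | 3.1%

Hangul
Value | Count | Frequency (%)
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 225 | 10.3%
(not preserved) | 225 | 10.3%
(not preserved) | 225 | 10.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 23707 | 91.6%
Hangul | 2187 | 8.4%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 8082 | 34.1%
2 | 5552 | 23.4%
- | 2593 | 10.9%
1 | 2282 | 9.6%
3 | 1088 | 4.6%
9 | 886 | 3.7%
4 | 771 | 3.3%
8 | 659 | 2.8%
5 | 564 | 2.4%
6 | 491 | 2.1%
Other values (2) | 739 | 3.1%

Hangul
Value | Count | Frequency (%)
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 252 | 11.5%
(not preserved) | 225 | 10.3%
(not preserved) | 225 | 10.3%
(not preserved) | 225 | 10.3%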

처리일자
DateTime

Distinct: 563
Distinct (%): 21.5%
Missing: 0
Missing (%): 0.0%
Memory size: 20.6 KiB
Minimum: 2018-01-02 00:00:00
Maximum: 2023-07-21 00:00:00
Histogram with fixed size bins (bins=50)

현장주소
Text

Distinct: 2524
Distinct (%): 96.3%
Missing: 0
Missing (%): 0.0%
Memory size: 20.6 KiB

Length

Max length: 46
Median length: 36
Mean length: 21.025573
Min length: 9

Characters and Unicode

Total characters: 55087
Distinct characters: 159
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2439
Unique (%): 93.1%

Sample

1st row: 경기도 양주시 백석읍 방성리 336-3, 외 1필지
2nd row: 경기도 양주시 봉양동 363-3
3rd row: 경기도 양주시 칠봉산로 251-15 (봉양동)
4th row: 경기도 양주시 광적면 광적로 368-68
5th row: 경기도 양주시 부흥로 2155 (삼숭동)

Value | Count | Frequency (%)
양주시 | 2641 | 19.8%
경기도 | 2633 | 19.7%
(not preserved) | 600 | 4.5%
백석읍 | 403 | 3.0%
1필지 | 354 | 2.6%
은현면 | 329 | 2.5%
광적면 | 259 | 1.9%
장흥면 | 251 | 1.9%
옥정동 | 246 | 1.8%
삼숭동 | 174 | 1.3%
Other values (2475) | 5471 | 40.9%

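Most occurring characters

Value | Count | Frequency (%)
(space) | 10744 | 19.5%
(not preserved) | 2785 | 5.1%
(not preserved) | 2704 | 4.9%
(not preserved) | 2656 | 4.8%
(not preserved) | 2651 | 4.8%
(not preserved) | 2650 | 4.8%
(not preserved) | 2648 | 4.8%
1 | 2228 | 4.0%
- | 2155 | 3.9%
2 | 1586 | 2.9%
Other values (149) | 22280 | 40.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 30419 | 55.2%
Decimal Number | 11225 | 20.4%
Space Separator | 10744 | 19.5%
Dash Punctuation | 2155 | 3.9%
Other Punctuation | 436 | 0.8%
Open Punctuation | 46 | 0.1%
Close Punctuation | 46 | 0.1%
Uppercase Letter | 16 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 2785 | 9.2%
(not preserved) | 2704 | 8.9%
(not preserved) | 2656 | 8.7%
(not preserved) | 2651 | 8.7%
(not preserved) | 2650 | 8.7%
(not preserved) | 2648 | 8.7%
(not preserved) | 1361 | 4.5%
(not preserved) | 1223 | 4.0%
(not preserved) | 998 | 3.3%
(not preserved) | 811 | 2.7%
Other values (128) | 9932 | 32.7%

Decimal Number
Value | Count | Frequency (%)
1 | 2228 | 19.8%
2 | 1586 | 14.1%
3 | 1203 | 10.7%
4 | 1109 | 9.9%
5 | 1054 | 9.4%
6 | 1012 | 9.0%
7 | 800 | 7.1%
0 | 786 | 7.0%
8 | 750 | 6.7%
9 | 697 | 6.2%

Uppercase Letter
Value | Count | Frequency (%)
C | 5 | 31.2%
B | 4 | 25.0%
A | 4 | 25.0%
E | 1 | 6.2%
D | 1 | 6.2%
R | 1 | 6.2%

Space Separator
Value | Count | Frequency (%)
(space) | 10744 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2155 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 436 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 46 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 46 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 30419 | 55.2%
Common | 24652 | 44.8%
Latin | 16 | < 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 2785 | 9.2%
(not preserved) | 2704 | 8.9%
(not preserved) | 2656 | 8.7%
(not preserved) | 2651 | 8.7%
(not preserved) | 2650 | 8.7%
(not preserved) | 2648 | 8.7%
(not preserved) | 1361 | 4.5%
(not preserved) | 1223 | 4.0%
(not preserved) | 998 | 3.3%
(not preserved) | 811 | 2.7%
Other values (128) | 9932 | 32.7%

Common
Value | Count | Frequency (%)
(space) | 10744 | 43.6%
1 | 2228 | 9.0%
- | 2155 | 8.7%
2 | 1586 | 6.4%
3 | 1203 | 4.9%
4 | 1109 | 4.5%
5 | 1054 | 4.3%
6 | 1012 | 4.1%
7 | 800 | 3.2%
0 | 786 | 3.2%
Other values (5) | 1975 | 8.0%

Latin
Value | Count | Frequency (%)
C | 5 | 31.2%
B | 4 | 25.0%
A | 4 | 25.0%
E | 1 | 6.2%
D | 1 | 6.2%
R | 1 | 6.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 30419 | 55.2%
ASCII | 24668 | 44.8%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 10744 | 43.6%
1 | 2228 | 9.0%
- | 2155 | 8.7%
2 | 1586 | 6.4%
3 | 1203 | 4.9%
4 | 1109 | 4.5%
5 | 1054 | 4.3%
6 | 1012 | 4.1%
7 | 800 | 3.2%
0 | 786 | 3.2%
Other values (11) | 1991 | 8.1%

Hangul
Value | Count | Frequency (%)
(not preserved) | 2785 | 9.2%
(not preserved) | 2704 | 8.9%
(not preserved) | 2656 | 8.7%
(not preserved) | 2651 | 8.7%
(not preserved) | 2650 | 8.7%
(not preserved) | 2648 | 8.7%
(not preserved) | 1361 | 4.5%
(not preserved) | 1223 | 4.0%
(not preserved) | 998 | 3.3%
(not preserved) | 811 | 2.7%
Other values (128) | 9932 | 32.7%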

공사의 종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 12
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 20.6 KiB

Value | Count
구내통신선로설비,방송공동수신설비(종합유선방송) | 2164
데이터 미집계 | 252
구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송) | 131
구내통신선로설비 | 27
구내통신선로설비,방송공동수신설비(종합유선방송),이동통신구내선로설비 | 27
Other values (7) | 19

Length

Max length: 55
Median length: 25
Mean length: 24.250382
Min length: 7

Unique

Unique: 3
Unique (%): 0.1%

Sample

1st row: 구내통신선로설비,방송공동수신설비(종합유선방송)
2nd row: 구내통신선로설비,방송공동수신설비(종합유선방송)
3rd row: 구내통신선로설비,방송공동수신설비(종합유선방송)
4th row: 구내통신선로설비,방송공동수신설비(종합유선방송)
5th row: 구내통신선로설비,방송공동수신설비(종합유선방송)

Common Values

Value | Count | Frequency (%)
구내통신선로설비,방송공동수신설비(종합유선방송) | 2164 | 82.6%
데이터 미집계 | 252 | 9.6%
구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송) | 131 | 5.0%
구내통신선로설비 | 27 | 1.0%
구내통신선로설비,방송공동수신설비(종합유선방송),이동통신구내선로설비 | 27 | 1.0%
구내통신선로설비,방송공동수신설비(지상파TV,FM라디오방송,종합유선방송) | 9 | 0.3%
구내통신선로설비,방송공동수신설비(지상파TV,FM라디오방송,종합유선방송),이동통신구내선로설비 | 3 | 0.1%
구내통신선로설비,방송공동수신설비() | 2 | 0.1%
구내통신선로설비,방송공동수신설비(지상파TV,위성방송,FM라디오방송,종합유선방송),이동통신구내선로설비 | 2 | 0.1%
구내통신선로설비,방송공동수신설비(지상파TV,종합유선방송) | 1 | < 0.1%
Other values (2) | 2 | 0.1%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
구내통신선로설비,방송공동수신설비(종합유선방송 | 2164 | 75.3%
데이터 | 252 | 8.8%
미집계 | 252 | 8.8%
구내통신선로설비,방송공동수신설비(지상파tv,위성방송,fm라디오방송,종합유선방송 | 131 | 4.6%
구내통신선로설비 | 27 | 0.9%
구내통신선로설비,방송공동수신설비(종합유선방송),이동통신구내선로설비 | 27 | 0.9%
구내통신선로설비,방송공동수신설비(지상파tv,fm라디오방송,종합유선방송 | 9 | 0.3%
구내통신선로설비,방송공동수신설비(지상파tv,fm라디오방송,종합유선방송),이동통신구내선로설비 | 3 | 0.1%
구내통신선로설비,방송공동수신설비 | 2 | 0.1%
구내통신선로설비,방송공동수신설비(지상파tv,위성방송,fm라디오방송,종합유선방송),이동통신구내선로설비 | 2 | 0.1%
Other values (3) | 3 | 0.1%

교부연월일
Text

Distinct: 516
Distinct (%): 19.7%
Missing: 0
Missing (%): 0.0%
Memory size: 20.6 KiB

Length

Max length: 10
Median length: 10
Mean length: 9.7114504
Min length: 7

Characters and Unicode

Total characters: 25444
Distinct characters: 18
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 58
Unique (%): 2.2%

Sample

1st row: 2018-01-02
2nd row: 2018-01-02
3rd row: 2018-01-02
4th row: 2018-01-02
5th row: 2018-01-02

Value | Count | Frequency (%)
데이터 | 252 | 8.8%
미집계 | 252 | 8.8%
2022-09-23 | 37 | 1.3%
2023-05-24 | 13 | 0.5%
2021-12-03 | 12 | 0.4%
2018-05-25 | 12 | 0.4%
2021-06-23 | 12 | 0.4%
2018-06-15 | 12 | 0.4%
2021-12-31 | 12 | 0.4%
2021-09-15 | 11 | 0.4%
Other values (507) | 2247 | 78.2%

Most occurring characters

Value | Count | Frequency (%)
2 | 6035 | 23.7%
0 | 5528 | 21.7%
- | 4736 | 18.6%
1 | 3297 | 13.0%
3 | 818 | 3.2%
9 | 808 | 3.2%
8 | 681 | 2.7%
5 | 488 | 1.9%
6 | 438 | 1.7%
4 | 432 | 1.7%
Other values (8) | 2183 | 8.6%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 18944 | 74.5%
Dash Punctuation | 4736 | 18.6%
Other Letter | 1512 | 5.9%
Space Separator | 252 | 1.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 6035 | 31.9%
0 | 5528 | 29.2%
1 | 3297 | 17.4%
3 | 818 | 4.3%
9 | 808 | 4.3%
8 | 681 | 3.6%
5 | 488 | 2.6%
6 | 438 | 2.3%
4 | 432 | 2.3%
7 | 419 | 2.2%

Other Letter
Value | Count | Frequency (%)
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%

Dash Punctuation
Value | Count | Frequency (%)
- | 4736 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 252 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 23932 | 94.1%
Hangul | 1512 | 5.9%

Most frequent character per script

Common
Value | Count | Frequency (%)
2 | 6035 | 25.2%
0 | 5528 | 23.1%
- | 4736 | 19.8%
1 | 3297 | 13.8%
3 | 818 | 3.4%
9 | 808 | 3.4%
8 | 681 | 2.8%
5 | 488 | 2.0%
6 | 438 | 1.8%
4 | 432 | 1.8%
Other values (2) | 671 | 2.8%

Hangul
Value | Count | Frequency (%)
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 23932 | 94.1%
Hangul | 1512 | 5.9%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
2 | 6035 | 25.2%
0 | 5528 | 23.1%
- | 4736 | 19.8%
1 | 3297 | 13.8%
3 | 818 | 3.4%
9 | 808 | 3.4%
8 | 681 | 2.8%
5 | 488 | 2.0%
6 | 438 | 1.8%
4 | 432 | 1.8%
Other values (2) | 671 | 2.8%

Hangul
Value | Count | Frequency (%)
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%

수수료
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 20.6 KiB

Value | Count
20000 | 1258
30000 | 563
40000 | 281
데이터 미집계 | 252
<NA> | 224
Other values (2) | 42

Length

Max length: 7
Median length: 5
Mean length: 5.0916031
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
20000 | 1258 | 48.0%
30000 | 563 | 21.5%
40000 | 281 | 10.7%
데이터 미집계 | 252 | 9.6%
<NA> | 224 | 8.5%
60000 | 32 | 1.2%
0 | 10 | 0.4%
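
Although the report shows 0 missing cells, two of the 수수료 categories look like placeholders: "데이터 미집계" (roughly "data not tallied") and the literal string "<NA>". The sketch below estimates how much of the column would count as missing if both were treated that way. That treatment is an assumption made here, not something the report does, and `effective_missing` is a hypothetical helper.

```python
# Assumption: these two strings are placeholders for absent fee values.
PLACEHOLDERS = {"데이터 미집계", "<NA>"}


def effective_missing(value_counts, placeholders=PLACEHOLDERS):
    """Return (rows holding a placeholder, their share of all rows)."""
    total = sum(value_counts.values())
    missing = sum(n for value, n in value_counts.items() if value in placeholders)
    return missing, missing / total


# Counts copied from the Common Values table for 수수료.
fee_counts = {
    "20000": 1258,
    "30000": 563,
    "40000": 281,
    "데이터 미집계": 252,
    "<NA>": 224,
    "60000": 32,
    "0": 10,
}

missing, share = effective_missing(fee_counts)
print(missing, f"{share:.1%}")
```

Under that assumption, 476 of the 2620 rows carry no usable fee.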

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
20000 | 1258 | 43.8%
30000 | 563 | 19.6%
40000 | 281 | 9.8%
데이터 | 252 | 8.8%
미집계 | 252 | 8.8%
na | 224 | 7.8%
60000 | 32 | 1.1%
0 | 10 | 0.3%

재검사여부
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 20.6 KiB

Value | Count
신규 | 2368
데이터 미집계 | 252

Length

Max length: 7
Median length: 2
Mean length: 2.480916
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 신규
2nd row: 신규
3rd row: 신규
4th row: 신규
5th row: 신규

Common Values

Value | Count | Frequency (%)
신규 | 2368 | 90.4%
데이터 미집계 | 252 | 9.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
신규 | 2368 | 82.5%
데이터 | 252 | 8.8%
미집계 | 252 | 8.8%

판정
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 20.6 KiB

Value | Count
적합 | 2368
데이터 미집계 | 252

Length

Max length: 7
Median length: 2
Mean length: 2.480916
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 적합
2nd row: 적합
3rd row: 적합
4th row: 적합
5th row: 적합

Common Values

Value | Count | Frequency (%)
적합 | 2368 | 90.4%
데이터 미집계 | 252 | 9.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
적합 | 2368 | 82.5%
데이터 | 252 | 8.8%
미집계 | 252 | 8.8%

시공자상호명
Text

Distinct: 268
Distinct (%): 10.2%
Missing: 0
Missing (%): 0.0%
Memory size: 20.6 KiB

Length

Max length: 47
Median length: 14
Mean length: 7.651145
Min length: 3

Characters and Unicode

Total characters: 20046
Distinct characters: 227
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 120
Unique (%): 4.6%

Sample

1st row: 에스엠컴
2nd row: (주)레이텍
3rd row: (주)레이텍
4th row: (주)하나정보기술
5th row: (주)동하통신기술

Value | Count | Frequency (%)
주)동하통신기술 | 387 | 12.5%
데이터 | 252 | 8.1%
미집계 | 252 | 8.1%
주)레이텍 | 249 | 8.0%
주식회사 | 215 | 6.9%
에스엠컴 | 170 | 5.5%
주)재성이엔씨 | 164 | 5.3%
하나정보기술(주 | 117 | 3.8%
주)거성 | 80 | 2.6%
주)광명텔레콤 | 70 | 2.3%
Other values (261) | 1142 | 36.9%

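Most occurring characters

Value | Count | Frequency (%)
(not preserved) | 1903 | 9.5%
) | 1657 | 8.3%
( | 1655 | 8.3%
(not preserved) | 1012 | 5.0%
(not preserved) | 902 | 4.5%
(not preserved) | 828 | 4.1%
(not preserved) | 650 | 3.2%
(not preserved) | 544 | 2.7%
(not preserved) | 543 | 2.7%
(space) | 478 | 2.4%
Other values (217) | 9874 | 49.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 16164 | 80.6%
Close Punctuation | 1657 | 8.3%
Open Punctuation | 1655 | 8.3%
Space Separator | 478 | 2.4%
Lowercase Letter | 59 | 0.3%
Uppercase Letter | 20 | 0.1%
Other Punctuation | 13 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 1903 | 11.8%
(not preserved) | 1012 | 6.3%
(not preserved) | 902 | 5.6%
(not preserved) | 828 | 5.1%
(not preserved) | 650 | 4.0%
(not preserved) | 544 | 3.4%
(not preserved) | 543 | 3.4%
(not preserved) | 432 | 2.7%
(not preserved) | 419 | 2.6%
(not preserved) | 405 | 2.5%
Other values (189) | 8526 | 52.7%

Lowercase Letter
Value | Count | Frequency (%)
t | 13 | 22.0%
e | 8 | 13.6%
o | 8 | 13.6%
l | 7 | 11.9%
d | 6 | 10.2%
i | 6 | 10.2%
c | 3 | 5.1%
m | 1 | 1.7%
r | 1 | 1.7%
a | 1 | 1.7%
Other values (5) | 5 | 8.5%

Uppercase Letter
Value | Count | Frequency (%)
L | 6 | 30.0%
C | 6 | 30.0%
K | 5 | 25.0%
A | 1 | 5.0%
I | 1 | 5.0%
N | 1 | 5.0%

Other Punctuation
Value | Count | Frequency (%)
. | 6 | 46.2%
, | 5 | 38.5%
& | 1 | 7.7%
; | 1 | 7.7%

Close Punctuation
Value | Count | Frequency (%)
) | 1657 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1655 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 478 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 16164 | 80.6%
Common | 3803 | 19.0%
Latin | 79 | 0.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 1903 | 11.8%
(not preserved) | 1012 | 6.3%
(not preserved) | 902 | 5.6%
(not preserved) | 828 | 5.1%
(not preserved) | 650 | 4.0%
(not preserved) | 544 | 3.4%
(not preserved) | 543 | 3.4%
(not preserved) | 432 | 2.7%
(not preserved) | 419 | 2.6%
(not preserved) | 405 | 2.5%
Other values (189) | 8526 | 52.7%

Latin
Value | Count | Frequency (%)
t | 13 | 16.5%
e | 8 | 10.1%
o | 8 | 10.1%
l | 7 | 8.9%
L | 6 | 7.6%
d | 6 | 7.6%
C | 6 | 7.6%
i | 6 | 7.6%
K | 5 | 6.3%
c | 3 | 3.8%
Other values (11) | 11 | 13.9%

Common
Value | Count | Frequency (%)
) | 1657 | 43.6%
( | 1655 | 43.5%
(space) | 478 | 12.6%
. | 6 | 0.2%
, | 5 | 0.1%
& | 1 | < 0.1%
; | 1 | < 0.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 16164 | 80.6%
ASCII | 3882 | 19.4%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not preserved) | 1903 | 11.8%
(not preserved) | 1012 | 6.3%
(not preserved) | 902 | 5.6%
(not preserved) | 828 | 5.1%
(not preserved) | 650 | 4.0%
(not preserved) | 544 | 3.4%
(not preserved) | 543 | 3.4%
(not preserved) | 432 | 2.7%
(not preserved) | 419 | 2.6%
(not preserved) | 405 | 2.5%
Other values (189) | 8526 | 52.7%

ASCII
Value | Count | Frequency (%)
) | 1657 | 42.7%
( | 1655 | 42.6%
(space) | 478 | 12.3%
t | 13 | 0.3%
e | 8 | 0.2%
o | 8 | 0.2%
l | 7 | 0.2%
L | 6 | 0.2%
d | 6 | 0.2%
C | 6 | 0.2%
Other values (18) | 38 | 1.0%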

시공자등록번호
Text

Distinct: 235
Distinct (%): 9.0%
Missing: 0
Missing (%): 0.0%
Memory size: 20.6 KiB

Length

Max length: 7
Median length: 6
Mean length: 6.0958015
Min length: 5

Characters and Unicode

Total characters: 15971
Distinct characters: 17
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 97
Unique (%): 3.7%

Sample

1st row: 112648
2nd row: 111676
3rd row: 111676
4th row: 350067
5th row: 350052

Value | Count | Frequency (%)
350052 | 393 | 13.7%
111676 | 253 | 8.8%
데이터 | 252 | 8.8%
미집계 | 252 | 8.8%
112648 | 170 | 5.9%
350004 | 166 | 5.8%
350067 | 120 | 4.2%
350002 | 87 | 3.0%
111787 | 74 | 2.6%
350091 | 51 | 1.8%
Other values (226) | 1054 | 36.7%

Most occurring characters

Value | Count | Frequency (%)
0 | 3016 | 18.9%
1 | 2976 | 18.6%
5 | 1678 | 10.5%
3 | 1586 | 9.9%
2 | 1476 | 9.2%
6 | 1037 | 6.5%
4 | 856 | 5.4%
7 | 773 | 4.8%
8 | 540 | 3.4%
9 | 269 | 1.7%
Other values (7) | 1764 | 11.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 14207 | 89.0%
Other Letter | 1512 | 9.5%
Space Separator | 252 | 1.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 3016 | 21.2%
1 | 2976 | 20.9%
5 | 1678 | 11.8%
3 | 1586 | 11.2%
2 | 1476 | 10.4%
6 | 1037 | 7.3%
4 | 856 | 6.0%
7 | 773 | 5.4%
8 | 540 | 3.8%
9 | 269 | 1.9%

Other Letter
Value | Count | Frequency (%)
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%

Space Separator
Value | Count | Frequency (%)
(space) | 252 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 14459 | 90.5%
Hangul | 1512 | 9.5%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 3016 | 20.9%
1 | 2976 | 20.6%
5 | 1678 | 11.6%
3 | 1586 | 11.0%
2 | 1476 | 10.2%
6 | 1037 | 7.2%
4 | 856 | 5.9%
7 | 773 | 5.3%
8 | 540 | 3.7%
9 | 269 | 1.9%

Hangul
Value | Count | Frequency (%)
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 14459 | 90.5%
Hangul | 1512 | 9.5%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 3016 | 20.9%
1 | 2976 | 20.6%
5 | 1678 | 11.6%
3 | 1586 | 11.0%
2 | 1476 | 10.2%
6 | 1037 | 7.2%
4 | 856 | 5.9%
7 | 773 | 5.3%
8 | 540 | 3.7%
9 | 269 | 1.9%

Hangul
Value | Count | Frequency (%)
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%
(not preserved) | 252 | 16.7%

관리기관명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 20.6 KiB

Value | Count
양주시 정보통신과 | 2620

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 양주시 정보통신과
2nd row: 양주시 정보통신과
3rd row: 양주시 정보통신과
4th row: 양주시 정보통신과
5th row: 양주시 정보통신과

Common Values

Value | Count | Frequency (%)
양주시 정보통신과 | 2620 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
양주시 | 2620 | 50.0%
정보통신과 | 2620 | 50.0%

관리기관 전화번호
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 20.6 KiB

Value | Count
031-8082-5373 | 1672
031-8082-5372 | 948

Length

Max length: 13
Median length: 13
Mean length: 13
Min length: 13

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 031-8082-5372
2nd row: 031-8082-5372
3rd row: 031-8082-5372
4th row: 031-8082-5372
5th row: 031-8082-5372

Common Values

Value | Count | Frequency (%)
031-8082-5373 | 1672 | 63.8%
031-8082-5372 | 948 | 36.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
031-8082-5373 | 1672 | 63.8%
031-8082-5372 | 948 | 36.2%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 20.6 KiB

Value | Count
2023-07-21 | 2620

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-07-21
2nd row: 2023-07-21
3rd row: 2023-07-21
4th row: 2023-07-21
5th row: 2023-07-21

Common Values

Value | Count | Frequency (%)
2023-07-21 | 2620 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-07-21 | 2620 | 100.0%

Interactions


Correlations

Variable | 연번 | 공사의 종류 | 수수료 | 재검사여부 | 판정 | 관리기관 전화번호
연번 | 1.000 | 0.579 | 0.648 | 0.961 | 0.961 | 0.996
공사의 종류 | 0.579 | 1.000 | 0.742 | 1.000 | 1.000 | 0.568
수수료 | 0.648 | 0.742 | 1.000 | 1.000 | 1.000 | 0.708
재검사여부 | 0.961 | 1.000 | 1.000 | 1.000 | 1.000 | 0.627
판정 | 0.961 | 1.000 | 1.000 | 1.000 | 1.000 | 0.627
관리기관 전화번호 | 0.996 | 0.568 | 0.708 | 0.627 | 0.627 | 1.000

Variable | 공사의 종류 | 판정 | 관리기관 전화번호 | 수수료 | 재검사여부
공사의 종류 | 1.000 | 0.998 | 0.443 | 0.501 | 0.998
판정 | 0.998 | 1.000 | 0.432 | 0.999 | 0.998
관리기관 전화번호 | 0.443 | 0.432 | 1.000 | 0.521 | 0.432
수수료 | 0.501 | 0.999 | 0.521 | 1.000 | 0.999
재검사여부 | 0.998 | 0.998 | 0.432 | 0.999 | 1.000

Variable | 연번 | 공사의 종류 | 수수료 | 재검사여부 | 판정 | 관리기관 전화번호
연번 | 1.000 | 0.287 | 0.412 | 0.829 | 0.829 | 0.946
공사의 종류 | 0.287 | 1.000 | 0.501 | 0.998 | 0.998 | 0.443
수수료 | 0.412 | 0.501 | 1.000 | 0.999 | 0.999 | 0.521
재검사여부 | 0.829 | 0.998 | 0.999 | 1.000 | 0.998 | 0.432
판정 | 0.829 | 0.998 | 0.999 | 0.998 | 1.000 | 0.432
관리기관 전화번호 | 0.946 | 0.443 | 0.521 | 0.432 | 0.432 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

(index) | 연번 | 필증관리번호 | 처리일자 | 현장주소 | 공사의 종류 | 교부연월일 | 수수료 | 재검사여부 | 판정 | 시공자상호명 | 시공자등록번호 | 관리기관명 | 관리기관 전화번호 | 데이터기준일자
0 | 1 | 양주시-2016-149 | 2018-01-02 | 경기도 양주시 백석읍 방성리 336-3, 외 1필지 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2018-01-02 | <NA> | 신규 | 적합 | 에스엠컴 | 112648 | 양주시 정보통신과 | 031-8082-5372 | 2023-07-21
1 | 2 | 양주시-2017-363 | 2018-01-02 | 경기도 양주시 봉양동 363-3 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2018-01-02 | <NA> | 신규 | 적합 | (주)레이텍 | 111676 | 양주시 정보통신과 | 031-8082-5372 | 2023-07-21
2 | 3 | 양주시-2017-489 | 2018-01-02 | 경기도 양주시 칠봉산로 251-15 (봉양동) | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2018-01-02 | <NA> | 신규 | 적합 | (주)레이텍 | 111676 | 양주시 정보통신과 | 031-8082-5372 | 2023-07-21
3 | 4 | 양주시-2017-492 | 2018-01-02 | 경기도 양주시 광적면 광적로 368-68 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2018-01-02 | <NA> | 신규 | 적합 | (주)하나정보기술 | 350067 | 양주시 정보통신과 | 031-8082-5372 | 2023-07-21
4 | 5 | 양주시-2017-493 | 2018-01-02 | 경기도 양주시 부흥로 2155 (삼숭동) | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2018-01-02 | <NA> | 신규 | 적합 | (주)동하통신기술 | 350052 | 양주시 정보통신과 | 031-8082-5372 | 2023-07-21
5 | 6 | 양주시-2017-494 | 2018-01-02 | 경기도 양주시 평화로 1285 (산북동) | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2018-01-02 | <NA> | 신규 | 적합 | (주)동하통신기술 | 350052 | 양주시 정보통신과 | 031-8082-5372 | 2023-07-21
6 | 7 | 양주시-2017-495 | 2018-01-02 | 경기도 양주시 백석읍 호명로 96, 복지1리 마을회관 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2018-01-02 | <NA> | 신규 | 적합 | 금성통신건설(주) | 112394 | 양주시 정보통신과 | 031-8082-5372 | 2023-07-21
7 | 8 | 양주시-2017-496 | 2018-01-02 | 경기도 양주시 광적면 삼일로65번길 157 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2018-01-02 | <NA> | 신규 | 적합 | (주)레이텍 | 111676 | 양주시 정보통신과 | 031-8082-5372 | 2023-07-21
8 | 9 | 양주시-2018-001 | 2018-01-03 | 경기도 양주시 삼숭로 56-19 (삼숭동) | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2018-01-03 | <NA> | 신규 | 적합 | 에스엠컴 | 112648 | 양주시 정보통신과 | 031-8082-5372 | 2023-07-21
9 | 10 | 양주시-2018-002 | 2018-01-03 | 경기도 양주시 장흥면 호국로441번길 46-29, 주식회사 어울림 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2018-01-03 | <NA> | 신규 | 적합 | 중앙데이타통신 | 111591 | 양주시 정보통신과 | 031-8082-5372 | 2023-07-21

Last rows

(index) | 연번 | 필증관리번호 | 처리일자 | 현장주소 | 공사의 종류 | 교부연월일 | 수수료 | 재검사여부 | 판정 | 시공자상호명 | 시공자등록번호 | 관리기관명 | 관리기관 전화번호 | 데이터기준일자
2610 | 2611 | 2023-00220 | 2023-07-12 | 경기도 양주시 삼숭동 149-1 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2023-07-12 | 20000 | 신규 | 적합 | (주)동하통신기술 | 350052 | 양주시 정보통신과 | 031-8082-5373 | 2023-07-21
2611 | 2612 | 2023-00222 | 2023-07-14 | 경기도 양주시 광적면 광석리 370-6 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2023-07-14 | 30000 | 신규 | 적합 | (주)거성 | 350002 | 양주시 정보통신과 | 031-8082-5373 | 2023-07-21
2612 | 2613 | 2023-00227 | 2023-07-19 | 경기도 양주시 은현면 하패리 128-8 외 2필지 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2023-07-19 | 30000 | 신규 | 적합 | (주)동하통신기술 | 350052 | 양주시 정보통신과 | 031-8082-5373 | 2023-07-21
2613 | 2614 | 2023-00223 | 2023-07-19 | 경기도 양주시 봉양동 564-26 외 1필지 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2023-07-19 | 40000 | 신규 | 적합 | (주)동하통신기술 | 350052 | 양주시 정보통신과 | 031-8082-5373 | 2023-07-21
2614 | 2615 | 2023-00225 | 2023-07-19 | 경기도 양주시 은현면 운암리 195-8 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2023-07-19 | 20000 | 신규 | 적합 | (주)동하통신기술 | 350052 | 양주시 정보통신과 | 031-8082-5373 | 2023-07-21
2615 | 2616 | 2023-00226 | 2023-07-19 | 경기도 양주시 은현면 운암리 195-8 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2023-07-19 | 20000 | 신규 | 적합 | (주)동하통신기술 | 350052 | 양주시 정보통신과 | 031-8082-5373 | 2023-07-21
2616 | 2617 | 2023-00224 | 2023-07-19 | 경기도 양주시 은현면 운암리 195-7 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2023-07-19 | 20000 | 신규 | 적합 | (주)동하통신기술 | 350052 | 양주시 정보통신과 | 031-8082-5373 | 2023-07-21
2617 | 2618 | 2023-00228 | 2023-07-19 | 경기도 양주시 백석읍 방성리 16-22 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2023-07-19 | 30000 | 신규 | 적합 | (주)광명텔레콤 | 111787 | 양주시 정보통신과 | 031-8082-5373 | 2023-07-21
2618 | 2619 | 2023-00229 | 2023-07-21 | 경기도 양주시 만송동 477-15 | 구내통신선로설비,방송공동수신설비(종합유선방송) | 2023-07-21 | 20000 | 신규 | 적합 | (주)광명텔레콤 | 111787 | 양주시 정보통신과 | 031-8082-5373 | 2023-07-21
2619 | 2620 | 2023-00230 | 2023-07-21 | 경기도 양주시 삼숭동 280-9 외 1필지 | 구내통신선로설비,방송공동수신설비(종합유선방송),이동통신구내선로설비 | 2023-07-21 | 40000 | 신규 | 적합 | 동호이앤씨 주식회사 | 311831 | 양주시 정보통신과 | 031-8082-5373 | 2023-07-21