Amino acid dipepetide frequency for Pseudomonas phage vB_PaeP_MAG4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.399AlaAla: 9.399 ± 1.076
1.098AlaCys: 1.098 ± 0.297
4.919AlaAsp: 4.919 ± 0.56
7.159AlaGlu: 7.159 ± 0.651
2.987AlaPhe: 2.987 ± 0.347
6.149AlaGly: 6.149 ± 0.616
1.186AlaHis: 1.186 ± 0.213
3.645AlaIle: 3.645 ± 0.564
6.5AlaLys: 6.5 ± 0.629
8.213AlaLeu: 8.213 ± 0.894
2.503AlaMet: 2.503 ± 0.246
4.085AlaAsn: 4.085 ± 0.401
2.459AlaPro: 2.459 ± 0.442
4.26AlaGln: 4.26 ± 0.648
4.963AlaArg: 4.963 ± 0.516
5.226AlaSer: 5.226 ± 0.554
5.402AlaThr: 5.402 ± 0.616
7.993AlaVal: 7.993 ± 0.59
1.054AlaTrp: 1.054 ± 0.274
2.767AlaTyr: 2.767 ± 0.286
0.0AlaXaa: 0.0 ± 0.0
Cys
0.527CysAla: 0.527 ± 0.153
0.264CysCys: 0.264 ± 0.118
0.527CysAsp: 0.527 ± 0.195
0.615CysGlu: 0.615 ± 0.206
0.395CysPhe: 0.395 ± 0.169
0.791CysGly: 0.791 ± 0.229
0.307CysHis: 0.307 ± 0.113
0.527CysIle: 0.527 ± 0.145
0.527CysLys: 0.527 ± 0.158
0.791CysLeu: 0.791 ± 0.211
0.176CysMet: 0.176 ± 0.079
0.527CysAsn: 0.527 ± 0.23
0.351CysPro: 0.351 ± 0.11
0.395CysGln: 0.395 ± 0.135
0.615CysArg: 0.615 ± 0.163
0.483CysSer: 0.483 ± 0.147
0.395CysThr: 0.395 ± 0.122
0.659CysVal: 0.659 ± 0.181
0.176CysTrp: 0.176 ± 0.114
0.307CysTyr: 0.307 ± 0.112
0.0CysXaa: 0.0 ± 0.0
Asp
6.105AspAla: 6.105 ± 0.53
0.483AspCys: 0.483 ± 0.163
3.25AspAsp: 3.25 ± 0.422
4.304AspGlu: 4.304 ± 0.407
2.416AspPhe: 2.416 ± 0.403
4.216AspGly: 4.216 ± 0.373
1.01AspHis: 1.01 ± 0.269
3.382AspIle: 3.382 ± 0.306
2.811AspLys: 2.811 ± 0.385
5.578AspLeu: 5.578 ± 0.446
1.713AspMet: 1.713 ± 0.287
2.416AspAsn: 2.416 ± 0.275
3.118AspPro: 3.118 ± 0.451
2.591AspGln: 2.591 ± 0.36
3.074AspArg: 3.074 ± 0.475
3.909AspSer: 3.909 ± 0.384
3.118AspThr: 3.118 ± 0.326
3.382AspVal: 3.382 ± 0.384
0.966AspTrp: 0.966 ± 0.2
2.02AspTyr: 2.02 ± 0.329
0.0AspXaa: 0.0 ± 0.0
Glu
7.335GluAla: 7.335 ± 0.92
0.703GluCys: 0.703 ± 0.261
3.953GluAsp: 3.953 ± 0.331
5.402GluGlu: 5.402 ± 0.768
2.284GluPhe: 2.284 ± 0.323
4.348GluGly: 4.348 ± 0.551
1.669GluHis: 1.669 ± 0.263
3.645GluIle: 3.645 ± 0.458
3.689GluLys: 3.689 ± 0.376
6.324GluLeu: 6.324 ± 0.483
2.591GluMet: 2.591 ± 0.368
2.416GluAsn: 2.416 ± 0.263
2.459GluPro: 2.459 ± 0.445
3.733GluGln: 3.733 ± 0.494
3.426GluArg: 3.426 ± 0.472
3.689GluSer: 3.689 ± 0.468
2.811GluThr: 2.811 ± 0.313
5.27GluVal: 5.27 ± 0.466
1.01GluTrp: 1.01 ± 0.239
2.416GluTyr: 2.416 ± 0.329
0.0GluXaa: 0.0 ± 0.0
Phe
2.459PheAla: 2.459 ± 0.309
0.351PheCys: 0.351 ± 0.119
2.723PheAsp: 2.723 ± 0.348
1.625PheGlu: 1.625 ± 0.396
2.064PhePhe: 2.064 ± 0.304
2.503PheGly: 2.503 ± 0.31
0.747PheHis: 0.747 ± 0.213
2.591PheIle: 2.591 ± 0.332
2.284PheLys: 2.284 ± 0.296
3.25PheLeu: 3.25 ± 0.387
1.054PheMet: 1.054 ± 0.213
2.24PheAsn: 2.24 ± 0.27
1.098PhePro: 1.098 ± 0.303
1.932PheGln: 1.932 ± 0.293
2.503PheArg: 2.503 ± 0.358
2.723PheSer: 2.723 ± 0.372
2.328PheThr: 2.328 ± 0.375
2.416PheVal: 2.416 ± 0.275
0.307PheTrp: 0.307 ± 0.139
1.405PheTyr: 1.405 ± 0.261
0.0PheXaa: 0.0 ± 0.0
Gly
5.71GlyAla: 5.71 ± 0.465
0.571GlyCys: 0.571 ± 0.174
3.294GlyAsp: 3.294 ± 0.419
4.919GlyGlu: 4.919 ± 0.551
3.47GlyPhe: 3.47 ± 0.421
4.128GlyGly: 4.128 ± 0.51
1.054GlyHis: 1.054 ± 0.263
3.645GlyIle: 3.645 ± 0.41
4.041GlyLys: 4.041 ± 0.495
6.017GlyLeu: 6.017 ± 0.588
2.24GlyMet: 2.24 ± 0.27
3.03GlyAsn: 3.03 ± 0.359
1.932GlyPro: 1.932 ± 0.268
3.206GlyGln: 3.206 ± 0.419
3.074GlyArg: 3.074 ± 0.377
4.128GlySer: 4.128 ± 0.347
3.865GlyThr: 3.865 ± 0.484
5.622GlyVal: 5.622 ± 0.509
1.142GlyTrp: 1.142 ± 0.213
2.196GlyTyr: 2.196 ± 0.323
0.0GlyXaa: 0.0 ± 0.0
His
1.669HisAla: 1.669 ± 0.26
0.088HisCys: 0.088 ± 0.065
1.01HisAsp: 1.01 ± 0.24
1.186HisGlu: 1.186 ± 0.201
1.142HisPhe: 1.142 ± 0.233
1.23HisGly: 1.23 ± 0.248
0.571HisHis: 0.571 ± 0.175
1.098HisIle: 1.098 ± 0.234
0.834HisLys: 0.834 ± 0.2
1.713HisLeu: 1.713 ± 0.337
0.571HisMet: 0.571 ± 0.143
0.966HisAsn: 0.966 ± 0.211
0.747HisPro: 0.747 ± 0.225
0.571HisGln: 0.571 ± 0.18
1.23HisArg: 1.23 ± 0.265
1.23HisSer: 1.23 ± 0.319
0.703HisThr: 0.703 ± 0.167
1.493HisVal: 1.493 ± 0.264
0.439HisTrp: 0.439 ± 0.183
0.747HisTyr: 0.747 ± 0.193
0.0HisXaa: 0.0 ± 0.0
Ile
5.095IleAla: 5.095 ± 0.503
0.483IleCys: 0.483 ± 0.149
2.943IleAsp: 2.943 ± 0.456
3.162IleGlu: 3.162 ± 0.33
1.713IlePhe: 1.713 ± 0.281
3.689IleGly: 3.689 ± 0.4
1.405IleHis: 1.405 ± 0.3
1.889IleIle: 1.889 ± 0.394
3.206IleLys: 3.206 ± 0.364
3.514IleLeu: 3.514 ± 0.425
1.054IleMet: 1.054 ± 0.29
2.767IleAsn: 2.767 ± 0.383
2.196IlePro: 2.196 ± 0.421
2.635IleGln: 2.635 ± 0.367
3.689IleArg: 3.689 ± 0.378
2.284IleSer: 2.284 ± 0.318
3.118IleThr: 3.118 ± 0.373
3.426IleVal: 3.426 ± 0.307
0.307IleTrp: 0.307 ± 0.126
1.318IleTyr: 1.318 ± 0.287
0.0IleXaa: 0.0 ± 0.0
Lys
6.061LysAla: 6.061 ± 0.641
0.527LysCys: 0.527 ± 0.184
4.172LysAsp: 4.172 ± 0.505
3.733LysGlu: 3.733 ± 0.588
1.669LysPhe: 1.669 ± 0.223
3.382LysGly: 3.382 ± 0.424
1.142LysHis: 1.142 ± 0.228
2.503LysIle: 2.503 ± 0.332
3.865LysLys: 3.865 ± 0.634
5.973LysLeu: 5.973 ± 0.559
1.669LysMet: 1.669 ± 0.27
1.932LysAsn: 1.932 ± 0.28
2.591LysPro: 2.591 ± 0.324
2.196LysGln: 2.196 ± 0.357
3.074LysArg: 3.074 ± 0.446
3.601LysSer: 3.601 ± 0.383
3.294LysThr: 3.294 ± 0.314
4.612LysVal: 4.612 ± 0.52
0.615LysTrp: 0.615 ± 0.136
2.24LysTyr: 2.24 ± 0.354
0.0LysXaa: 0.0 ± 0.0
Leu
7.466LeuAla: 7.466 ± 0.571
1.186LeuCys: 1.186 ± 0.29
6.412LeuAsp: 6.412 ± 0.594
5.797LeuGlu: 5.797 ± 0.544
3.382LeuPhe: 3.382 ± 0.465
5.095LeuGly: 5.095 ± 0.567
1.932LeuHis: 1.932 ± 0.292
4.348LeuIle: 4.348 ± 0.535
5.929LeuLys: 5.929 ± 0.565
7.949LeuLeu: 7.949 ± 0.85
2.987LeuMet: 2.987 ± 0.329
4.304LeuAsn: 4.304 ± 0.44
4.26LeuPro: 4.26 ± 0.459
3.25LeuGln: 3.25 ± 0.439
4.655LeuArg: 4.655 ± 0.558
5.051LeuSer: 5.051 ± 0.491
5.446LeuThr: 5.446 ± 0.521
6.544LeuVal: 6.544 ± 0.65
0.966LeuTrp: 0.966 ± 0.241
2.635LeuTyr: 2.635 ± 0.326
0.0LeuXaa: 0.0 ± 0.0
Met
3.03MetAla: 3.03 ± 0.351
0.22MetCys: 0.22 ± 0.103
2.24MetAsp: 2.24 ± 0.271
1.845MetGlu: 1.845 ± 0.22
1.274MetPhe: 1.274 ± 0.225
1.889MetGly: 1.889 ± 0.274
0.659MetHis: 0.659 ± 0.215
1.405MetIle: 1.405 ± 0.275
1.054MetLys: 1.054 ± 0.226
2.943MetLeu: 2.943 ± 0.305
0.966MetMet: 0.966 ± 0.245
1.186MetAsn: 1.186 ± 0.224
1.757MetPro: 1.757 ± 0.337
1.493MetGln: 1.493 ± 0.246
1.625MetArg: 1.625 ± 0.268
2.416MetSer: 2.416 ± 0.377
2.284MetThr: 2.284 ± 0.295
1.493MetVal: 1.493 ± 0.349
0.264MetTrp: 0.264 ± 0.123
0.703MetTyr: 0.703 ± 0.157
0.0MetXaa: 0.0 ± 0.0
Asn
4.172AsnAla: 4.172 ± 0.49
0.351AsnCys: 0.351 ± 0.147
2.108AsnAsp: 2.108 ± 0.376
3.162AsnGlu: 3.162 ± 0.376
1.581AsnPhe: 1.581 ± 0.349
3.821AsnGly: 3.821 ± 0.522
0.966AsnHis: 0.966 ± 0.252
2.372AsnIle: 2.372 ± 0.289
2.679AsnLys: 2.679 ± 0.327
4.48AsnLeu: 4.48 ± 0.429
1.801AsnMet: 1.801 ± 0.286
2.064AsnAsn: 2.064 ± 0.416
2.591AsnPro: 2.591 ± 0.31
1.493AsnGln: 1.493 ± 0.274
2.372AsnArg: 2.372 ± 0.237
2.284AsnSer: 2.284 ± 0.295
3.03AsnThr: 3.03 ± 0.35
2.328AsnVal: 2.328 ± 0.318
0.615AsnTrp: 0.615 ± 0.177
1.186AsnTyr: 1.186 ± 0.22
0.0AsnXaa: 0.0 ± 0.0
Pro
3.47ProAla: 3.47 ± 0.461
0.264ProCys: 0.264 ± 0.105
2.24ProAsp: 2.24 ± 0.428
3.03ProGlu: 3.03 ± 0.393
1.976ProPhe: 1.976 ± 0.248
3.294ProGly: 3.294 ± 0.374
0.615ProHis: 0.615 ± 0.235
1.976ProIle: 1.976 ± 0.255
1.757ProLys: 1.757 ± 0.321
3.118ProLeu: 3.118 ± 0.318
1.23ProMet: 1.23 ± 0.264
1.889ProAsn: 1.889 ± 0.247
1.054ProPro: 1.054 ± 0.29
1.976ProGln: 1.976 ± 0.324
1.449ProArg: 1.449 ± 0.287
2.284ProSer: 2.284 ± 0.254
2.855ProThr: 2.855 ± 0.488
3.689ProVal: 3.689 ± 0.327
0.395ProTrp: 0.395 ± 0.132
1.318ProTyr: 1.318 ± 0.22
0.0ProXaa: 0.0 ± 0.0
Gln
4.787GlnAla: 4.787 ± 0.637
0.176GlnCys: 0.176 ± 0.098
2.503GlnAsp: 2.503 ± 0.404
3.865GlnGlu: 3.865 ± 0.459
1.625GlnPhe: 1.625 ± 0.248
2.284GlnGly: 2.284 ± 0.36
0.571GlnHis: 0.571 ± 0.143
2.328GlnIle: 2.328 ± 0.337
2.459GlnLys: 2.459 ± 0.331
4.392GlnLeu: 4.392 ± 0.502
1.449GlnMet: 1.449 ± 0.292
1.493GlnAsn: 1.493 ± 0.295
1.318GlnPro: 1.318 ± 0.219
2.064GlnGln: 2.064 ± 0.401
2.372GlnArg: 2.372 ± 0.408
1.932GlnSer: 1.932 ± 0.257
2.459GlnThr: 2.459 ± 0.416
4.041GlnVal: 4.041 ± 0.617
0.395GlnTrp: 0.395 ± 0.135
1.362GlnTyr: 1.362 ± 0.201
0.0GlnXaa: 0.0 ± 0.0
Arg
5.27ArgAla: 5.27 ± 0.578
0.527ArgCys: 0.527 ± 0.16
3.426ArgAsp: 3.426 ± 0.359
3.733ArgGlu: 3.733 ± 0.614
2.196ArgPhe: 2.196 ± 0.32
3.426ArgGly: 3.426 ± 0.43
1.01ArgHis: 1.01 ± 0.248
2.679ArgIle: 2.679 ± 0.333
3.03ArgLys: 3.03 ± 0.408
5.402ArgLeu: 5.402 ± 0.507
1.362ArgMet: 1.362 ± 0.166
2.591ArgAsn: 2.591 ± 0.311
1.976ArgPro: 1.976 ± 0.322
2.767ArgGln: 2.767 ± 0.417
3.118ArgArg: 3.118 ± 0.394
3.382ArgSer: 3.382 ± 0.379
2.503ArgThr: 2.503 ± 0.264
3.865ArgVal: 3.865 ± 0.429
0.747ArgTrp: 0.747 ± 0.212
1.449ArgTyr: 1.449 ± 0.224
0.0ArgXaa: 0.0 ± 0.0
Ser
4.436SerAla: 4.436 ± 0.46
0.351SerCys: 0.351 ± 0.13
3.733SerAsp: 3.733 ± 0.425
3.821SerGlu: 3.821 ± 0.38
1.976SerPhe: 1.976 ± 0.326
4.787SerGly: 4.787 ± 0.399
1.01SerHis: 1.01 ± 0.221
2.503SerIle: 2.503 ± 0.276
3.909SerLys: 3.909 ± 0.456
4.699SerLeu: 4.699 ± 0.49
2.064SerMet: 2.064 ± 0.27
2.767SerAsn: 2.767 ± 0.347
2.372SerPro: 2.372 ± 0.264
2.24SerGln: 2.24 ± 0.294
3.074SerArg: 3.074 ± 0.27
2.811SerSer: 2.811 ± 0.423
3.645SerThr: 3.645 ± 0.362
2.987SerVal: 2.987 ± 0.375
0.791SerTrp: 0.791 ± 0.189
2.152SerTyr: 2.152 ± 0.225
0.0SerXaa: 0.0 ± 0.0
Thr
5.051ThrAla: 5.051 ± 0.526
0.351ThrCys: 0.351 ± 0.13
2.943ThrAsp: 2.943 ± 0.299
4.392ThrGlu: 4.392 ± 0.415
2.02ThrPhe: 2.02 ± 0.332
4.172ThrGly: 4.172 ± 0.434
1.318ThrHis: 1.318 ± 0.261
3.47ThrIle: 3.47 ± 0.392
3.645ThrLys: 3.645 ± 0.417
4.392ThrLeu: 4.392 ± 0.446
1.801ThrMet: 1.801 ± 0.286
3.338ThrAsn: 3.338 ± 0.435
2.767ThrPro: 2.767 ± 0.307
2.24ThrGln: 2.24 ± 0.287
3.206ThrArg: 3.206 ± 0.433
2.547ThrSer: 2.547 ± 0.353
3.865ThrThr: 3.865 ± 0.456
4.436ThrVal: 4.436 ± 0.452
1.186ThrTrp: 1.186 ± 0.206
1.537ThrTyr: 1.537 ± 0.44
0.0ThrXaa: 0.0 ± 0.0
Val
6.456ValAla: 6.456 ± 0.825
0.834ValCys: 0.834 ± 0.219
4.568ValAsp: 4.568 ± 0.435
4.655ValGlu: 4.655 ± 0.423
2.416ValPhe: 2.416 ± 0.417
5.051ValGly: 5.051 ± 0.499
1.274ValHis: 1.274 ± 0.254
3.601ValIle: 3.601 ± 0.467
4.216ValLys: 4.216 ± 0.375
6.676ValLeu: 6.676 ± 0.622
2.284ValMet: 2.284 ± 0.367
3.557ValAsn: 3.557 ± 0.301
3.074ValPro: 3.074 ± 0.502
3.25ValGln: 3.25 ± 0.366
4.26ValArg: 4.26 ± 0.477
3.821ValSer: 3.821 ± 0.501
4.392ValThr: 4.392 ± 0.452
4.48ValVal: 4.48 ± 0.587
0.966ValTrp: 0.966 ± 0.184
2.24ValTyr: 2.24 ± 0.355
0.0ValXaa: 0.0 ± 0.0
Trp
1.054TrpAla: 1.054 ± 0.261
0.22TrpCys: 0.22 ± 0.107
0.615TrpAsp: 0.615 ± 0.165
0.878TrpGlu: 0.878 ± 0.182
0.571TrpPhe: 0.571 ± 0.159
0.791TrpGly: 0.791 ± 0.208
0.176TrpHis: 0.176 ± 0.101
0.615TrpIle: 0.615 ± 0.17
0.659TrpLys: 0.659 ± 0.168
1.186TrpLeu: 1.186 ± 0.298
0.395TrpMet: 0.395 ± 0.139
0.395TrpAsn: 0.395 ± 0.149
0.527TrpPro: 0.527 ± 0.151
0.571TrpGln: 0.571 ± 0.2
0.703TrpArg: 0.703 ± 0.149
0.791TrpSer: 0.791 ± 0.187
0.834TrpThr: 0.834 ± 0.187
1.362TrpVal: 1.362 ± 0.29
0.307TrpTrp: 0.307 ± 0.137
0.527TrpTyr: 0.527 ± 0.138
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.416TyrAla: 2.416 ± 0.319
0.307TyrCys: 0.307 ± 0.119
2.284TyrAsp: 2.284 ± 0.291
1.889TyrGlu: 1.889 ± 0.25
1.449TyrPhe: 1.449 ± 0.211
2.24TyrGly: 2.24 ± 0.398
0.615TyrHis: 0.615 ± 0.195
1.669TyrIle: 1.669 ± 0.402
1.757TyrLys: 1.757 ± 0.284
3.03TyrLeu: 3.03 ± 0.387
0.791TyrMet: 0.791 ± 0.206
1.493TyrAsn: 1.493 ± 0.242
1.23TyrPro: 1.23 ± 0.258
1.01TyrGln: 1.01 ± 0.225
1.976TyrArg: 1.976 ± 0.294
1.537TyrSer: 1.537 ± 0.18
2.328TyrThr: 2.328 ± 0.337
1.976TyrVal: 1.976 ± 0.297
0.527TyrTrp: 0.527 ± 0.165
1.186TyrTyr: 1.186 ± 0.234
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (22770 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski