Amino acid dipepetide frequency for Nodularia phage vB_NspS-kac68v161

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.845AlaAla: 5.845 ± 0.673
0.525AlaCys: 0.525 ± 0.111
4.11AlaAsp: 4.11 ± 0.278
5.343AlaGlu: 5.343 ± 0.522
2.306AlaPhe: 2.306 ± 0.208
4.612AlaGly: 4.612 ± 0.439
0.799AlaHis: 0.799 ± 0.154
5.617AlaIle: 5.617 ± 0.364
5.845AlaLys: 5.845 ± 0.538
6.485AlaLeu: 6.485 ± 0.434
1.644AlaMet: 1.644 ± 0.181
3.539AlaAsn: 3.539 ± 0.281
2.512AlaPro: 2.512 ± 0.311
3.197AlaGln: 3.197 ± 0.427
3.105AlaArg: 3.105 ± 0.274
4.749AlaSer: 4.749 ± 0.429
4.475AlaThr: 4.475 ± 0.421
4.727AlaVal: 4.727 ± 0.33
0.891AlaTrp: 0.891 ± 0.15
1.895AlaTyr: 1.895 ± 0.177
0.0AlaXaa: 0.0 ± 0.0
Cys
0.365CysAla: 0.365 ± 0.091
0.274CysCys: 0.274 ± 0.082
0.891CysAsp: 0.891 ± 0.144
0.868CysGlu: 0.868 ± 0.163
0.685CysPhe: 0.685 ± 0.154
0.913CysGly: 0.913 ± 0.172
0.343CysHis: 0.343 ± 0.096
0.502CysIle: 0.502 ± 0.106
0.936CysLys: 0.936 ± 0.163
1.096CysLeu: 1.096 ± 0.177
0.274CysMet: 0.274 ± 0.088
0.502CysAsn: 0.502 ± 0.117
0.548CysPro: 0.548 ± 0.114
0.548CysGln: 0.548 ± 0.103
0.365CysArg: 0.365 ± 0.088
0.845CysSer: 0.845 ± 0.159
0.434CysThr: 0.434 ± 0.106
0.639CysVal: 0.639 ± 0.143
0.206CysTrp: 0.206 ± 0.072
0.571CysTyr: 0.571 ± 0.13
0.0CysXaa: 0.0 ± 0.0
Asp
3.836AspAla: 3.836 ± 0.327
0.662AspCys: 0.662 ± 0.138
3.425AspAsp: 3.425 ± 0.278
3.379AspGlu: 3.379 ± 0.274
2.854AspPhe: 2.854 ± 0.262
3.905AspGly: 3.905 ± 0.364
0.731AspHis: 0.731 ± 0.144
4.042AspIle: 4.042 ± 0.335
4.795AspLys: 4.795 ± 0.388
5.96AspLeu: 5.96 ± 0.39
0.936AspMet: 0.936 ± 0.176
2.809AspAsn: 2.809 ± 0.261
2.443AspPro: 2.443 ± 0.35
2.192AspGln: 2.192 ± 0.329
3.448AspArg: 3.448 ± 0.275
3.836AspSer: 3.836 ± 0.276
3.06AspThr: 3.06 ± 0.254
3.836AspVal: 3.836 ± 0.311
0.891AspTrp: 0.891 ± 0.153
2.717AspTyr: 2.717 ± 0.245
0.0AspXaa: 0.0 ± 0.0
Glu
5.434GluAla: 5.434 ± 0.537
0.776GluCys: 0.776 ± 0.137
3.471GluAsp: 3.471 ± 0.34
3.676GluGlu: 3.676 ± 0.435
2.923GluPhe: 2.923 ± 0.305
3.014GluGly: 3.014 ± 0.281
0.845GluHis: 0.845 ± 0.167
4.978GluIle: 4.978 ± 0.383
4.224GluLys: 4.224 ± 0.383
7.741GluLeu: 7.741 ± 0.385
1.507GluMet: 1.507 ± 0.228
2.991GluAsn: 2.991 ± 0.296
2.786GluPro: 2.786 ± 0.339
2.831GluGln: 2.831 ± 0.349
3.471GluArg: 3.471 ± 0.319
4.384GluSer: 4.384 ± 0.303
3.836GluThr: 3.836 ± 0.415
5.023GluVal: 5.023 ± 0.378
0.639GluTrp: 0.639 ± 0.129
2.9GluTyr: 2.9 ± 0.28
0.0GluXaa: 0.0 ± 0.0
Phe
1.941PheAla: 1.941 ± 0.211
0.457PheCys: 0.457 ± 0.107
2.763PheAsp: 2.763 ± 0.295
2.169PheGlu: 2.169 ± 0.232
1.279PhePhe: 1.279 ± 0.184
2.443PheGly: 2.443 ± 0.316
0.502PheHis: 0.502 ± 0.124
2.535PheIle: 2.535 ± 0.29
2.946PheLys: 2.946 ± 0.314
3.151PheLeu: 3.151 ± 0.292
0.891PheMet: 0.891 ± 0.123
2.489PheAsn: 2.489 ± 0.239
1.735PhePro: 1.735 ± 0.222
1.735PheGln: 1.735 ± 0.196
1.598PheArg: 1.598 ± 0.186
3.128PheSer: 3.128 ± 0.262
3.037PheThr: 3.037 ± 0.277
2.055PheVal: 2.055 ± 0.234
0.343PheTrp: 0.343 ± 0.094
1.484PheTyr: 1.484 ± 0.21
0.0PheXaa: 0.0 ± 0.0
Gly
3.768GlyAla: 3.768 ± 0.344
1.005GlyCys: 1.005 ± 0.168
4.042GlyAsp: 4.042 ± 0.304
4.179GlyGlu: 4.179 ± 0.361
2.923GlyPhe: 2.923 ± 0.292
3.973GlyGly: 3.973 ± 0.325
0.982GlyHis: 0.982 ± 0.167
3.996GlyIle: 3.996 ± 0.293
4.612GlyLys: 4.612 ± 0.395
5.777GlyLeu: 5.777 ± 0.367
1.279GlyMet: 1.279 ± 0.194
3.151GlyAsn: 3.151 ± 0.262
0.0GlyPro: 0.0 ± 0.0
2.261GlyGln: 2.261 ± 0.232
2.786GlyArg: 2.786 ± 0.293
4.681GlySer: 4.681 ± 0.322
3.653GlyThr: 3.653 ± 0.388
3.996GlyVal: 3.996 ± 0.334
1.05GlyTrp: 1.05 ± 0.17
2.717GlyTyr: 2.717 ± 0.269
0.0GlyXaa: 0.0 ± 0.0
His
0.776HisAla: 0.776 ± 0.137
0.365HisCys: 0.365 ± 0.089
0.662HisAsp: 0.662 ± 0.133
0.845HisGlu: 0.845 ± 0.144
0.685HisPhe: 0.685 ± 0.131
0.639HisGly: 0.639 ± 0.119
0.343HisHis: 0.343 ± 0.098
0.799HisIle: 0.799 ± 0.169
1.302HisLys: 1.302 ± 0.209
1.598HisLeu: 1.598 ± 0.221
0.091HisMet: 0.091 ± 0.041
0.776HisAsn: 0.776 ± 0.115
0.868HisPro: 0.868 ± 0.18
0.868HisGln: 0.868 ± 0.156
1.005HisArg: 1.005 ± 0.163
0.891HisSer: 0.891 ± 0.189
0.822HisThr: 0.822 ± 0.146
0.685HisVal: 0.685 ± 0.138
0.16HisTrp: 0.16 ± 0.073
0.913HisTyr: 0.913 ± 0.124
0.0HisXaa: 0.0 ± 0.0
Ile
6.462IleAla: 6.462 ± 0.365
0.457IleCys: 0.457 ± 0.103
4.224IleAsp: 4.224 ± 0.313
4.293IleGlu: 4.293 ± 0.312
2.009IlePhe: 2.009 ± 0.241
3.539IleGly: 3.539 ± 0.323
0.868IleHis: 0.868 ± 0.172
2.786IleIle: 2.786 ± 0.276
5.16IleLys: 5.16 ± 0.345
4.361IleLeu: 4.361 ± 0.371
0.845IleMet: 0.845 ± 0.148
3.448IleAsn: 3.448 ± 0.313
2.557IlePro: 2.557 ± 0.284
2.557IleGln: 2.557 ± 0.313
2.9IleArg: 2.9 ± 0.262
4.727IleSer: 4.727 ± 0.445
4.247IleThr: 4.247 ± 0.433
3.448IleVal: 3.448 ± 0.281
0.502IleTrp: 0.502 ± 0.116
2.283IleTyr: 2.283 ± 0.253
0.0IleXaa: 0.0 ± 0.0
Lys
5.32LysAla: 5.32 ± 0.385
0.776LysCys: 0.776 ± 0.139
4.042LysAsp: 4.042 ± 0.473
5.001LysGlu: 5.001 ± 0.36
2.809LysPhe: 2.809 ± 0.276
4.019LysGly: 4.019 ± 0.347
1.119LysHis: 1.119 ± 0.187
4.156LysIle: 4.156 ± 0.292
5.389LysLys: 5.389 ± 0.547
8.06LysLeu: 8.06 ± 0.583
1.279LysMet: 1.279 ± 0.196
2.991LysAsn: 2.991 ± 0.278
3.973LysPro: 3.973 ± 0.327
3.905LysGln: 3.905 ± 0.422
3.905LysArg: 3.905 ± 0.395
5.549LysSer: 5.549 ± 0.344
4.338LysThr: 4.338 ± 0.456
5.297LysVal: 5.297 ± 0.394
0.685LysTrp: 0.685 ± 0.116
2.238LysTyr: 2.238 ± 0.312
0.0LysXaa: 0.0 ± 0.0
Leu
7.786LeuAla: 7.786 ± 0.576
0.822LeuCys: 0.822 ± 0.162
5.891LeuAsp: 5.891 ± 0.385
6.667LeuGlu: 6.667 ± 0.38
2.991LeuPhe: 2.991 ± 0.215
5.686LeuGly: 5.686 ± 0.418
1.461LeuHis: 1.461 ± 0.218
5.32LeuIle: 5.32 ± 0.362
6.439LeuLys: 6.439 ± 0.477
8.106LeuLeu: 8.106 ± 0.609
1.233LeuMet: 1.233 ± 0.178
5.229LeuAsn: 5.229 ± 0.461
4.475LeuPro: 4.475 ± 0.313
4.544LeuGln: 4.544 ± 0.452
4.11LeuArg: 4.11 ± 0.373
6.576LeuSer: 6.576 ± 0.361
6.256LeuThr: 6.256 ± 0.401
5.434LeuVal: 5.434 ± 0.399
0.936LeuTrp: 0.936 ± 0.186
2.672LeuTyr: 2.672 ± 0.279
0.0LeuXaa: 0.0 ± 0.0
Met
1.21MetAla: 1.21 ± 0.171
0.502MetCys: 0.502 ± 0.137
0.822MetAsp: 0.822 ± 0.129
1.028MetGlu: 1.028 ± 0.148
0.639MetPhe: 0.639 ± 0.118
1.05MetGly: 1.05 ± 0.146
0.206MetHis: 0.206 ± 0.079
1.279MetIle: 1.279 ± 0.166
1.302MetLys: 1.302 ± 0.223
1.758MetLeu: 1.758 ± 0.199
0.32MetMet: 0.32 ± 0.091
0.754MetAsn: 0.754 ± 0.138
0.799MetPro: 0.799 ± 0.144
0.822MetGln: 0.822 ± 0.119
0.776MetArg: 0.776 ± 0.153
1.713MetSer: 1.713 ± 0.231
1.279MetThr: 1.279 ± 0.166
0.982MetVal: 0.982 ± 0.143
0.091MetTrp: 0.091 ± 0.041
0.685MetTyr: 0.685 ± 0.126
0.0MetXaa: 0.0 ± 0.0
Asn
3.448AsnAla: 3.448 ± 0.243
1.028AsnCys: 1.028 ± 0.19
2.375AsnAsp: 2.375 ± 0.238
2.831AsnGlu: 2.831 ± 0.246
2.192AsnPhe: 2.192 ± 0.248
3.471AsnGly: 3.471 ± 0.356
0.754AsnHis: 0.754 ± 0.168
3.037AsnIle: 3.037 ± 0.279
3.745AsnLys: 3.745 ± 0.287
4.704AsnLeu: 4.704 ± 0.377
0.868AsnMet: 0.868 ± 0.159
2.603AsnAsn: 2.603 ± 0.269
3.06AsnPro: 3.06 ± 0.252
2.466AsnGln: 2.466 ± 0.302
2.124AsnArg: 2.124 ± 0.212
3.288AsnSer: 3.288 ± 0.39
2.603AsnThr: 2.603 ± 0.277
2.512AsnVal: 2.512 ± 0.231
0.617AsnTrp: 0.617 ± 0.119
2.078AsnTyr: 2.078 ± 0.266
0.0AsnXaa: 0.0 ± 0.0
Pro
2.74ProAla: 2.74 ± 0.294
0.48ProCys: 0.48 ± 0.11
3.151ProAsp: 3.151 ± 0.313
3.836ProGlu: 3.836 ± 0.412
1.439ProPhe: 1.439 ± 0.204
2.557ProGly: 2.557 ± 0.279
0.822ProHis: 0.822 ± 0.155
1.804ProIle: 1.804 ± 0.2
3.379ProLys: 3.379 ± 0.3
2.786ProLeu: 2.786 ± 0.256
0.617ProMet: 0.617 ± 0.147
2.078ProAsn: 2.078 ± 0.219
1.553ProPro: 1.553 ± 0.27
1.964ProGln: 1.964 ± 0.201
1.233ProArg: 1.233 ± 0.147
2.946ProSer: 2.946 ± 0.277
3.265ProThr: 3.265 ± 0.336
2.877ProVal: 2.877 ± 0.267
0.32ProTrp: 0.32 ± 0.084
1.233ProTyr: 1.233 ± 0.201
0.0ProXaa: 0.0 ± 0.0
Gln
3.516GlnAla: 3.516 ± 0.417
0.365GlnCys: 0.365 ± 0.097
2.078GlnAsp: 2.078 ± 0.205
3.676GlnGlu: 3.676 ± 0.391
1.735GlnPhe: 1.735 ± 0.245
2.238GlnGly: 2.238 ± 0.256
0.708GlnHis: 0.708 ± 0.12
3.653GlnIle: 3.653 ± 0.32
2.603GlnLys: 2.603 ± 0.291
5.023GlnLeu: 5.023 ± 0.608
0.891GlnMet: 0.891 ± 0.16
1.69GlnAsn: 1.69 ± 0.202
2.375GlnPro: 2.375 ± 0.311
3.174GlnGln: 3.174 ± 0.482
1.895GlnArg: 1.895 ± 0.219
3.22GlnSer: 3.22 ± 0.437
2.512GlnThr: 2.512 ± 0.269
3.174GlnVal: 3.174 ± 0.285
0.388GlnTrp: 0.388 ± 0.1
1.119GlnTyr: 1.119 ± 0.148
0.0GlnXaa: 0.0 ± 0.0
Arg
2.694ArgAla: 2.694 ± 0.225
0.48ArgCys: 0.48 ± 0.095
2.763ArgAsp: 2.763 ± 0.248
3.676ArgGlu: 3.676 ± 0.307
1.987ArgPhe: 1.987 ± 0.216
2.535ArgGly: 2.535 ± 0.273
0.822ArgHis: 0.822 ± 0.167
2.489ArgIle: 2.489 ± 0.229
3.722ArgLys: 3.722 ± 0.335
5.092ArgLeu: 5.092 ± 0.435
1.005ArgMet: 1.005 ± 0.178
2.329ArgAsn: 2.329 ± 0.227
1.233ArgPro: 1.233 ± 0.167
2.192ArgGln: 2.192 ± 0.225
2.512ArgArg: 2.512 ± 0.237
3.745ArgSer: 3.745 ± 0.292
2.398ArgThr: 2.398 ± 0.213
3.539ArgVal: 3.539 ± 0.304
0.502ArgTrp: 0.502 ± 0.121
1.621ArgTyr: 1.621 ± 0.193
0.0ArgXaa: 0.0 ± 0.0
Ser
4.498SerAla: 4.498 ± 0.332
0.502SerCys: 0.502 ± 0.109
5.092SerAsp: 5.092 ± 0.477
4.567SerGlu: 4.567 ± 0.339
2.991SerPhe: 2.991 ± 0.288
5.48SerGly: 5.48 ± 0.437
1.187SerHis: 1.187 ± 0.178
3.95SerIle: 3.95 ± 0.329
5.526SerLys: 5.526 ± 0.453
6.69SerLeu: 6.69 ± 0.471
1.142SerMet: 1.142 ± 0.183
3.768SerAsn: 3.768 ± 0.315
3.014SerPro: 3.014 ± 0.24
3.288SerGln: 3.288 ± 0.463
3.562SerArg: 3.562 ± 0.323
4.864SerSer: 4.864 ± 0.51
4.316SerThr: 4.316 ± 0.351
4.293SerVal: 4.293 ± 0.34
0.571SerTrp: 0.571 ± 0.136
1.987SerTyr: 1.987 ± 0.255
0.0SerXaa: 0.0 ± 0.0
Thr
5.16ThrAla: 5.16 ± 0.388
0.685ThrCys: 0.685 ± 0.118
3.311ThrAsp: 3.311 ± 0.327
3.927ThrGlu: 3.927 ± 0.345
2.101ThrPhe: 2.101 ± 0.278
3.768ThrGly: 3.768 ± 0.298
1.028ThrHis: 1.028 ± 0.16
4.201ThrIle: 4.201 ± 0.29
4.727ThrLys: 4.727 ± 0.35
5.297ThrLeu: 5.297 ± 0.348
0.708ThrMet: 0.708 ± 0.137
2.968ThrAsn: 2.968 ± 0.267
3.334ThrPro: 3.334 ± 0.315
2.877ThrGln: 2.877 ± 0.299
2.786ThrArg: 2.786 ± 0.273
4.224ThrSer: 4.224 ± 0.375
4.407ThrThr: 4.407 ± 0.634
4.019ThrVal: 4.019 ± 0.367
0.48ThrTrp: 0.48 ± 0.1
2.169ThrTyr: 2.169 ± 0.225
0.0ThrXaa: 0.0 ± 0.0
Val
5.138ValAla: 5.138 ± 0.311
0.822ValCys: 0.822 ± 0.15
4.019ValAsp: 4.019 ± 0.281
5.046ValGlu: 5.046 ± 0.399
2.146ValPhe: 2.146 ± 0.243
3.745ValGly: 3.745 ± 0.382
0.754ValHis: 0.754 ± 0.165
3.996ValIle: 3.996 ± 0.519
4.475ValLys: 4.475 ± 0.412
4.59ValLeu: 4.59 ± 0.343
1.393ValMet: 1.393 ± 0.204
3.471ValAsn: 3.471 ± 0.319
2.238ValPro: 2.238 ± 0.245
2.512ValGln: 2.512 ± 0.244
3.197ValArg: 3.197 ± 0.292
4.59ValSer: 4.59 ± 0.33
4.475ValThr: 4.475 ± 0.382
3.927ValVal: 3.927 ± 0.3
0.708ValTrp: 0.708 ± 0.156
2.055ValTyr: 2.055 ± 0.228
0.0ValXaa: 0.0 ± 0.0
Trp
0.502TrpAla: 0.502 ± 0.129
0.251TrpCys: 0.251 ± 0.082
0.891TrpAsp: 0.891 ± 0.164
0.731TrpGlu: 0.731 ± 0.149
0.388TrpPhe: 0.388 ± 0.089
0.845TrpGly: 0.845 ± 0.136
0.16TrpHis: 0.16 ± 0.06
0.434TrpIle: 0.434 ± 0.092
0.868TrpLys: 0.868 ± 0.158
1.05TrpLeu: 1.05 ± 0.176
0.32TrpMet: 0.32 ± 0.1
0.457TrpAsn: 0.457 ± 0.1
0.046TrpPro: 0.046 ± 0.031
0.48TrpGln: 0.48 ± 0.122
0.708TrpArg: 0.708 ± 0.138
0.639TrpSer: 0.639 ± 0.138
0.365TrpThr: 0.365 ± 0.088
1.073TrpVal: 1.073 ± 0.166
0.251TrpTrp: 0.251 ± 0.082
0.411TrpTyr: 0.411 ± 0.108
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.781TyrAla: 1.781 ± 0.188
0.571TyrCys: 0.571 ± 0.108
1.735TyrAsp: 1.735 ± 0.193
1.713TyrGlu: 1.713 ± 0.183
1.598TyrPhe: 1.598 ± 0.243
2.42TyrGly: 2.42 ± 0.312
0.662TyrHis: 0.662 ± 0.129
2.101TyrIle: 2.101 ± 0.296
2.854TyrLys: 2.854 ± 0.362
3.311TyrLeu: 3.311 ± 0.343
0.754TyrMet: 0.754 ± 0.136
1.758TyrAsn: 1.758 ± 0.191
1.53TyrPro: 1.53 ± 0.204
1.621TyrGln: 1.621 ± 0.215
1.85TyrArg: 1.85 ± 0.237
2.694TyrSer: 2.694 ± 0.268
2.398TyrThr: 2.398 ± 0.217
1.758TyrVal: 1.758 ± 0.254
0.639TyrTrp: 0.639 ± 0.123
1.37TyrTyr: 1.37 ± 0.175
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 198 proteins (43796 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski