Amino acid dipepetide frequency for Lactococcus virus KSY1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.194AlaAla: 5.194 ± 0.936
0.528AlaCys: 0.528 ± 0.153
3.571AlaAsp: 3.571 ± 0.428
3.206AlaGlu: 3.206 ± 0.452
2.516AlaPhe: 2.516 ± 0.347
3.774AlaGly: 3.774 ± 0.587
0.974AlaHis: 0.974 ± 0.24
4.383AlaIle: 4.383 ± 0.398
5.397AlaLys: 5.397 ± 0.414
5.154AlaLeu: 5.154 ± 0.457
1.826AlaMet: 1.826 ± 0.301
4.626AlaAsn: 4.626 ± 0.599
1.542AlaPro: 1.542 ± 0.255
2.8AlaGln: 2.8 ± 0.475
2.354AlaArg: 2.354 ± 0.374
3.531AlaSer: 3.531 ± 0.418
3.896AlaThr: 3.896 ± 0.399
4.342AlaVal: 4.342 ± 0.414
0.933AlaTrp: 0.933 ± 0.168
3.774AlaTyr: 3.774 ± 0.411
0.0AlaXaa: 0.0 ± 0.0
Cys
0.528CysAla: 0.528 ± 0.176
0.041CysCys: 0.041 ± 0.044
0.609CysAsp: 0.609 ± 0.135
0.568CysGlu: 0.568 ± 0.144
0.325CysPhe: 0.325 ± 0.106
0.487CysGly: 0.487 ± 0.127
0.122CysHis: 0.122 ± 0.071
0.609CysIle: 0.609 ± 0.181
0.69CysLys: 0.69 ± 0.173
0.325CysLeu: 0.325 ± 0.103
0.284CysMet: 0.284 ± 0.095
0.649CysAsn: 0.649 ± 0.148
0.203CysPro: 0.203 ± 0.086
0.203CysGln: 0.203 ± 0.091
0.284CysArg: 0.284 ± 0.108
0.568CysSer: 0.568 ± 0.132
0.243CysThr: 0.243 ± 0.101
0.609CysVal: 0.609 ± 0.138
0.081CysTrp: 0.081 ± 0.054
0.69CysTyr: 0.69 ± 0.18
0.0CysXaa: 0.0 ± 0.0
Asp
4.139AspAla: 4.139 ± 0.52
0.69AspCys: 0.69 ± 0.155
4.139AspAsp: 4.139 ± 0.385
4.707AspGlu: 4.707 ± 0.501
3.044AspPhe: 3.044 ± 0.281
3.449AspGly: 3.449 ± 0.361
0.933AspHis: 0.933 ± 0.191
5.235AspIle: 5.235 ± 0.433
5.154AspLys: 5.154 ± 0.421
5.073AspLeu: 5.073 ± 0.524
2.394AspMet: 2.394 ± 0.28
4.505AspAsn: 4.505 ± 0.361
1.988AspPro: 1.988 ± 0.241
1.542AspGln: 1.542 ± 0.222
2.557AspArg: 2.557 ± 0.402
5.56AspSer: 5.56 ± 0.515
3.693AspThr: 3.693 ± 0.368
3.693AspVal: 3.693 ± 0.378
1.136AspTrp: 1.136 ± 0.256
4.505AspTyr: 4.505 ± 0.428
0.0AspXaa: 0.0 ± 0.0
Glu
4.667GluAla: 4.667 ± 0.485
0.528GluCys: 0.528 ± 0.116
4.667GluAsp: 4.667 ± 0.522
5.6GluGlu: 5.6 ± 0.603
2.354GluPhe: 2.354 ± 0.365
3.612GluGly: 3.612 ± 0.456
1.136GluHis: 1.136 ± 0.225
3.855GluIle: 3.855 ± 0.359
3.612GluLys: 3.612 ± 0.431
6.331GluLeu: 6.331 ± 0.508
1.786GluMet: 1.786 ± 0.289
3.936GluAsn: 3.936 ± 0.429
1.745GluPro: 1.745 ± 0.25
2.557GluGln: 2.557 ± 0.295
2.191GluArg: 2.191 ± 0.284
3.774GluSer: 3.774 ± 0.368
3.733GluThr: 3.733 ± 0.424
4.626GluVal: 4.626 ± 0.485
0.812GluTrp: 0.812 ± 0.181
2.475GluTyr: 2.475 ± 0.272
0.0GluXaa: 0.0 ± 0.0
Phe
2.394PheAla: 2.394 ± 0.296
0.365PheCys: 0.365 ± 0.102
3.206PheAsp: 3.206 ± 0.296
2.516PheGlu: 2.516 ± 0.317
1.258PhePhe: 1.258 ± 0.253
2.07PheGly: 2.07 ± 0.321
0.771PheHis: 0.771 ± 0.15
2.557PheIle: 2.557 ± 0.302
3.165PheLys: 3.165 ± 0.349
2.435PheLeu: 2.435 ± 0.301
1.217PheMet: 1.217 ± 0.211
2.719PheAsn: 2.719 ± 0.342
1.542PhePro: 1.542 ± 0.276
1.177PheGln: 1.177 ± 0.17
1.136PheArg: 1.136 ± 0.196
2.719PheSer: 2.719 ± 0.303
2.76PheThr: 2.76 ± 0.374
2.354PheVal: 2.354 ± 0.322
0.528PheTrp: 0.528 ± 0.144
1.38PheTyr: 1.38 ± 0.224
0.0PheXaa: 0.0 ± 0.0
Gly
4.139GlyAla: 4.139 ± 0.577
0.365GlyCys: 0.365 ± 0.122
3.531GlyAsp: 3.531 ± 0.41
3.246GlyGlu: 3.246 ± 0.374
2.354GlyPhe: 2.354 ± 0.36
4.505GlyGly: 4.505 ± 0.614
1.096GlyHis: 1.096 ± 0.217
4.22GlyIle: 4.22 ± 0.447
4.829GlyLys: 4.829 ± 0.351
4.586GlyLeu: 4.586 ± 0.319
2.232GlyMet: 2.232 ± 0.265
3.409GlyAsn: 3.409 ± 0.384
1.055GlyPro: 1.055 ± 0.205
2.313GlyGln: 2.313 ± 0.347
2.435GlyArg: 2.435 ± 0.323
4.261GlySer: 4.261 ± 0.479
4.018GlyThr: 4.018 ± 0.365
4.505GlyVal: 4.505 ± 0.356
1.136GlyTrp: 1.136 ± 0.332
2.922GlyTyr: 2.922 ± 0.467
0.0GlyXaa: 0.0 ± 0.0
His
0.852HisAla: 0.852 ± 0.161
0.162HisCys: 0.162 ± 0.083
1.664HisAsp: 1.664 ± 0.294
0.974HisGlu: 0.974 ± 0.193
0.852HisPhe: 0.852 ± 0.219
0.73HisGly: 0.73 ± 0.185
0.243HisHis: 0.243 ± 0.095
1.258HisIle: 1.258 ± 0.259
1.136HisLys: 1.136 ± 0.208
1.461HisLeu: 1.461 ± 0.237
0.365HisMet: 0.365 ± 0.14
1.217HisAsn: 1.217 ± 0.221
0.69HisPro: 0.69 ± 0.142
0.487HisGln: 0.487 ± 0.165
0.852HisArg: 0.852 ± 0.18
1.502HisSer: 1.502 ± 0.254
1.177HisThr: 1.177 ± 0.198
0.893HisVal: 0.893 ± 0.191
0.203HisTrp: 0.203 ± 0.088
0.933HisTyr: 0.933 ± 0.215
0.0HisXaa: 0.0 ± 0.0
Ile
4.951IleAla: 4.951 ± 0.378
0.487IleCys: 0.487 ± 0.146
5.6IleAsp: 5.6 ± 0.446
4.87IleGlu: 4.87 ± 0.475
2.719IlePhe: 2.719 ± 0.369
3.815IleGly: 3.815 ± 0.641
1.299IleHis: 1.299 ± 0.286
4.22IleIle: 4.22 ± 0.548
5.113IleLys: 5.113 ± 0.484
5.032IleLeu: 5.032 ± 0.461
1.704IleMet: 1.704 ± 0.243
5.032IleAsn: 5.032 ± 0.468
2.922IlePro: 2.922 ± 0.291
2.8IleGln: 2.8 ± 0.336
2.394IleArg: 2.394 ± 0.323
5.519IleSer: 5.519 ± 0.509
4.789IleThr: 4.789 ± 0.304
4.464IleVal: 4.464 ± 0.396
0.365IleTrp: 0.365 ± 0.108
2.11IleTyr: 2.11 ± 0.263
0.0IleXaa: 0.0 ± 0.0
Lys
4.261LysAla: 4.261 ± 0.414
0.649LysCys: 0.649 ± 0.17
6.006LysAsp: 6.006 ± 0.556
5.56LysGlu: 5.56 ± 0.555
2.962LysPhe: 2.962 ± 0.377
4.22LysGly: 4.22 ± 0.383
1.704LysHis: 1.704 ± 0.221
4.991LysIle: 4.991 ± 0.43
4.626LysLys: 4.626 ± 0.586
5.722LysLeu: 5.722 ± 0.639
2.8LysMet: 2.8 ± 0.364
3.612LysAsn: 3.612 ± 0.399
2.394LysPro: 2.394 ± 0.287
2.313LysGln: 2.313 ± 0.325
2.881LysArg: 2.881 ± 0.414
4.91LysSer: 4.91 ± 0.428
3.977LysThr: 3.977 ± 0.402
5.235LysVal: 5.235 ± 0.414
0.893LysTrp: 0.893 ± 0.183
2.841LysTyr: 2.841 ± 0.392
0.0LysXaa: 0.0 ± 0.0
Leu
4.829LeuAla: 4.829 ± 0.478
0.406LeuCys: 0.406 ± 0.128
5.763LeuAsp: 5.763 ± 0.464
4.423LeuGlu: 4.423 ± 0.462
2.354LeuPhe: 2.354 ± 0.319
4.748LeuGly: 4.748 ± 0.469
1.339LeuHis: 1.339 ± 0.246
5.763LeuIle: 5.763 ± 0.462
6.128LeuLys: 6.128 ± 0.612
5.844LeuLeu: 5.844 ± 0.592
2.313LeuMet: 2.313 ± 0.291
4.139LeuAsn: 4.139 ± 0.393
3.125LeuPro: 3.125 ± 0.436
2.638LeuGln: 2.638 ± 0.351
3.246LeuArg: 3.246 ± 0.292
6.615LeuSer: 6.615 ± 0.592
5.357LeuThr: 5.357 ± 0.484
4.991LeuVal: 4.991 ± 0.568
1.136LeuTrp: 1.136 ± 0.222
2.841LeuTyr: 2.841 ± 0.303
0.0LeuXaa: 0.0 ± 0.0
Met
1.826MetAla: 1.826 ± 0.253
0.203MetCys: 0.203 ± 0.085
1.542MetAsp: 1.542 ± 0.287
1.42MetGlu: 1.42 ± 0.251
1.217MetPhe: 1.217 ± 0.188
1.299MetGly: 1.299 ± 0.288
0.446MetHis: 0.446 ± 0.151
2.07MetIle: 2.07 ± 0.297
2.516MetLys: 2.516 ± 0.349
1.988MetLeu: 1.988 ± 0.336
1.136MetMet: 1.136 ± 0.235
2.273MetAsn: 2.273 ± 0.335
1.461MetPro: 1.461 ± 0.242
1.258MetGln: 1.258 ± 0.197
1.38MetArg: 1.38 ± 0.247
1.826MetSer: 1.826 ± 0.312
2.557MetThr: 2.557 ± 0.336
1.502MetVal: 1.502 ± 0.264
0.203MetTrp: 0.203 ± 0.086
1.177MetTyr: 1.177 ± 0.219
0.0MetXaa: 0.0 ± 0.0
Asn
4.667AsnAla: 4.667 ± 0.551
0.649AsnCys: 0.649 ± 0.188
3.449AsnAsp: 3.449 ± 0.401
3.733AsnGlu: 3.733 ± 0.399
2.191AsnPhe: 2.191 ± 0.301
4.505AsnGly: 4.505 ± 0.465
0.933AsnHis: 0.933 ± 0.225
4.626AsnIle: 4.626 ± 0.435
4.829AsnLys: 4.829 ± 0.47
5.073AsnLeu: 5.073 ± 0.398
2.273AsnMet: 2.273 ± 0.29
4.383AsnAsn: 4.383 ± 0.481
2.232AsnPro: 2.232 ± 0.328
1.988AsnGln: 1.988 ± 0.253
1.745AsnArg: 1.745 ± 0.207
4.545AsnSer: 4.545 ± 0.497
3.936AsnThr: 3.936 ± 0.409
3.449AsnVal: 3.449 ± 0.348
0.609AsnTrp: 0.609 ± 0.15
2.719AsnTyr: 2.719 ± 0.418
0.0AsnXaa: 0.0 ± 0.0
Pro
2.313ProAla: 2.313 ± 0.428
0.243ProCys: 0.243 ± 0.096
2.232ProAsp: 2.232 ± 0.237
2.557ProGlu: 2.557 ± 0.305
1.826ProPhe: 1.826 ± 0.279
2.394ProGly: 2.394 ± 0.312
0.771ProHis: 0.771 ± 0.178
2.678ProIle: 2.678 ± 0.347
2.394ProLys: 2.394 ± 0.327
2.232ProLeu: 2.232 ± 0.3
0.771ProMet: 0.771 ± 0.171
2.029ProAsn: 2.029 ± 0.297
0.771ProPro: 0.771 ± 0.156
1.096ProGln: 1.096 ± 0.183
1.299ProArg: 1.299 ± 0.267
2.475ProSer: 2.475 ± 0.34
2.638ProThr: 2.638 ± 0.479
2.557ProVal: 2.557 ± 0.315
0.243ProTrp: 0.243 ± 0.089
1.502ProTyr: 1.502 ± 0.253
0.0ProXaa: 0.0 ± 0.0
Gln
2.273GlnAla: 2.273 ± 0.4
0.325GlnCys: 0.325 ± 0.123
2.638GlnAsp: 2.638 ± 0.27
1.826GlnGlu: 1.826 ± 0.26
1.177GlnPhe: 1.177 ± 0.247
2.394GlnGly: 2.394 ± 0.349
0.528GlnHis: 0.528 ± 0.14
2.07GlnIle: 2.07 ± 0.294
2.354GlnLys: 2.354 ± 0.338
3.774GlnLeu: 3.774 ± 0.363
0.933GlnMet: 0.933 ± 0.195
1.623GlnAsn: 1.623 ± 0.291
1.907GlnPro: 1.907 ± 0.289
1.907GlnGln: 1.907 ± 0.291
1.055GlnArg: 1.055 ± 0.18
2.354GlnSer: 2.354 ± 0.366
1.704GlnThr: 1.704 ± 0.276
2.273GlnVal: 2.273 ± 0.306
0.609GlnTrp: 0.609 ± 0.166
1.339GlnTyr: 1.339 ± 0.202
0.0GlnXaa: 0.0 ± 0.0
Arg
2.273ArgAla: 2.273 ± 0.311
0.203ArgCys: 0.203 ± 0.076
2.557ArgAsp: 2.557 ± 0.317
2.881ArgGlu: 2.881 ± 0.315
1.826ArgPhe: 1.826 ± 0.312
1.988ArgGly: 1.988 ± 0.334
0.528ArgHis: 0.528 ± 0.132
3.003ArgIle: 3.003 ± 0.327
2.922ArgLys: 2.922 ± 0.356
3.084ArgLeu: 3.084 ± 0.295
0.812ArgMet: 0.812 ± 0.165
1.948ArgAsn: 1.948 ± 0.282
1.299ArgPro: 1.299 ± 0.226
0.933ArgGln: 0.933 ± 0.167
1.42ArgArg: 1.42 ± 0.233
2.435ArgSer: 2.435 ± 0.335
2.191ArgThr: 2.191 ± 0.346
2.232ArgVal: 2.232 ± 0.387
0.284ArgTrp: 0.284 ± 0.123
2.07ArgTyr: 2.07 ± 0.25
0.0ArgXaa: 0.0 ± 0.0
Ser
4.464SerAla: 4.464 ± 0.508
0.487SerCys: 0.487 ± 0.134
4.87SerAsp: 4.87 ± 0.507
4.626SerGlu: 4.626 ± 0.45
2.557SerPhe: 2.557 ± 0.304
4.667SerGly: 4.667 ± 0.449
1.258SerHis: 1.258 ± 0.248
5.438SerIle: 5.438 ± 0.532
5.763SerLys: 5.763 ± 0.481
5.397SerLeu: 5.397 ± 0.427
1.704SerMet: 1.704 ± 0.273
5.032SerAsn: 5.032 ± 0.437
2.435SerPro: 2.435 ± 0.385
2.597SerGln: 2.597 ± 0.367
2.76SerArg: 2.76 ± 0.35
5.397SerSer: 5.397 ± 0.425
5.113SerThr: 5.113 ± 0.449
4.018SerVal: 4.018 ± 0.501
0.974SerTrp: 0.974 ± 0.178
3.165SerTyr: 3.165 ± 0.36
0.0SerXaa: 0.0 ± 0.0
Thr
3.287ThrAla: 3.287 ± 0.376
0.487ThrCys: 0.487 ± 0.14
4.22ThrAsp: 4.22 ± 0.4
3.368ThrGlu: 3.368 ± 0.362
2.273ThrPhe: 2.273 ± 0.305
4.586ThrGly: 4.586 ± 0.487
0.933ThrHis: 0.933 ± 0.184
5.073ThrIle: 5.073 ± 0.393
4.058ThrLys: 4.058 ± 0.355
5.681ThrLeu: 5.681 ± 0.49
1.542ThrMet: 1.542 ± 0.259
3.896ThrAsn: 3.896 ± 0.525
3.165ThrPro: 3.165 ± 0.494
2.354ThrGln: 2.354 ± 0.335
2.232ThrArg: 2.232 ± 0.251
5.357ThrSer: 5.357 ± 0.491
4.667ThrThr: 4.667 ± 0.512
4.383ThrVal: 4.383 ± 0.392
0.487ThrTrp: 0.487 ± 0.132
3.003ThrTyr: 3.003 ± 0.315
0.0ThrXaa: 0.0 ± 0.0
Val
3.855ValAla: 3.855 ± 0.428
0.528ValCys: 0.528 ± 0.125
4.058ValAsp: 4.058 ± 0.447
4.626ValGlu: 4.626 ± 0.451
2.191ValPhe: 2.191 ± 0.392
4.342ValGly: 4.342 ± 0.393
1.217ValHis: 1.217 ± 0.221
3.977ValIle: 3.977 ± 0.363
4.261ValLys: 4.261 ± 0.371
3.896ValLeu: 3.896 ± 0.39
1.38ValMet: 1.38 ± 0.244
4.018ValAsn: 4.018 ± 0.374
2.475ValPro: 2.475 ± 0.314
2.191ValGln: 2.191 ± 0.322
2.313ValArg: 2.313 ± 0.288
5.032ValSer: 5.032 ± 0.42
5.438ValThr: 5.438 ± 0.437
3.936ValVal: 3.936 ± 0.401
0.528ValTrp: 0.528 ± 0.149
3.531ValTyr: 3.531 ± 0.374
0.0ValXaa: 0.0 ± 0.0
Trp
0.609TrpAla: 0.609 ± 0.188
0.243TrpCys: 0.243 ± 0.094
0.609TrpAsp: 0.609 ± 0.142
0.568TrpGlu: 0.568 ± 0.131
0.406TrpPhe: 0.406 ± 0.13
0.812TrpGly: 0.812 ± 0.183
0.406TrpHis: 0.406 ± 0.154
0.69TrpIle: 0.69 ± 0.173
0.649TrpLys: 0.649 ± 0.152
1.583TrpLeu: 1.583 ± 0.306
0.284TrpMet: 0.284 ± 0.113
0.73TrpAsn: 0.73 ± 0.188
0.243TrpPro: 0.243 ± 0.088
0.487TrpGln: 0.487 ± 0.133
0.284TrpArg: 0.284 ± 0.092
0.893TrpSer: 0.893 ± 0.209
0.446TrpThr: 0.446 ± 0.125
1.217TrpVal: 1.217 ± 0.206
0.284TrpTrp: 0.284 ± 0.113
0.568TrpTyr: 0.568 ± 0.167
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.8TyrAla: 2.8 ± 0.388
0.528TyrCys: 0.528 ± 0.166
3.206TyrAsp: 3.206 ± 0.471
2.719TyrGlu: 2.719 ± 0.331
1.867TyrPhe: 1.867 ± 0.228
2.76TyrGly: 2.76 ± 0.389
1.055TyrHis: 1.055 ± 0.233
3.49TyrIle: 3.49 ± 0.412
2.962TyrLys: 2.962 ± 0.299
3.328TyrLeu: 3.328 ± 0.433
1.502TyrMet: 1.502 ± 0.273
2.76TyrAsn: 2.76 ± 0.301
1.826TyrPro: 1.826 ± 0.204
1.42TyrGln: 1.42 ± 0.232
2.07TyrArg: 2.07 ± 0.331
3.449TyrSer: 3.449 ± 0.38
2.719TyrThr: 2.719 ± 0.356
2.516TyrVal: 2.516 ± 0.279
0.568TyrTrp: 0.568 ± 0.164
1.664TyrTyr: 1.664 ± 0.298
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 130 proteins (24643 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski