Amino acid dipepetide frequency for Xylella phage Salvo

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.627AlaAla: 14.627 ± 2.567
1.125AlaCys: 1.125 ± 0.336
6.582AlaAsp: 6.582 ± 0.766
7.257AlaGlu: 7.257 ± 0.889
3.769AlaPhe: 3.769 ± 0.504
8.27AlaGly: 8.27 ± 0.83
2.363AlaHis: 2.363 ± 0.357
4.669AlaIle: 4.669 ± 0.494
5.288AlaLys: 5.288 ± 0.797
8.889AlaLeu: 8.889 ± 0.728
3.038AlaMet: 3.038 ± 0.426
3.376AlaAsn: 3.376 ± 0.408
3.601AlaPro: 3.601 ± 0.43
4.557AlaGln: 4.557 ± 0.9
8.664AlaArg: 8.664 ± 0.79
5.626AlaSer: 5.626 ± 0.683
4.726AlaThr: 4.726 ± 0.473
6.076AlaVal: 6.076 ± 0.729
1.744AlaTrp: 1.744 ± 0.288
3.15AlaTyr: 3.15 ± 0.468
0.0AlaXaa: 0.0 ± 0.0
Cys
1.294CysAla: 1.294 ± 0.268
0.056CysCys: 0.056 ± 0.057
0.788CysAsp: 0.788 ± 0.26
0.9CysGlu: 0.9 ± 0.299
0.281CysPhe: 0.281 ± 0.117
0.619CysGly: 0.619 ± 0.162
0.169CysHis: 0.169 ± 0.08
0.45CysIle: 0.45 ± 0.179
0.45CysLys: 0.45 ± 0.179
0.563CysLeu: 0.563 ± 0.174
0.113CysMet: 0.113 ± 0.083
0.169CysAsn: 0.169 ± 0.092
0.619CysPro: 0.619 ± 0.204
0.113CysGln: 0.113 ± 0.077
0.563CysArg: 0.563 ± 0.169
0.281CysSer: 0.281 ± 0.105
0.169CysThr: 0.169 ± 0.091
0.45CysVal: 0.45 ± 0.185
0.281CysTrp: 0.281 ± 0.125
0.338CysTyr: 0.338 ± 0.15
0.0CysXaa: 0.0 ± 0.0
Asp
7.539AspAla: 7.539 ± 0.962
0.281AspCys: 0.281 ± 0.114
5.738AspAsp: 5.738 ± 1.171
5.907AspGlu: 5.907 ± 0.599
2.644AspPhe: 2.644 ± 0.431
5.795AspGly: 5.795 ± 0.539
1.125AspHis: 1.125 ± 0.299
3.432AspIle: 3.432 ± 0.375
2.982AspLys: 2.982 ± 0.378
4.163AspLeu: 4.163 ± 0.473
1.969AspMet: 1.969 ± 0.27
2.419AspAsn: 2.419 ± 0.361
2.644AspPro: 2.644 ± 0.448
1.744AspGln: 1.744 ± 0.256
3.713AspArg: 3.713 ± 0.444
3.094AspSer: 3.094 ± 0.5
3.713AspThr: 3.713 ± 0.533
4.388AspVal: 4.388 ± 0.436
1.238AspTrp: 1.238 ± 0.291
2.307AspTyr: 2.307 ± 0.424
0.0AspXaa: 0.0 ± 0.0
Glu
6.301GluAla: 6.301 ± 0.753
0.394GluCys: 0.394 ± 0.136
3.713GluAsp: 3.713 ± 0.479
4.951GluGlu: 4.951 ± 0.686
3.038GluPhe: 3.038 ± 0.444
3.882GluGly: 3.882 ± 0.471
1.463GluHis: 1.463 ± 0.343
3.488GluIle: 3.488 ± 0.374
3.207GluLys: 3.207 ± 0.465
6.976GluLeu: 6.976 ± 0.597
1.969GluMet: 1.969 ± 0.246
2.869GluAsn: 2.869 ± 0.419
2.813GluPro: 2.813 ± 0.441
3.938GluGln: 3.938 ± 0.573
4.107GluArg: 4.107 ± 0.494
3.488GluSer: 3.488 ± 0.37
2.925GluThr: 2.925 ± 0.361
3.882GluVal: 3.882 ± 0.497
1.294GluTrp: 1.294 ± 0.338
1.913GluTyr: 1.913 ± 0.303
0.0GluXaa: 0.0 ± 0.0
Phe
3.319PheAla: 3.319 ± 0.364
0.225PheCys: 0.225 ± 0.088
3.544PheAsp: 3.544 ± 0.533
2.194PheGlu: 2.194 ± 0.266
1.463PhePhe: 1.463 ± 0.313
2.982PheGly: 2.982 ± 0.436
0.225PheHis: 0.225 ± 0.118
1.406PheIle: 1.406 ± 0.28
1.913PheLys: 1.913 ± 0.27
2.307PheLeu: 2.307 ± 0.353
1.125PheMet: 1.125 ± 0.251
2.25PheAsn: 2.25 ± 0.305
1.463PhePro: 1.463 ± 0.307
0.731PheGln: 0.731 ± 0.172
2.532PheArg: 2.532 ± 0.389
2.925PheSer: 2.925 ± 0.435
1.913PheThr: 1.913 ± 0.308
2.588PheVal: 2.588 ± 0.361
0.619PheTrp: 0.619 ± 0.205
1.181PheTyr: 1.181 ± 0.218
0.0PheXaa: 0.0 ± 0.0
Gly
7.482GlyAla: 7.482 ± 0.484
1.013GlyCys: 1.013 ± 0.248
5.401GlyAsp: 5.401 ± 0.543
4.501GlyGlu: 4.501 ± 0.49
2.813GlyPhe: 2.813 ± 0.336
5.738GlyGly: 5.738 ± 0.639
1.125GlyHis: 1.125 ± 0.309
3.038GlyIle: 3.038 ± 0.347
3.544GlyLys: 3.544 ± 0.497
5.288GlyLeu: 5.288 ± 0.47
2.419GlyMet: 2.419 ± 0.374
2.757GlyAsn: 2.757 ± 0.478
3.094GlyPro: 3.094 ± 0.382
2.419GlyGln: 2.419 ± 0.471
5.063GlyArg: 5.063 ± 0.52
4.669GlySer: 4.669 ± 0.625
4.895GlyThr: 4.895 ± 0.7
5.57GlyVal: 5.57 ± 0.551
1.575GlyTrp: 1.575 ± 0.302
2.757GlyTyr: 2.757 ± 0.425
0.0GlyXaa: 0.0 ± 0.0
His
2.475HisAla: 2.475 ± 0.434
0.169HisCys: 0.169 ± 0.105
1.575HisAsp: 1.575 ± 0.24
0.956HisGlu: 0.956 ± 0.244
0.788HisPhe: 0.788 ± 0.25
1.632HisGly: 1.632 ± 0.299
0.563HisHis: 0.563 ± 0.184
1.238HisIle: 1.238 ± 0.274
0.844HisLys: 0.844 ± 0.231
1.013HisLeu: 1.013 ± 0.26
0.45HisMet: 0.45 ± 0.139
0.619HisAsn: 0.619 ± 0.177
0.844HisPro: 0.844 ± 0.229
0.45HisGln: 0.45 ± 0.15
0.9HisArg: 0.9 ± 0.189
1.013HisSer: 1.013 ± 0.26
0.956HisThr: 0.956 ± 0.228
1.238HisVal: 1.238 ± 0.252
0.338HisTrp: 0.338 ± 0.147
0.675HisTyr: 0.675 ± 0.197
0.0HisXaa: 0.0 ± 0.0
Ile
5.626IleAla: 5.626 ± 0.541
0.281IleCys: 0.281 ± 0.12
3.769IleAsp: 3.769 ± 0.416
3.657IleGlu: 3.657 ± 0.567
1.406IlePhe: 1.406 ± 0.285
2.982IleGly: 2.982 ± 0.43
0.788IleHis: 0.788 ± 0.213
2.588IleIle: 2.588 ± 0.412
1.688IleLys: 1.688 ± 0.339
2.869IleLeu: 2.869 ± 0.295
1.069IleMet: 1.069 ± 0.226
2.138IleAsn: 2.138 ± 0.334
1.857IlePro: 1.857 ± 0.263
1.632IleGln: 1.632 ± 0.418
2.7IleArg: 2.7 ± 0.47
1.8IleSer: 1.8 ± 0.261
2.588IleThr: 2.588 ± 0.398
3.657IleVal: 3.657 ± 0.471
0.394IleTrp: 0.394 ± 0.146
1.35IleTyr: 1.35 ± 0.281
0.0IleXaa: 0.0 ± 0.0
Lys
5.232LysAla: 5.232 ± 0.864
0.506LysCys: 0.506 ± 0.172
1.688LysAsp: 1.688 ± 0.396
2.925LysGlu: 2.925 ± 0.42
1.8LysPhe: 1.8 ± 0.308
3.207LysGly: 3.207 ± 0.391
0.788LysHis: 0.788 ± 0.235
1.519LysIle: 1.519 ± 0.299
4.388LysLys: 4.388 ± 0.689
4.163LysLeu: 4.163 ± 0.482
1.632LysMet: 1.632 ± 0.354
1.406LysAsn: 1.406 ± 0.292
2.588LysPro: 2.588 ± 0.403
2.138LysGln: 2.138 ± 0.394
3.713LysArg: 3.713 ± 0.488
2.982LysSer: 2.982 ± 0.463
3.15LysThr: 3.15 ± 0.457
3.263LysVal: 3.263 ± 0.338
1.181LysTrp: 1.181 ± 0.349
1.406LysTyr: 1.406 ± 0.241
0.0LysXaa: 0.0 ± 0.0
Leu
8.326LeuAla: 8.326 ± 1.006
1.013LeuCys: 1.013 ± 0.231
5.682LeuAsp: 5.682 ± 0.621
4.782LeuGlu: 4.782 ± 0.743
2.982LeuPhe: 2.982 ± 0.354
5.795LeuGly: 5.795 ± 0.657
2.025LeuHis: 2.025 ± 0.323
2.757LeuIle: 2.757 ± 0.358
3.544LeuLys: 3.544 ± 0.449
7.257LeuLeu: 7.257 ± 0.66
1.35LeuMet: 1.35 ± 0.252
2.813LeuAsn: 2.813 ± 0.54
4.388LeuPro: 4.388 ± 0.569
2.813LeuGln: 2.813 ± 0.394
6.751LeuArg: 6.751 ± 0.724
5.288LeuSer: 5.288 ± 0.562
4.557LeuThr: 4.557 ± 0.535
5.738LeuVal: 5.738 ± 0.495
1.069LeuTrp: 1.069 ± 0.271
2.082LeuTyr: 2.082 ± 0.338
0.0LeuXaa: 0.0 ± 0.0
Met
2.925MetAla: 2.925 ± 0.393
0.169MetCys: 0.169 ± 0.099
1.181MetAsp: 1.181 ± 0.318
0.956MetGlu: 0.956 ± 0.216
0.731MetPhe: 0.731 ± 0.198
1.35MetGly: 1.35 ± 0.284
0.788MetHis: 0.788 ± 0.211
1.35MetIle: 1.35 ± 0.29
1.294MetLys: 1.294 ± 0.245
2.7MetLeu: 2.7 ± 0.342
0.675MetMet: 0.675 ± 0.205
0.788MetAsn: 0.788 ± 0.159
0.788MetPro: 0.788 ± 0.201
1.069MetGln: 1.069 ± 0.201
2.644MetArg: 2.644 ± 0.392
2.475MetSer: 2.475 ± 0.42
2.25MetThr: 2.25 ± 0.308
1.125MetVal: 1.125 ± 0.209
0.506MetTrp: 0.506 ± 0.153
0.45MetTyr: 0.45 ± 0.13
0.0MetXaa: 0.0 ± 0.0
Asn
4.501AsnAla: 4.501 ± 0.479
0.281AsnCys: 0.281 ± 0.12
2.813AsnAsp: 2.813 ± 0.374
2.138AsnGlu: 2.138 ± 0.299
1.294AsnPhe: 1.294 ± 0.235
3.713AsnGly: 3.713 ± 0.533
0.563AsnHis: 0.563 ± 0.166
1.857AsnIle: 1.857 ± 0.387
1.463AsnLys: 1.463 ± 0.229
2.644AsnLeu: 2.644 ± 0.407
0.788AsnMet: 0.788 ± 0.153
1.8AsnAsn: 1.8 ± 0.285
1.969AsnPro: 1.969 ± 0.33
1.013AsnGln: 1.013 ± 0.286
2.7AsnArg: 2.7 ± 0.37
1.969AsnSer: 1.969 ± 0.349
2.194AsnThr: 2.194 ± 0.483
2.925AsnVal: 2.925 ± 0.422
0.731AsnTrp: 0.731 ± 0.232
1.013AsnTyr: 1.013 ± 0.225
0.0AsnXaa: 0.0 ± 0.0
Pro
4.501ProAla: 4.501 ± 0.54
0.45ProCys: 0.45 ± 0.135
2.982ProAsp: 2.982 ± 0.461
2.869ProGlu: 2.869 ± 0.431
1.744ProPhe: 1.744 ± 0.358
3.657ProGly: 3.657 ± 0.543
1.069ProHis: 1.069 ± 0.277
1.857ProIle: 1.857 ± 0.337
2.363ProLys: 2.363 ± 0.444
3.432ProLeu: 3.432 ± 0.375
1.519ProMet: 1.519 ± 0.323
1.913ProAsn: 1.913 ± 0.375
2.644ProPro: 2.644 ± 0.526
1.857ProGln: 1.857 ± 0.296
2.475ProArg: 2.475 ± 0.438
2.419ProSer: 2.419 ± 0.34
2.082ProThr: 2.082 ± 0.436
2.588ProVal: 2.588 ± 0.409
0.956ProTrp: 0.956 ± 0.193
1.519ProTyr: 1.519 ± 0.287
0.0ProXaa: 0.0 ± 0.0
Gln
4.613GlnAla: 4.613 ± 1.018
0.169GlnCys: 0.169 ± 0.095
1.519GlnAsp: 1.519 ± 0.271
1.857GlnGlu: 1.857 ± 0.437
1.238GlnPhe: 1.238 ± 0.258
2.363GlnGly: 2.363 ± 0.426
0.619GlnHis: 0.619 ± 0.175
1.857GlnIle: 1.857 ± 0.345
2.025GlnLys: 2.025 ± 0.359
3.769GlnLeu: 3.769 ± 0.536
0.619GlnMet: 0.619 ± 0.188
1.238GlnAsn: 1.238 ± 0.294
1.744GlnPro: 1.744 ± 0.414
2.138GlnGln: 2.138 ± 0.565
3.319GlnArg: 3.319 ± 0.55
1.519GlnSer: 1.519 ± 0.286
2.082GlnThr: 2.082 ± 0.318
2.588GlnVal: 2.588 ± 0.403
0.788GlnTrp: 0.788 ± 0.252
1.013GlnTyr: 1.013 ± 0.237
0.0GlnXaa: 0.0 ± 0.0
Arg
6.582ArgAla: 6.582 ± 0.99
0.506ArgCys: 0.506 ± 0.139
4.726ArgAsp: 4.726 ± 0.576
6.245ArgGlu: 6.245 ± 0.727
2.475ArgPhe: 2.475 ± 0.412
4.726ArgGly: 4.726 ± 0.404
1.294ArgHis: 1.294 ± 0.283
4.051ArgIle: 4.051 ± 0.522
3.713ArgLys: 3.713 ± 0.449
5.907ArgLeu: 5.907 ± 0.731
2.419ArgMet: 2.419 ± 0.28
2.925ArgAsn: 2.925 ± 0.393
3.376ArgPro: 3.376 ± 0.371
1.913ArgGln: 1.913 ± 0.48
5.795ArgArg: 5.795 ± 0.58
3.376ArgSer: 3.376 ± 0.395
2.644ArgThr: 2.644 ± 0.278
5.176ArgVal: 5.176 ± 0.541
1.238ArgTrp: 1.238 ± 0.315
2.307ArgTyr: 2.307 ± 0.281
0.0ArgXaa: 0.0 ± 0.0
Ser
5.851SerAla: 5.851 ± 0.764
0.394SerCys: 0.394 ± 0.169
3.657SerAsp: 3.657 ± 0.421
3.094SerGlu: 3.094 ± 0.374
2.307SerPhe: 2.307 ± 0.361
5.007SerGly: 5.007 ± 0.522
0.675SerHis: 0.675 ± 0.226
2.363SerIle: 2.363 ± 0.256
3.713SerLys: 3.713 ± 0.506
4.163SerLeu: 4.163 ± 0.53
1.125SerMet: 1.125 ± 0.242
2.419SerAsn: 2.419 ± 0.388
2.588SerPro: 2.588 ± 0.391
1.632SerGln: 1.632 ± 0.303
4.557SerArg: 4.557 ± 0.566
3.432SerSer: 3.432 ± 0.481
2.925SerThr: 2.925 ± 0.392
3.094SerVal: 3.094 ± 0.369
0.9SerTrp: 0.9 ± 0.241
1.857SerTyr: 1.857 ± 0.292
0.0SerXaa: 0.0 ± 0.0
Thr
5.682ThrAla: 5.682 ± 0.64
0.281ThrCys: 0.281 ± 0.107
3.713ThrAsp: 3.713 ± 0.553
3.15ThrGlu: 3.15 ± 0.455
1.857ThrPhe: 1.857 ± 0.338
4.895ThrGly: 4.895 ± 0.679
0.956ThrHis: 0.956 ± 0.238
2.025ThrIle: 2.025 ± 0.307
2.757ThrLys: 2.757 ± 0.37
4.613ThrLeu: 4.613 ± 0.49
1.744ThrMet: 1.744 ± 0.339
2.025ThrAsn: 2.025 ± 0.387
2.588ThrPro: 2.588 ± 0.442
1.8ThrGln: 1.8 ± 0.343
3.15ThrArg: 3.15 ± 0.55
2.869ThrSer: 2.869 ± 0.497
3.826ThrThr: 3.826 ± 0.453
4.501ThrVal: 4.501 ± 0.599
1.125ThrTrp: 1.125 ± 0.275
1.575ThrTyr: 1.575 ± 0.347
0.0ThrXaa: 0.0 ± 0.0
Val
6.301ValAla: 6.301 ± 0.747
0.619ValCys: 0.619 ± 0.192
4.726ValAsp: 4.726 ± 0.59
5.682ValGlu: 5.682 ± 0.607
2.307ValPhe: 2.307 ± 0.287
5.063ValGly: 5.063 ± 0.616
0.956ValHis: 0.956 ± 0.228
2.644ValIle: 2.644 ± 0.449
2.194ValLys: 2.194 ± 0.338
5.57ValLeu: 5.57 ± 0.496
1.35ValMet: 1.35 ± 0.223
1.913ValAsn: 1.913 ± 0.304
3.488ValPro: 3.488 ± 0.468
2.925ValGln: 2.925 ± 0.407
4.613ValArg: 4.613 ± 0.433
3.882ValSer: 3.882 ± 0.543
3.882ValThr: 3.882 ± 0.493
4.613ValVal: 4.613 ± 0.58
0.844ValTrp: 0.844 ± 0.199
2.588ValTyr: 2.588 ± 0.412
0.0ValXaa: 0.0 ± 0.0
Trp
0.956TrpAla: 0.956 ± 0.217
0.506TrpCys: 0.506 ± 0.193
1.069TrpAsp: 1.069 ± 0.299
0.844TrpGlu: 0.844 ± 0.2
0.731TrpPhe: 0.731 ± 0.189
1.294TrpGly: 1.294 ± 0.285
0.619TrpHis: 0.619 ± 0.202
0.956TrpIle: 0.956 ± 0.257
0.788TrpLys: 0.788 ± 0.218
1.857TrpLeu: 1.857 ± 0.352
0.281TrpMet: 0.281 ± 0.149
0.9TrpAsn: 0.9 ± 0.244
0.619TrpPro: 0.619 ± 0.208
1.069TrpGln: 1.069 ± 0.249
1.35TrpArg: 1.35 ± 0.287
1.238TrpSer: 1.238 ± 0.268
1.181TrpThr: 1.181 ± 0.227
0.506TrpVal: 0.506 ± 0.201
0.675TrpTrp: 0.675 ± 0.214
0.563TrpTyr: 0.563 ± 0.196
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.263TyrAla: 3.263 ± 0.431
0.225TyrCys: 0.225 ± 0.11
1.969TyrAsp: 1.969 ± 0.302
1.8TyrGlu: 1.8 ± 0.307
1.238TyrPhe: 1.238 ± 0.218
2.194TyrGly: 2.194 ± 0.324
0.563TyrHis: 0.563 ± 0.164
1.294TyrIle: 1.294 ± 0.299
1.519TyrLys: 1.519 ± 0.284
2.869TyrLeu: 2.869 ± 0.342
0.506TyrMet: 0.506 ± 0.239
1.632TyrAsn: 1.632 ± 0.346
1.181TyrPro: 1.181 ± 0.31
1.125TyrGln: 1.125 ± 0.21
2.082TyrArg: 2.082 ± 0.3
1.406TyrSer: 1.406 ± 0.294
2.475TyrThr: 2.475 ± 0.434
2.194TyrVal: 2.194 ± 0.38
0.506TyrTrp: 0.506 ± 0.169
1.125TyrTyr: 1.125 ± 0.256
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (17776 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski