Amino acid dipeptide frequency for Potato mop-top virus (isolate Potato/Sweden/Sw) (PMTV)

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
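As an illustration of the idea above, the following minimal sketch counts overlapping dipeptides in a set of protein sequences and compares each frequency with the uniform expectation of 1/400. It is not the code used to build this page: the function names, the use of one-letter codes with X standing for Xaa, and the normalisation by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter codes
EXPECTED_PERMILLE = 1000 / 400      # uniform expectation: 2.5 per mille (0.25 %)


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: dipeptides are overlapping adjacent pairs counted within each
    protein, and any non-standard residue is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if permille > EXPECTED_PERMILLE:
        return "over"    # would be marked red
    if permille < EXPECTED_PERMILLE:
        return "under"   # would be marked blue
    return "expected"
```

For example, `classify(6.846)` (the AlaAla value in the table) gives `"over"`, while `classify(0.821)` (AlaPro) gives `"under"`.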

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions; for example, 6.846‰ corresponds to 0.006846.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.846 ± 2.065
AlaCys: 1.917 ± 0.656
AlaAsp: 1.643 ± 0.522
AlaGlu: 6.024 ± 0.868
AlaPhe: 2.738 ± 0.877
AlaGly: 2.738 ± 0.951
AlaHis: 1.369 ± 0.266
AlaIle: 1.917 ± 0.693
AlaLys: 3.834 ± 1.18
AlaLeu: 7.393 ± 2.037
AlaMet: 2.464 ± 0.695
AlaAsn: 3.012 ± 0.501
AlaPro: 0.821 ± 0.478
AlaGln: 3.834 ± 0.816
AlaArg: 4.107 ± 1.42
AlaSer: 4.655 ± 1.06
AlaThr: 4.381 ± 0.625
AlaVal: 7.393 ± 1.842
AlaTrp: 0.821 ± 0.365
AlaTyr: 2.191 ± 1.466
AlaXaa: 0.274 ± 0.27
Cys
CysAla: 0.821 ± 0.685
CysCys: 0.821 ± 0.552
CysAsp: 1.643 ± 0.665
CysGlu: 1.643 ± 0.722
CysPhe: 0.548 ± 0.319
CysGly: 2.464 ± 0.66
CysHis: 0.274 ± 0.159
CysIle: 0.821 ± 1.431
CysLys: 0.548 ± 0.319
CysLeu: 1.643 ± 0.798
CysMet: 0.548 ± 0.434
CysAsn: 1.095 ± 0.764
CysPro: 0.274 ± 0.159
CysGln: 1.095 ± 0.489
CysArg: 2.191 ± 1.418
CysSer: 1.917 ± 1.609
CysThr: 0.274 ± 0.159
CysVal: 1.095 ± 0.867
CysTrp: 0.0 ± 0.0
CysTyr: 0.821 ± 0.776
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.738 ± 0.705
AspCys: 2.191 ± 1.431
AspAsp: 4.381 ± 0.931
AspGlu: 6.024 ± 0.917
AspPhe: 1.917 ± 0.829
AspGly: 2.738 ± 0.739
AspHis: 1.643 ± 0.45
AspIle: 3.012 ± 0.742
AspLys: 4.107 ± 0.901
AspLeu: 6.298 ± 1.18
AspMet: 0.821 ± 0.379
AspAsn: 3.56 ± 0.922
AspPro: 3.286 ± 0.772
AspGln: 1.643 ± 0.273
AspArg: 3.012 ± 1.22
AspSer: 3.834 ± 1.037
AspThr: 1.095 ± 0.638
AspVal: 4.655 ± 1.013
AspTrp: 1.095 ± 0.424
AspTyr: 0.821 ± 0.261
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.572 ± 2.132
GluCys: 0.821 ± 0.366
GluAsp: 4.381 ± 1.057
GluGlu: 4.929 ± 2.282
GluPhe: 2.191 ± 0.974
GluGly: 5.476 ± 1.134
GluHis: 1.369 ± 0.266
GluIle: 6.572 ± 0.795
GluLys: 4.381 ± 2.293
GluLeu: 6.024 ± 0.84
GluMet: 1.917 ± 0.906
GluAsn: 3.012 ± 0.636
GluPro: 0.548 ± 0.395
GluGln: 2.191 ± 0.492
GluArg: 5.476 ± 1.408
GluSer: 7.393 ± 2.078
GluThr: 1.917 ± 0.492
GluVal: 4.655 ± 0.541
GluTrp: 1.095 ± 0.472
GluTyr: 1.917 ± 0.961
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.738 ± 0.438
PheCys: 1.643 ± 0.814
PheAsp: 4.107 ± 0.93
PheGlu: 1.643 ± 0.744
PhePhe: 1.369 ± 0.698
PheGly: 3.286 ± 0.714
PheHis: 0.274 ± 0.159
PheIle: 2.191 ± 1.407
PheLys: 1.643 ± 0.544
PheLeu: 4.381 ± 1.376
PheMet: 1.095 ± 0.274
PheAsn: 1.369 ± 0.546
PhePro: 1.643 ± 0.733
PheGln: 0.821 ± 0.503
PheArg: 1.095 ± 0.472
PheSer: 4.381 ± 0.916
PheThr: 2.191 ± 0.807
PheVal: 1.917 ± 0.604
PheTrp: 0.274 ± 0.27
PheTyr: 1.369 ± 0.408
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.834 ± 1.712
GlyCys: 0.548 ± 0.309
GlyAsp: 4.655 ± 0.857
GlyGlu: 6.846 ± 0.962
GlyPhe: 2.738 ± 1.555
GlyGly: 7.667 ± 2.567
GlyHis: 0.274 ± 0.415
GlyIle: 2.191 ± 0.548
GlyLys: 5.476 ± 1.584
GlyLeu: 4.381 ± 0.737
GlyMet: 2.191 ± 0.718
GlyAsn: 3.56 ± 0.829
GlyPro: 2.191 ± 0.676
GlyGln: 1.917 ± 0.987
GlyArg: 3.56 ± 1.154
GlySer: 6.298 ± 2.331
GlyThr: 2.464 ± 0.579
GlyVal: 3.012 ± 0.763
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.095 ± 0.377
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.917 ± 0.731
HisCys: 0.274 ± 0.159
HisAsp: 0.821 ± 0.478
HisGlu: 1.643 ± 0.273
HisPhe: 0.821 ± 0.365
HisGly: 1.095 ± 1.079
HisHis: 0.0 ± 0.0
HisIle: 0.274 ± 0.159
HisLys: 0.821 ± 0.582
HisLeu: 1.369 ± 0.333
HisMet: 0.274 ± 0.415
HisAsn: 0.548 ± 0.319
HisPro: 0.548 ± 0.309
HisGln: 0.548 ± 0.44
HisArg: 1.643 ± 0.544
HisSer: 1.643 ± 0.814
HisThr: 0.548 ± 0.379
HisVal: 0.821 ± 0.379
HisTrp: 0.274 ± 0.159
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.191 ± 0.56
IleCys: 0.548 ± 0.558
IleAsp: 4.107 ± 1.099
IleGlu: 3.834 ± 0.816
IlePhe: 1.369 ± 0.602
IleGly: 4.107 ± 1.138
IleHis: 1.369 ± 0.699
IleIle: 3.286 ± 1.102
IleLys: 4.107 ± 1.081
IleLeu: 3.834 ± 1.063
IleMet: 1.095 ± 0.377
IleAsn: 2.191 ± 0.548
IlePro: 1.917 ± 0.755
IleGln: 0.548 ± 0.212
IleArg: 2.738 ± 1.397
IleSer: 2.464 ± 0.639
IleThr: 3.012 ± 0.44
IleVal: 3.834 ± 1.402
IleTrp: 0.0 ± 0.0
IleTyr: 2.464 ± 0.961
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.834 ± 0.771
LysCys: 1.095 ± 0.472
LysAsp: 3.012 ± 0.495
LysGlu: 3.012 ± 0.769
LysPhe: 3.286 ± 0.508
LysGly: 2.191 ± 0.512
LysHis: 0.548 ± 0.558
LysIle: 1.917 ± 0.892
LysLys: 4.929 ± 1.747
LysLeu: 6.572 ± 1.811
LysMet: 1.369 ± 0.448
LysAsn: 4.381 ± 1.029
LysPro: 2.738 ± 0.666
LysGln: 2.464 ± 0.877
LysArg: 4.929 ± 1.594
LysSer: 3.286 ± 1.058
LysThr: 3.286 ± 1.034
LysVal: 3.834 ± 1.158
LysTrp: 0.548 ± 0.319
LysTyr: 2.738 ± 1.127
LysXaa: 0.274 ± 0.159
Leu
LeuAla: 4.381 ± 0.602
LeuCys: 1.917 ± 1.094
LeuAsp: 3.56 ± 0.588
LeuGlu: 5.203 ± 1.415
LeuPhe: 4.107 ± 1.103
LeuGly: 5.476 ± 1.3
LeuHis: 2.191 ± 0.977
LeuIle: 3.286 ± 0.806
LeuLys: 4.929 ± 1.25
LeuLeu: 9.036 ± 2.66
LeuMet: 1.643 ± 0.665
LeuAsn: 6.298 ± 1.92
LeuPro: 4.655 ± 1.581
LeuGln: 3.56 ± 0.838
LeuArg: 5.476 ± 2.105
LeuSer: 7.941 ± 1.038
LeuThr: 6.024 ± 1.321
LeuVal: 6.846 ± 0.928
LeuTrp: 1.369 ± 0.654
LeuTyr: 3.56 ± 1.15
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.191 ± 0.577
MetCys: 0.274 ± 0.159
MetAsp: 0.821 ± 0.407
MetGlu: 1.917 ± 0.68
MetPhe: 0.821 ± 0.478
MetGly: 0.274 ± 0.159
MetHis: 0.0 ± 0.0
MetIle: 1.643 ± 0.522
MetLys: 2.464 ± 0.778
MetLeu: 1.643 ± 0.45
MetMet: 0.274 ± 0.27
MetAsn: 0.548 ± 0.319
MetPro: 0.548 ± 0.319
MetGln: 0.821 ± 0.3
MetArg: 1.643 ± 0.677
MetSer: 1.917 ± 1.188
MetThr: 1.369 ± 0.363
MetVal: 2.191 ± 0.724
MetTrp: 0.548 ± 0.319
MetTyr: 0.548 ± 0.319
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.107 ± 1.059
AsnCys: 1.643 ± 1.131
AsnAsp: 2.464 ± 0.981
AsnGlu: 2.738 ± 0.676
AsnPhe: 2.191 ± 0.507
AsnGly: 4.107 ± 1.213
AsnHis: 0.548 ± 0.54
AsnIle: 3.286 ± 2.043
AsnLys: 2.191 ± 0.528
AsnLeu: 4.107 ± 0.914
AsnMet: 0.548 ± 0.682
AsnAsn: 3.834 ± 2.15
AsnPro: 2.191 ± 1.17
AsnGln: 1.095 ± 0.543
AsnArg: 3.56 ± 0.686
AsnSer: 3.56 ± 1.046
AsnThr: 2.738 ± 0.739
AsnVal: 4.107 ± 0.839
AsnTrp: 1.369 ± 0.266
AsnTyr: 1.369 ± 0.718
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.095 ± 0.593
ProCys: 0.821 ± 0.379
ProAsp: 2.464 ± 0.544
ProGlu: 3.56 ± 1.762
ProPhe: 0.274 ± 0.159
ProGly: 2.738 ± 0.805
ProHis: 0.821 ± 0.618
ProIle: 1.369 ± 0.797
ProLys: 2.191 ± 1.276
ProLeu: 2.738 ± 1.048
ProMet: 0.821 ± 0.478
ProAsn: 1.369 ± 0.654
ProPro: 0.274 ± 0.415
ProGln: 1.917 ± 0.879
ProArg: 2.191 ± 0.686
ProSer: 2.738 ± 1.181
ProThr: 1.095 ± 1.079
ProVal: 2.191 ± 1.398
ProTrp: 0.274 ± 0.27
ProTyr: 1.369 ± 0.399
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.464 ± 1.781
GlnCys: 1.095 ± 0.274
GlnAsp: 1.643 ± 0.815
GlnGlu: 2.191 ± 0.705
GlnPhe: 1.369 ± 0.638
GlnGly: 2.464 ± 1.129
GlnHis: 0.0 ± 0.0
GlnIle: 1.643 ± 0.957
GlnLys: 1.643 ± 0.598
GlnLeu: 4.107 ± 1.317
GlnMet: 0.821 ± 0.478
GlnAsn: 1.095 ± 0.478
GlnPro: 0.821 ± 0.675
GlnGln: 1.369 ± 0.92
GlnArg: 1.917 ± 0.579
GlnSer: 4.107 ± 1.16
GlnThr: 1.917 ± 0.522
GlnVal: 3.286 ± 0.402
GlnTrp: 0.548 ± 0.319
GlnTyr: 1.917 ± 0.593
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.203 ± 0.887
ArgCys: 0.821 ± 0.503
ArgAsp: 2.464 ± 0.637
ArgGlu: 5.476 ± 0.663
ArgPhe: 2.191 ± 0.707
ArgGly: 3.012 ± 0.755
ArgHis: 1.643 ± 0.628
ArgIle: 3.286 ± 0.963
ArgLys: 4.107 ± 0.995
ArgLeu: 4.655 ± 1.885
ArgMet: 1.643 ± 1.053
ArgAsn: 3.56 ± 0.898
ArgPro: 2.191 ± 1.328
ArgGln: 3.286 ± 0.651
ArgArg: 5.75 ± 1.972
ArgSer: 3.286 ± 1.029
ArgThr: 2.464 ± 0.721
ArgVal: 5.75 ± 1.652
ArgTrp: 0.821 ± 0.365
ArgTyr: 2.464 ± 0.745
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.572 ± 1.21
SerCys: 1.369 ± 0.833
SerAsp: 5.203 ± 0.624
SerGlu: 5.476 ± 1.833
SerPhe: 4.107 ± 0.924
SerGly: 6.024 ± 2.243
SerHis: 0.274 ± 0.159
SerIle: 3.56 ± 0.562
SerLys: 4.107 ± 1.11
SerLeu: 6.572 ± 1.012
SerMet: 1.095 ± 0.335
SerAsn: 2.191 ± 1.349
SerPro: 2.738 ± 0.938
SerGln: 3.56 ± 0.529
SerArg: 3.834 ± 1.335
SerSer: 8.488 ± 2.264
SerThr: 4.381 ± 1.199
SerVal: 5.476 ± 0.608
SerTrp: 0.274 ± 0.159
SerTyr: 3.834 ± 0.882
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.286 ± 0.739
ThrCys: 0.821 ± 0.84
ThrAsp: 2.464 ± 0.385
ThrGlu: 2.464 ± 0.373
ThrPhe: 2.191 ± 1.017
ThrGly: 3.012 ± 1.269
ThrHis: 1.369 ± 0.471
ThrIle: 2.191 ± 0.566
ThrLys: 2.191 ± 1.043
ThrLeu: 3.834 ± 1.383
ThrMet: 1.369 ± 0.636
ThrAsn: 2.191 ± 1.125
ThrPro: 0.548 ± 0.379
ThrGln: 2.191 ± 0.686
ThrArg: 3.286 ± 0.694
ThrSer: 4.655 ± 1.629
ThrThr: 1.917 ± 0.587
ThrVal: 4.655 ± 1.485
ThrTrp: 0.548 ± 0.319
ThrTyr: 1.917 ± 0.873
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.572 ± 1.419
ValCys: 1.917 ± 2.195
ValAsp: 4.381 ± 1.236
ValGlu: 4.929 ± 0.564
ValPhe: 2.464 ± 0.731
ValGly: 3.834 ± 0.863
ValHis: 1.917 ± 0.579
ValIle: 5.75 ± 0.978
ValLys: 3.56 ± 1.234
ValLeu: 6.298 ± 2.166
ValMet: 1.369 ± 0.429
ValAsn: 4.929 ± 0.72
ValPro: 2.191 ± 0.817
ValGln: 1.917 ± 0.604
ValArg: 4.929 ± 0.785
ValSer: 4.107 ± 0.618
ValThr: 4.929 ± 0.757
ValVal: 8.215 ± 1.333
ValTrp: 0.0 ± 0.0
ValTyr: 3.012 ± 1.268
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.095 ± 0.353
TrpCys: 0.0 ± 0.0
TrpAsp: 1.095 ± 0.525
TrpGlu: 0.274 ± 0.159
TrpPhe: 1.095 ± 0.504
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.274 ± 0.159
TrpLys: 1.369 ± 0.522
TrpLeu: 1.095 ± 0.83
TrpMet: 0.548 ± 0.463
TrpAsn: 0.548 ± 0.319
TrpPro: 0.548 ± 0.558
TrpGln: 0.0 ± 0.0
TrpArg: 0.821 ± 0.365
TrpSer: 0.274 ± 0.159
TrpThr: 0.0 ± 0.0
TrpVal: 0.548 ± 0.319
TrpTrp: 0.274 ± 0.159
TrpTyr: 0.274 ± 0.159
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.917 ± 0.688
TyrCys: 0.0 ± 0.0
TyrAsp: 3.56 ± 0.965
TyrGlu: 3.012 ± 0.49
TyrPhe: 1.643 ± 0.665
TyrGly: 2.191 ± 0.732
TyrHis: 0.0 ± 0.0
TyrIle: 0.548 ± 0.319
TyrLys: 1.643 ± 0.889
TyrLeu: 5.476 ± 1.234
TyrMet: 0.274 ± 0.159
TyrAsn: 2.464 ± 2.018
TyrPro: 1.643 ± 0.444
TyrGln: 1.369 ± 0.602
TyrArg: 1.643 ± 0.535
TyrSer: 2.191 ± 0.577
TyrThr: 1.095 ± 0.593
TyrVal: 2.738 ± 1.483
TyrTrp: 0.274 ± 0.608
TyrTyr: 1.643 ± 1.035
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.274 ± 0.27
XaaArg: 0.274 ± 0.159
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3653 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
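A protein-level bootstrap of this kind could be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the replicates is reported as the error. This is only an illustration under stated assumptions. The page does not say which spread statistic is used, so the standard deviation here is a guess, and `pair_permille` and `bootstrap_error` are hypothetical helper names, not part of Proteome-pI.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    count = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000 * count / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the (population) standard deviation of the
    frequency over n_rounds resamples of the protein list.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)
```

With only 7 proteins, each resample can differ considerably from the original set, which is consistent with the relatively large errors in the table.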

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski