Amino acid dipepetide frequency for Vaprio virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.416AlaAla: 3.416 ± 1.357
1.051AlaCys: 1.051 ± 0.535
3.941AlaAsp: 3.941 ± 1.782
1.051AlaGlu: 1.051 ± 0.499
0.788AlaPhe: 0.788 ± 0.372
3.153AlaGly: 3.153 ± 2.172
1.576AlaHis: 1.576 ± 0.603
2.627AlaIle: 2.627 ± 0.7
2.365AlaLys: 2.365 ± 0.597
4.992AlaLeu: 4.992 ± 1.608
0.263AlaMet: 0.263 ± 0.151
2.89AlaAsn: 2.89 ± 1.421
1.051AlaPro: 1.051 ± 0.612
1.576AlaGln: 1.576 ± 0.755
2.365AlaArg: 2.365 ± 0.438
2.89AlaSer: 2.89 ± 0.579
4.729AlaThr: 4.729 ± 1.97
2.627AlaVal: 2.627 ± 1.035
0.525AlaTrp: 0.525 ± 0.303
1.839AlaTyr: 1.839 ± 0.656
0.0AlaXaa: 0.0 ± 0.0
Cys
1.051CysAla: 1.051 ± 0.297
0.263CysCys: 0.263 ± 0.151
0.788CysAsp: 0.788 ± 0.409
0.525CysGlu: 0.525 ± 0.642
1.314CysPhe: 1.314 ± 0.466
1.051CysGly: 1.051 ± 0.381
0.525CysHis: 0.525 ± 0.484
1.576CysIle: 1.576 ± 0.907
2.102CysLys: 2.102 ± 0.969
2.365CysLeu: 2.365 ± 0.615
0.263CysMet: 0.263 ± 0.365
0.525CysAsn: 0.525 ± 0.302
0.788CysPro: 0.788 ± 0.492
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.576CysSer: 1.576 ± 0.451
1.576CysThr: 1.576 ± 0.477
0.788CysVal: 0.788 ± 0.293
0.788CysTrp: 0.788 ± 0.452
0.525CysTyr: 0.525 ± 0.267
0.0CysXaa: 0.0 ± 0.0
Asp
2.102AspAla: 2.102 ± 1.624
1.839AspCys: 1.839 ± 0.967
2.102AspAsp: 2.102 ± 0.979
7.62AspGlu: 7.62 ± 3.148
3.416AspPhe: 3.416 ± 0.806
2.627AspGly: 2.627 ± 0.79
2.102AspHis: 2.102 ± 0.687
3.678AspIle: 3.678 ± 0.662
3.678AspLys: 3.678 ± 0.912
6.569AspLeu: 6.569 ± 0.898
2.89AspMet: 2.89 ± 0.514
2.627AspAsn: 2.627 ± 1.185
3.941AspPro: 3.941 ± 0.315
2.365AspGln: 2.365 ± 0.653
2.365AspArg: 2.365 ± 0.567
4.992AspSer: 4.992 ± 1.383
1.051AspThr: 1.051 ± 0.451
2.627AspVal: 2.627 ± 0.818
0.525AspTrp: 0.525 ± 0.267
2.102AspTyr: 2.102 ± 0.73
0.0AspXaa: 0.0 ± 0.0
Glu
4.204GluAla: 4.204 ± 1.381
0.788GluCys: 0.788 ± 0.453
4.992GluAsp: 4.992 ± 1.932
8.671GluGlu: 8.671 ± 4.949
3.153GluPhe: 3.153 ± 1.177
4.204GluGly: 4.204 ± 1.255
1.314GluHis: 1.314 ± 0.536
4.204GluIle: 4.204 ± 0.717
5.255GluLys: 5.255 ± 0.707
6.043GluLeu: 6.043 ± 1.507
1.576GluMet: 1.576 ± 0.517
2.89GluAsn: 2.89 ± 1.511
3.416GluPro: 3.416 ± 0.667
1.576GluGln: 1.576 ± 0.71
2.365GluArg: 2.365 ± 0.999
6.043GluSer: 6.043 ± 2.265
3.941GluThr: 3.941 ± 0.827
4.729GluVal: 4.729 ± 0.356
1.314GluTrp: 1.314 ± 0.951
3.416GluTyr: 3.416 ± 1.201
0.0GluXaa: 0.0 ± 0.0
Phe
0.788PheAla: 0.788 ± 0.409
1.051PheCys: 1.051 ± 1.014
2.89PheAsp: 2.89 ± 0.963
2.102PheGlu: 2.102 ± 0.627
2.627PhePhe: 2.627 ± 0.764
2.365PheGly: 2.365 ± 1.12
1.314PheHis: 1.314 ± 1.049
2.365PheIle: 2.365 ± 0.563
4.204PheLys: 4.204 ± 0.478
4.467PheLeu: 4.467 ± 1.308
0.788PheMet: 0.788 ± 0.609
1.314PheAsn: 1.314 ± 0.798
2.627PhePro: 2.627 ± 0.533
2.365PheGln: 2.365 ± 0.748
2.89PheArg: 2.89 ± 1.133
4.992PheSer: 4.992 ± 1.236
1.314PheThr: 1.314 ± 0.85
3.416PheVal: 3.416 ± 0.929
0.788PheTrp: 0.788 ± 0.293
1.051PheTyr: 1.051 ± 0.382
0.0PheXaa: 0.0 ± 0.0
Gly
1.839GlyAla: 1.839 ± 0.89
1.051GlyCys: 1.051 ± 0.499
3.678GlyAsp: 3.678 ± 1.141
4.204GlyGlu: 4.204 ± 1.135
1.839GlyPhe: 1.839 ± 0.463
4.204GlyGly: 4.204 ± 1.281
1.314GlyHis: 1.314 ± 0.501
3.416GlyIle: 3.416 ± 0.547
4.204GlyLys: 4.204 ± 1.394
7.357GlyLeu: 7.357 ± 2.111
3.153GlyMet: 3.153 ± 1.899
1.839GlyAsn: 1.839 ± 0.46
3.153GlyPro: 3.153 ± 1.009
1.576GlyGln: 1.576 ± 1.018
2.89GlyArg: 2.89 ± 0.895
4.204GlySer: 4.204 ± 0.551
3.153GlyThr: 3.153 ± 0.67
3.416GlyVal: 3.416 ± 0.747
0.788GlyTrp: 0.788 ± 0.417
2.365GlyTyr: 2.365 ± 0.826
0.0GlyXaa: 0.0 ± 0.0
His
1.576HisAla: 1.576 ± 0.465
0.525HisCys: 0.525 ± 0.368
1.314HisAsp: 1.314 ± 0.931
1.839HisGlu: 1.839 ± 0.615
2.102HisPhe: 2.102 ± 0.587
0.263HisGly: 0.263 ± 0.502
1.314HisHis: 1.314 ± 0.536
2.102HisIle: 2.102 ± 0.917
2.365HisLys: 2.365 ± 0.68
2.89HisLeu: 2.89 ± 0.929
0.0HisMet: 0.0 ± 0.0
0.525HisAsn: 0.525 ± 0.302
1.839HisPro: 1.839 ± 0.8
0.263HisGln: 0.263 ± 0.151
1.314HisArg: 1.314 ± 0.501
1.576HisSer: 1.576 ± 0.451
0.525HisThr: 0.525 ± 0.267
2.365HisVal: 2.365 ± 1.18
0.788HisTrp: 0.788 ± 0.453
1.314HisTyr: 1.314 ± 0.29
0.0HisXaa: 0.0 ± 0.0
Ile
2.627IleAla: 2.627 ± 1.029
1.314IleCys: 1.314 ± 0.395
4.467IleAsp: 4.467 ± 0.986
6.043IleGlu: 6.043 ± 2.265
2.365IlePhe: 2.365 ± 0.57
4.467IleGly: 4.467 ± 0.856
2.365IleHis: 2.365 ± 0.794
2.365IleIle: 2.365 ± 0.723
6.831IleLys: 6.831 ± 1.419
5.518IleLeu: 5.518 ± 1.449
2.102IleMet: 2.102 ± 0.489
2.89IleAsn: 2.89 ± 1.357
2.365IlePro: 2.365 ± 0.911
2.102IleGln: 2.102 ± 0.645
3.416IleArg: 3.416 ± 1.126
6.306IleSer: 6.306 ± 1.78
1.839IleThr: 1.839 ± 0.46
2.102IleVal: 2.102 ± 1.131
0.263IleTrp: 0.263 ± 0.151
2.627IleTyr: 2.627 ± 0.952
0.0IleXaa: 0.0 ± 0.0
Lys
3.941LysAla: 3.941 ± 0.951
1.051LysCys: 1.051 ± 0.438
4.729LysAsp: 4.729 ± 1.502
6.569LysGlu: 6.569 ± 1.108
1.839LysPhe: 1.839 ± 0.593
5.255LysGly: 5.255 ± 1.704
2.102LysHis: 2.102 ± 0.799
9.459LysIle: 9.459 ± 2.608
6.306LysLys: 6.306 ± 1.027
4.992LysLeu: 4.992 ± 1.841
2.365LysMet: 2.365 ± 0.792
5.255LysAsn: 5.255 ± 1.129
2.89LysPro: 2.89 ± 0.666
1.576LysGln: 1.576 ± 0.641
4.204LysArg: 4.204 ± 0.356
4.467LysSer: 4.467 ± 1.01
5.518LysThr: 5.518 ± 1.455
3.941LysVal: 3.941 ± 1.159
1.314LysTrp: 1.314 ± 0.756
2.102LysTyr: 2.102 ± 0.556
0.0LysXaa: 0.0 ± 0.0
Leu
2.89LeuAla: 2.89 ± 0.62
2.102LeuCys: 2.102 ± 0.314
6.306LeuAsp: 6.306 ± 0.874
6.569LeuGlu: 6.569 ± 1.266
4.467LeuPhe: 4.467 ± 0.77
7.357LeuGly: 7.357 ± 1.545
2.365LeuHis: 2.365 ± 0.996
7.357LeuIle: 7.357 ± 1.384
6.043LeuLys: 6.043 ± 1.76
9.196LeuLeu: 9.196 ± 2.341
2.627LeuMet: 2.627 ± 1.222
5.78LeuAsn: 5.78 ± 2.152
2.89LeuPro: 2.89 ± 1.201
2.89LeuGln: 2.89 ± 0.506
6.043LeuArg: 6.043 ± 1.53
7.357LeuSer: 7.357 ± 1.706
5.255LeuThr: 5.255 ± 0.935
4.992LeuVal: 4.992 ± 1.346
0.788LeuTrp: 0.788 ± 0.736
2.89LeuTyr: 2.89 ± 0.916
0.0LeuXaa: 0.0 ± 0.0
Met
2.102MetAla: 2.102 ± 0.761
0.525MetCys: 0.525 ± 0.267
1.314MetAsp: 1.314 ± 0.581
2.102MetGlu: 2.102 ± 0.67
1.576MetPhe: 1.576 ± 0.508
1.051MetGly: 1.051 ± 0.606
0.263MetHis: 0.263 ± 0.321
1.839MetIle: 1.839 ± 0.463
2.102MetLys: 2.102 ± 0.805
1.314MetLeu: 1.314 ± 0.752
0.788MetMet: 0.788 ± 0.632
1.314MetAsn: 1.314 ± 0.664
0.788MetPro: 0.788 ± 1.095
0.788MetGln: 0.788 ± 0.293
0.788MetArg: 0.788 ± 0.653
2.102MetSer: 2.102 ± 0.871
1.576MetThr: 1.576 ± 0.907
3.678MetVal: 3.678 ± 1.019
0.525MetTrp: 0.525 ± 0.267
0.525MetTyr: 0.525 ± 0.73
0.0MetXaa: 0.0 ± 0.0
Asn
2.102AsnAla: 2.102 ± 0.594
0.263AsnCys: 0.263 ± 0.502
2.365AsnAsp: 2.365 ± 0.558
3.153AsnGlu: 3.153 ± 1.188
2.365AsnPhe: 2.365 ± 0.723
3.153AsnGly: 3.153 ± 1.252
2.102AsnHis: 2.102 ± 0.979
3.153AsnIle: 3.153 ± 1.506
3.416AsnLys: 3.416 ± 0.745
5.255AsnLeu: 5.255 ± 1.536
1.576AsnMet: 1.576 ± 0.55
2.627AsnAsn: 2.627 ± 1.044
1.839AsnPro: 1.839 ± 1.046
1.576AsnGln: 1.576 ± 0.648
3.153AsnArg: 3.153 ± 0.771
3.153AsnSer: 3.153 ± 0.727
1.051AsnThr: 1.051 ± 0.604
2.102AsnVal: 2.102 ± 0.845
1.576AsnTrp: 1.576 ± 0.354
2.365AsnTyr: 2.365 ± 0.545
0.0AsnXaa: 0.0 ± 0.0
Pro
3.941ProAla: 3.941 ± 1.832
0.788ProCys: 0.788 ± 0.453
4.992ProAsp: 4.992 ± 0.67
2.627ProGlu: 2.627 ± 0.876
1.314ProPhe: 1.314 ± 0.29
1.839ProGly: 1.839 ± 0.404
0.788ProHis: 0.788 ± 0.293
2.89ProIle: 2.89 ± 0.663
3.941ProLys: 3.941 ± 1.115
5.255ProLeu: 5.255 ± 0.527
0.263ProMet: 0.263 ± 0.382
2.627ProAsn: 2.627 ± 1.421
2.89ProPro: 2.89 ± 1.511
0.263ProGln: 0.263 ± 0.365
1.314ProArg: 1.314 ± 0.832
2.627ProSer: 2.627 ± 1.216
2.627ProThr: 2.627 ± 0.653
2.365ProVal: 2.365 ± 0.54
0.263ProTrp: 0.263 ± 0.151
1.314ProTyr: 1.314 ± 0.598
0.0ProXaa: 0.0 ± 0.0
Gln
1.314GlnAla: 1.314 ± 0.672
0.263GlnCys: 0.263 ± 0.151
1.314GlnAsp: 1.314 ± 1.346
1.576GlnGlu: 1.576 ± 0.89
1.314GlnPhe: 1.314 ± 0.746
1.839GlnGly: 1.839 ± 0.935
0.525GlnHis: 0.525 ± 0.267
1.839GlnIle: 1.839 ± 0.567
1.051GlnLys: 1.051 ± 0.795
3.941GlnLeu: 3.941 ± 0.989
0.788GlnMet: 0.788 ± 0.653
2.365GlnAsn: 2.365 ± 0.808
1.314GlnPro: 1.314 ± 0.485
0.788GlnGln: 0.788 ± 0.944
1.314GlnArg: 1.314 ± 0.58
2.365GlnSer: 2.365 ± 1.039
1.576GlnThr: 1.576 ± 0.517
1.576GlnVal: 1.576 ± 0.617
0.0GlnTrp: 0.0 ± 0.0
0.525GlnTyr: 0.525 ± 0.484
0.0GlnXaa: 0.0 ± 0.0
Arg
3.416ArgAla: 3.416 ± 0.541
2.102ArgCys: 2.102 ± 1.086
1.576ArgAsp: 1.576 ± 0.354
2.627ArgGlu: 2.627 ± 0.543
3.678ArgPhe: 3.678 ± 0.979
2.627ArgGly: 2.627 ± 1.185
1.576ArgHis: 1.576 ± 0.517
2.102ArgIle: 2.102 ± 1.209
4.204ArgLys: 4.204 ± 0.964
3.153ArgLeu: 3.153 ± 0.892
1.314ArgMet: 1.314 ± 0.526
2.365ArgAsn: 2.365 ± 0.762
3.678ArgPro: 3.678 ± 0.871
1.576ArgGln: 1.576 ± 0.508
2.627ArgArg: 2.627 ± 0.874
4.467ArgSer: 4.467 ± 0.595
3.153ArgThr: 3.153 ± 0.849
2.365ArgVal: 2.365 ± 0.818
1.051ArgTrp: 1.051 ± 0.381
1.314ArgTyr: 1.314 ± 0.594
0.0ArgXaa: 0.0 ± 0.0
Ser
3.416SerAla: 3.416 ± 0.968
1.051SerCys: 1.051 ± 0.381
5.255SerAsp: 5.255 ± 1.258
4.992SerGlu: 4.992 ± 2.947
4.467SerPhe: 4.467 ± 1.447
4.467SerGly: 4.467 ± 0.988
1.051SerHis: 1.051 ± 0.499
4.467SerIle: 4.467 ± 1.059
7.62SerLys: 7.62 ± 1.781
6.831SerLeu: 6.831 ± 1.697
1.314SerMet: 1.314 ± 0.395
2.365SerAsn: 2.365 ± 0.449
2.627SerPro: 2.627 ± 0.622
2.102SerGln: 2.102 ± 0.627
5.78SerArg: 5.78 ± 0.877
5.518SerSer: 5.518 ± 1.023
3.416SerThr: 3.416 ± 0.929
4.204SerVal: 4.204 ± 0.931
1.314SerTrp: 1.314 ± 0.501
2.102SerTyr: 2.102 ± 0.765
0.0SerXaa: 0.0 ± 0.0
Thr
1.051ThrAla: 1.051 ± 0.606
0.525ThrCys: 0.525 ± 0.403
1.314ThrAsp: 1.314 ± 0.536
3.941ThrGlu: 3.941 ± 0.78
1.576ThrPhe: 1.576 ± 0.619
3.153ThrGly: 3.153 ± 1.119
0.788ThrHis: 0.788 ± 0.453
3.153ThrIle: 3.153 ± 0.979
3.416ThrLys: 3.416 ± 1.179
4.729ThrLeu: 4.729 ± 0.745
1.576ThrMet: 1.576 ± 0.585
2.365ThrAsn: 2.365 ± 0.878
2.627ThrPro: 2.627 ± 1.396
0.788ThrGln: 0.788 ± 0.372
2.89ThrArg: 2.89 ± 1.003
3.678ThrSer: 3.678 ± 1.242
2.102ThrThr: 2.102 ± 0.896
3.941ThrVal: 3.941 ± 0.928
2.102ThrTrp: 2.102 ± 1.121
2.102ThrTyr: 2.102 ± 0.314
0.0ThrXaa: 0.0 ± 0.0
Val
2.102ValAla: 2.102 ± 1.067
1.314ValCys: 1.314 ± 0.756
4.992ValAsp: 4.992 ± 0.511
4.729ValGlu: 4.729 ± 0.915
2.365ValPhe: 2.365 ± 0.519
3.416ValGly: 3.416 ± 0.664
2.102ValHis: 2.102 ± 0.62
2.89ValIle: 2.89 ± 0.506
4.992ValLys: 4.992 ± 1.836
7.62ValLeu: 7.62 ± 2.18
1.314ValMet: 1.314 ± 0.598
2.365ValAsn: 2.365 ± 0.736
2.102ValPro: 2.102 ± 1.209
2.627ValGln: 2.627 ± 0.741
2.89ValArg: 2.89 ± 0.409
3.416ValSer: 3.416 ± 1.465
1.839ValThr: 1.839 ± 0.485
2.365ValVal: 2.365 ± 1.16
0.788ValTrp: 0.788 ± 0.452
1.576ValTyr: 1.576 ± 0.721
0.0ValXaa: 0.0 ± 0.0
Trp
0.525TrpAla: 0.525 ± 0.484
0.0TrpCys: 0.0 ± 0.0
1.051TrpAsp: 1.051 ± 0.535
1.314TrpGlu: 1.314 ± 0.756
1.839TrpPhe: 1.839 ± 0.656
1.576TrpGly: 1.576 ± 0.907
0.525TrpHis: 0.525 ± 0.267
1.576TrpIle: 1.576 ± 1.164
2.365TrpLys: 2.365 ± 0.808
0.525TrpLeu: 0.525 ± 0.302
0.263TrpMet: 0.263 ± 0.321
1.051TrpAsn: 1.051 ± 0.604
0.525TrpPro: 0.525 ± 0.302
0.0TrpGln: 0.0 ± 0.0
0.525TrpArg: 0.525 ± 0.302
0.263TrpSer: 0.263 ± 0.398
0.525TrpThr: 0.525 ± 0.303
1.314TrpVal: 1.314 ± 0.54
0.263TrpTrp: 0.263 ± 0.151
0.263TrpTyr: 0.263 ± 0.321
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.314TyrAla: 1.314 ± 0.56
0.525TyrCys: 0.525 ± 0.576
1.839TyrAsp: 1.839 ± 0.967
1.839TyrGlu: 1.839 ± 0.52
1.576TyrPhe: 1.576 ± 0.354
1.576TyrGly: 1.576 ± 0.517
0.788TyrHis: 0.788 ± 0.293
1.314TyrIle: 1.314 ± 0.388
3.416TyrLys: 3.416 ± 1.246
3.153TyrLeu: 3.153 ± 1.19
1.576TyrMet: 1.576 ± 0.835
2.365TyrAsn: 2.365 ± 0.832
1.576TyrPro: 1.576 ± 0.279
0.788TyrGln: 0.788 ± 0.434
1.839TyrArg: 1.839 ± 0.698
2.365TyrSer: 2.365 ± 0.493
1.051TyrThr: 1.051 ± 0.887
2.89TyrVal: 2.89 ± 1.154
0.525TyrTrp: 0.525 ± 0.267
0.525TyrTyr: 0.525 ± 0.403
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3807 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski