Amino acid dipeptide frequency for American plum line pattern virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
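The comparison against the uniform expectation can be sketched in a few lines. This is only an illustration: the function name `classify` and the simple greater-than/less-than rule are assumptions, since the page does not say whether the marking also takes the error estimate into account. The sample values are copied from the table below and are in per mille, so the expected 0.25% corresponds to 2.5‰.

```python
# Expected frequency if all 400 dipeptides were equally likely: 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide frequency (in per mille) relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"      # would be marked red
    if freq_permille < expected:
        return "under"     # would be marked blue
    return "expected"


# A few values taken from the table (per mille).
sample = {"LeuSer": 9.103, "ValVal": 9.536, "AlaCys": 1.3, "AsnAsn": 0.0}
labels = {pair: classify(value) for pair, value in sample.items()}
print(labels)
```

With these inputs, LeuSer and ValVal fall above the expectation and AlaCys and AsnAsn below it.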

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
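As a rough sketch of how such frequencies could be computed, the snippet below counts overlapping pairs of adjacent residues and reports them in per mille. Several details are assumptions, not taken from the database: the function name `dipeptide_permille`, counting pairs only within a protein (never across two proteins), mapping every non-standard letter to X (Xaa), and normalising by the total number of counted pairs. The toy sequences are invented for illustration.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues is treated as X (Xaa).
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences): "B" is non-standard and becomes X.
freqs = dipeptide_permille(["MAAVL", "AAB"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f} per mille ({value * 1e-3:.6f} as a fraction)")
```

The last line shows the conversion described above: a per mille value multiplied by 10^-3 gives the plain fraction.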

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.202 ± 2.253
AlaCys: 1.3 ± 0.55
AlaAsp: 5.202 ± 1.038
AlaGlu: 3.901 ± 1.315
AlaPhe: 1.3 ± 0.635
AlaGly: 3.901 ± 1.016
AlaHis: 1.734 ± 0.546
AlaIle: 3.034 ± 0.615
AlaLys: 3.901 ± 1.315
AlaLeu: 4.768 ± 0.869
AlaMet: 2.167 ± 1.335
AlaAsn: 2.167 ± 1.181
AlaPro: 3.468 ± 1.522
AlaGln: 2.601 ± 0.507
AlaArg: 2.601 ± 2.326
AlaSer: 2.601 ± 2.019
AlaThr: 2.601 ± 0.393
AlaVal: 6.068 ± 1.758
AlaTrp: 0.867 ± 0.595
AlaTyr: 0.867 ± 0.632
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.3 ± 0.635
CysCys: 2.167 ± 0.533
CysAsp: 2.167 ± 0.788
CysGlu: 1.3 ± 0.529
CysPhe: 0.867 ± 0.273
CysGly: 0.867 ± 0.534
CysHis: 0.867 ± 0.534
CysIle: 0.867 ± 0.273
CysLys: 0.0 ± 0.0
CysLeu: 2.601 ± 0.786
CysMet: 0.433 ± 0.267
CysAsn: 0.433 ± 0.653
CysPro: 0.433 ± 0.267
CysGln: 0.0 ± 0.0
CysArg: 1.3 ± 0.55
CysSer: 1.734 ± 0.869
CysThr: 1.734 ± 0.554
CysVal: 0.867 ± 0.273
CysTrp: 0.867 ± 0.67
CysTyr: 1.3 ± 0.55
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.768 ± 0.967
AspCys: 1.3 ± 0.424
AspAsp: 2.167 ± 1.291
AspGlu: 3.034 ± 1.709
AspPhe: 3.034 ± 0.54
AspGly: 2.601 ± 0.847
AspHis: 0.433 ± 0.335
AspIle: 3.034 ± 1.047
AspLys: 5.635 ± 1.407
AspLeu: 4.768 ± 1.259
AspMet: 1.3 ± 0.44
AspAsn: 1.3 ± 1.005
AspPro: 2.167 ± 0.801
AspGln: 3.468 ± 1.466
AspArg: 4.335 ± 0.771
AspSer: 3.034 ± 1.068
AspThr: 4.335 ± 1.025
AspVal: 5.635 ± 1.047
AspTrp: 0.0 ± 0.0
AspTyr: 3.901 ± 1.182
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.202 ± 1.266
GluCys: 1.3 ± 0.424
GluAsp: 3.034 ± 1.235
GluGlu: 5.635 ± 1.681
GluPhe: 3.034 ± 1.047
GluGly: 4.335 ± 1.873
GluHis: 0.0 ± 0.0
GluIle: 4.768 ± 1.009
GluLys: 3.034 ± 0.615
GluLeu: 5.202 ± 1.506
GluMet: 3.034 ± 0.986
GluAsn: 1.734 ± 0.546
GluPro: 1.734 ± 1.01
GluGln: 1.734 ± 0.653
GluArg: 4.768 ± 0.832
GluSer: 4.335 ± 0.799
GluThr: 3.901 ± 0.634
GluVal: 6.935 ± 1.044
GluTrp: 0.433 ± 0.623
GluTyr: 1.734 ± 0.653
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.601 ± 0.57
PheCys: 0.867 ± 0.273
PheAsp: 5.202 ± 1.202
PheGlu: 4.335 ± 0.856
PhePhe: 3.468 ± 1.346
PheGly: 1.3 ± 0.424
PheHis: 0.0 ± 0.0
PheIle: 2.167 ± 0.801
PheLys: 3.468 ± 0.288
PheLeu: 6.502 ± 0.272
PheMet: 0.433 ± 0.335
PheAsn: 1.3 ± 1.005
PhePro: 1.734 ± 0.546
PheGln: 2.601 ± 0.786
PheArg: 3.901 ± 1.009
PheSer: 6.068 ± 1.275
PheThr: 1.734 ± 0.5
PheVal: 2.601 ± 2.441
PheTrp: 0.867 ± 0.67
PheTyr: 0.867 ± 0.534
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.167 ± 0.811
GlyCys: 2.167 ± 0.4
GlyAsp: 4.335 ± 0.718
GlyGlu: 3.034 ± 1.003
GlyPhe: 3.034 ± 1.235
GlyGly: 4.768 ± 0.972
GlyHis: 0.433 ± 0.335
GlyIle: 0.433 ± 0.623
GlyLys: 5.202 ± 0.747
GlyLeu: 2.601 ± 0.786
GlyMet: 0.867 ± 0.273
GlyAsn: 0.867 ± 0.273
GlyPro: 3.034 ± 2.078
GlyGln: 0.433 ± 0.623
GlyArg: 3.468 ± 0.288
GlySer: 2.601 ± 0.57
GlyThr: 1.3 ± 0.424
GlyVal: 7.369 ± 0.709
GlyTrp: 0.433 ± 0.335
GlyTyr: 2.167 ± 0.904
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.867 ± 0.534
HisCys: 1.3 ± 0.424
HisAsp: 0.433 ± 0.267
HisGlu: 0.867 ± 0.273
HisPhe: 0.433 ± 0.267
HisGly: 0.867 ± 0.534
HisHis: 1.3 ± 1.005
HisIle: 0.0 ± 0.0
HisLys: 1.3 ± 0.529
HisLeu: 2.601 ± 0.603
HisMet: 0.867 ± 0.67
HisAsn: 2.167 ± 0.602
HisPro: 1.734 ± 0.424
HisGln: 0.867 ± 0.632
HisArg: 1.3 ± 0.651
HisSer: 2.167 ± 0.904
HisThr: 1.3 ± 0.55
HisVal: 0.867 ± 0.534
HisTrp: 0.433 ± 0.267
HisTyr: 0.433 ± 0.267
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.601 ± 1.785
IleCys: 0.433 ± 0.335
IleAsp: 3.034 ± 0.918
IleGlu: 6.068 ± 1.423
IlePhe: 3.034 ± 1.423
IleGly: 1.734 ± 1.102
IleHis: 1.3 ± 0.801
IleIle: 0.867 ± 0.595
IleLys: 2.601 ± 0.603
IleLeu: 3.034 ± 1.047
IleMet: 1.734 ± 0.964
IleAsn: 0.867 ± 0.534
IlePro: 4.335 ± 1.547
IleGln: 2.167 ± 1.047
IleArg: 3.901 ± 1.091
IleSer: 4.768 ± 1.329
IleThr: 3.034 ± 0.284
IleVal: 2.601 ± 1.085
IleTrp: 0.0 ± 0.0
IleTyr: 3.468 ± 0.826
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.901 ± 1.547
LysCys: 0.0 ± 0.0
LysAsp: 1.734 ± 0.546
LysGlu: 5.635 ± 1.155
LysPhe: 6.068 ± 3.447
LysGly: 1.734 ± 1.845
LysHis: 2.601 ± 1.161
LysIle: 1.734 ± 0.424
LysLys: 4.768 ± 1.009
LysLeu: 3.468 ± 1.109
LysMet: 0.867 ± 0.923
LysAsn: 1.734 ± 0.798
LysPro: 3.034 ± 0.54
LysGln: 3.468 ± 0.83
LysArg: 3.468 ± 1.346
LysSer: 5.635 ± 1.337
LysThr: 3.034 ± 1.068
LysVal: 4.335 ± 1.025
LysTrp: 0.0 ± 0.0
LysTyr: 2.167 ± 0.801
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.468 ± 0.755
LeuCys: 3.034 ± 0.615
LeuAsp: 6.068 ± 1.743
LeuGlu: 6.068 ± 1.535
LeuPhe: 4.335 ± 0.962
LeuGly: 6.068 ± 0.865
LeuHis: 2.601 ± 1.076
LeuIle: 4.335 ± 0.736
LeuLys: 3.901 ± 0.773
LeuLeu: 7.802 ± 1.09
LeuMet: 0.867 ± 0.534
LeuAsn: 3.901 ± 1.567
LeuPro: 5.202 ± 1.984
LeuGln: 3.034 ± 0.615
LeuArg: 6.068 ± 0.389
LeuSer: 9.103 ± 0.659
LeuThr: 4.335 ± 1.328
LeuVal: 6.502 ± 1.609
LeuTrp: 0.433 ± 0.653
LeuTyr: 2.167 ± 0.904
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.734 ± 0.424
MetCys: 0.433 ± 0.653
MetAsp: 2.167 ± 0.533
MetGlu: 1.3 ± 0.801
MetPhe: 0.867 ± 0.67
MetGly: 1.3 ± 0.635
MetHis: 0.433 ± 0.653
MetIle: 2.167 ± 0.536
MetLys: 0.0 ± 0.0
MetLeu: 2.601 ± 0.47
MetMet: 0.867 ± 0.534
MetAsn: 1.3 ± 0.862
MetPro: 0.433 ± 0.335
MetGln: 0.0 ± 0.0
MetArg: 0.433 ± 0.335
MetSer: 0.0 ± 0.0
MetThr: 3.468 ± 0.83
MetVal: 0.867 ± 0.595
MetTrp: 0.433 ± 0.267
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.034 ± 1.795
AsnCys: 0.867 ± 0.67
AsnAsp: 1.734 ± 0.554
AsnGlu: 0.867 ± 0.569
AsnPhe: 2.167 ± 0.801
AsnGly: 1.3 ± 0.424
AsnHis: 1.734 ± 0.798
AsnIle: 1.3 ± 0.529
AsnLys: 0.867 ± 0.569
AsnLeu: 3.034 ± 1.235
AsnMet: 1.3 ± 0.424
AsnAsn: 0.0 ± 0.0
AsnPro: 2.601 ± 0.393
AsnGln: 0.433 ± 0.335
AsnArg: 1.734 ± 1.068
AsnSer: 4.335 ± 0.948
AsnThr: 2.601 ± 1.076
AsnVal: 2.601 ± 0.786
AsnTrp: 1.3 ± 1.241
AsnTyr: 0.433 ± 0.335
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.601 ± 0.47
ProCys: 1.3 ± 0.55
ProAsp: 2.167 ± 0.931
ProGlu: 3.901 ± 0.634
ProPhe: 2.601 ± 0.603
ProGly: 2.167 ± 0.996
ProHis: 1.734 ± 0.679
ProIle: 3.034 ± 2.302
ProLys: 0.433 ± 0.335
ProLeu: 5.635 ± 1.026
ProMet: 0.867 ± 0.632
ProAsn: 2.167 ± 0.801
ProPro: 2.601 ± 1.215
ProGln: 1.3 ± 0.635
ProArg: 1.734 ± 0.679
ProSer: 5.635 ± 1.085
ProThr: 1.734 ± 0.653
ProVal: 4.335 ± 0.59
ProTrp: 0.0 ± 0.0
ProTyr: 2.167 ± 0.801
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.3 ± 0.801
GlnCys: 0.0 ± 0.0
GlnAsp: 0.867 ± 0.595
GlnGlu: 0.433 ± 0.335
GlnPhe: 2.167 ± 0.71
GlnGly: 2.601 ± 1.324
GlnHis: 0.433 ± 0.267
GlnIle: 3.034 ± 1.273
GlnLys: 1.3 ± 0.651
GlnLeu: 3.901 ± 0.923
GlnMet: 0.433 ± 0.546
GlnAsn: 1.3 ± 1.253
GlnPro: 0.867 ± 0.273
GlnGln: 0.867 ± 0.595
GlnArg: 2.601 ± 1.223
GlnSer: 1.3 ± 0.801
GlnThr: 0.867 ± 0.595
GlnVal: 3.034 ± 1.123
GlnTrp: 0.433 ± 0.267
GlnTyr: 1.734 ± 0.761
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.601 ± 0.786
ArgCys: 1.3 ± 0.424
ArgAsp: 3.468 ± 0.755
ArgGlu: 5.635 ± 1.366
ArgPhe: 3.034 ± 0.607
ArgGly: 2.601 ± 0.57
ArgHis: 2.601 ± 1.161
ArgIle: 3.901 ± 1.959
ArgLys: 4.768 ± 0.832
ArgLeu: 6.068 ± 2.057
ArgMet: 0.433 ± 0.325
ArgAsn: 3.034 ± 0.716
ArgPro: 1.734 ± 1.19
ArgGln: 0.433 ± 0.267
ArgArg: 4.335 ± 0.736
ArgSer: 6.068 ± 0.68
ArgThr: 3.468 ± 0.884
ArgVal: 5.635 ± 1.101
ArgTrp: 1.3 ± 0.801
ArgTyr: 0.867 ± 0.67
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.202 ± 1.348
SerCys: 1.734 ± 0.653
SerAsp: 4.768 ± 0.487
SerGlu: 3.468 ± 1.707
SerPhe: 5.202 ± 0.786
SerGly: 5.635 ± 1.364
SerHis: 0.867 ± 0.273
SerIle: 3.901 ± 0.806
SerLys: 6.502 ± 1.199
SerLeu: 6.502 ± 0.609
SerMet: 0.867 ± 0.673
SerAsn: 3.034 ± 0.607
SerPro: 3.468 ± 0.755
SerGln: 2.167 ± 0.801
SerArg: 6.068 ± 1.0
SerSer: 8.669 ± 1.219
SerThr: 8.236 ± 2.141
SerVal: 5.202 ± 0.747
SerTrp: 1.3 ± 0.55
SerTyr: 0.867 ± 0.273
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.167 ± 1.176
ThrCys: 0.867 ± 0.569
ThrAsp: 1.3 ± 0.55
ThrGlu: 3.034 ± 1.007
ThrPhe: 1.734 ± 0.546
ThrGly: 2.167 ± 1.079
ThrHis: 0.867 ± 0.569
ThrIle: 3.901 ± 1.182
ThrLys: 3.901 ± 0.929
ThrLeu: 5.202 ± 1.602
ThrMet: 2.167 ± 1.219
ThrAsn: 2.167 ± 1.016
ThrPro: 2.601 ± 0.847
ThrGln: 1.3 ± 0.803
ThrArg: 3.901 ± 1.414
ThrSer: 6.068 ± 1.0
ThrThr: 4.335 ± 1.524
ThrVal: 6.935 ± 0.846
ThrTrp: 0.433 ± 0.335
ThrTyr: 2.167 ± 0.931
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.068 ± 1.168
ValCys: 1.3 ± 0.529
ValAsp: 6.935 ± 1.493
ValGlu: 3.468 ± 1.076
ValPhe: 2.601 ± 1.222
ValGly: 3.901 ± 1.595
ValHis: 1.3 ± 0.801
ValIle: 6.068 ± 2.45
ValLys: 5.202 ± 0.627
ValLeu: 8.669 ± 1.094
ValMet: 0.867 ± 0.595
ValAsn: 3.468 ± 1.466
ValPro: 6.068 ± 1.23
ValGln: 2.167 ± 0.602
ValArg: 4.768 ± 1.496
ValSer: 5.635 ± 0.545
ValThr: 3.034 ± 1.599
ValVal: 9.536 ± 1.914
ValTrp: 1.3 ± 0.862
ValTyr: 3.034 ± 0.959
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.3 ± 0.467
TrpCys: 0.0 ± 0.0
TrpAsp: 0.867 ± 0.273
TrpGlu: 1.3 ± 0.529
TrpPhe: 1.734 ± 0.869
TrpGly: 0.867 ± 0.534
TrpHis: 0.433 ± 0.653
TrpIle: 0.433 ± 0.335
TrpLys: 0.433 ± 0.335
TrpLeu: 0.433 ± 0.267
TrpMet: 0.0 ± 0.0
TrpAsn: 0.433 ± 0.335
TrpPro: 0.433 ± 0.653
TrpGln: 0.0 ± 0.0
TrpArg: 0.433 ± 0.335
TrpSer: 0.0 ± 0.0
TrpThr: 0.867 ± 0.595
TrpVal: 0.867 ± 0.673
TrpTrp: 0.867 ± 0.273
TrpTyr: 0.433 ± 0.267
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.167 ± 1.335
TyrCys: 0.433 ± 0.267
TyrAsp: 3.034 ± 1.063
TyrGlu: 2.601 ± 0.997
TyrPhe: 1.3 ± 0.635
TyrGly: 0.0 ± 0.0
TyrHis: 0.433 ± 0.335
TyrIle: 2.601 ± 0.603
TyrLys: 2.167 ± 0.904
TyrLeu: 3.901 ± 1.689
TyrMet: 0.0 ± 0.0
TyrAsn: 0.867 ± 0.273
TyrPro: 0.433 ± 0.335
TyrGln: 0.433 ± 0.335
TyrArg: 2.167 ± 0.801
TyrSer: 3.901 ± 0.67
TyrThr: 1.3 ± 0.467
TyrVal: 2.601 ± 1.161
TyrTrp: 0.433 ± 0.267
TyrTyr: 1.3 ± 0.801
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2308 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
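A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration, not the database's actual procedure: the names `pair_permille` and `bootstrap_error`, the use of the standard deviation across replicates as the error, the fixed random seed, and the toy sequences are all hypothetical. Only the idea of resampling whole proteins 100 times comes from the note above.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement; return the spread of the estimates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    # Assumption: the reported error is the standard deviation across replicates.
    return statistics.stdev(estimates)


# Toy proteome of four invented sequences.
toy = ["MAAVLLSAA", "MKVLSS", "MAAGG", "MLSLSV"]
err = bootstrap_error(toy, "AA")
print(f"AA: {pair_permille(toy, 'AA'):.3f} ± {err:.3f} per mille")
```

Under this scheme a dipeptide that is absent from every protein gives 0.0 in every replicate and therefore an error of 0.0, which is consistent with the 0.0 ± 0.0 entries in the table. With only 4 proteins, the resamples differ substantially from one another, so relatively large errors are to be expected.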

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski