Amino acid dipeptide frequency for Tomato bright yellow mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
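As an illustration only, the counting can be sketched in Python as below. The function name `dipeptide_frequencies`, the use of one-letter codes, and the choice not to count pairs that span two proteins are assumptions of this sketch; the database may count differently.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes

def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping windows of length 2; assumption: pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not the real proteome: 7 dipeptides in total, "MK" occurs twice.
freqs = dipeptide_frequencies(["MKKLA", "MKXA"])
```

On this toy input the pair `MK` accounts for 2 of the 7 dipeptides, and the frequencies sum to 1000‰.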

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each of the 400 would occur with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
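A minimal sketch of this comparison, assuming a plain threshold at the uniform expectation (the page does not state the exact rule used for colouring, nor whether the error estimate is taken into account). The function name `classify` is hypothetical; the two example values are taken from the table.

```python
def classify(freq_permille, n_symbols=20):
    """Label a dipeptide frequency relative to the uniform expectation."""
    # 1000 / 20**2 = 2.5 per mille; with Xaa included, 1000 / 21**2 is about 2.27.
    expected = 1000.0 / n_symbols ** 2
    if freq_permille > expected:
        return "over"    # would be marked red under this assumption
    if freq_permille < expected:
        return "under"   # would be marked blue under this assumption
    return "expected"

# IleSer and AlaCys values as listed in the table (per mille).
labels = {pair: classify(value) for pair, value in {"IleSer": 14.002, "AlaCys": 1.167}.items()}
```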

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.501 ± 0.937 | 1.167 ± 0.933 | 1.167 ± 0.832 | 1.167 ± 0.933 | 0.0 ± 0.0 | 1.167 ± 0.933 | 0.0 ± 0.0 | 4.667 ± 2.112 | 1.167 ± 0.933 | 4.667 ± 0.881 | 0.0 ± 0.0 | 1.167 ± 0.832 | 2.334 ± 1.651 | 4.667 ± 1.51 | 1.167 ± 0.933 | 3.501 ± 2.193 | 4.667 ± 3.002 | 2.334 ± 1.651 | 0.0 ± 0.0 | 3.501 ± 2.632 | 0.0 ± 0.0
Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.167 ± 0.933 | 0.0 ± 0.0 | 1.167 ± 0.832 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.167 ± 0.933 | 0.0 ± 0.0 | 1.167 ± 0.832 | 1.167 ± 0.832 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.501 ± 2.496 | 1.167 ± 0.933 | 2.334 ± 1.866 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 2.334 ± 2.676 | 1.167 ± 0.832 | 1.167 ± 1.338 | 1.167 ± 0.933 | 1.167 ± 0.933 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.334 ± 1.337 | 1.167 ± 1.277 | 2.334 ± 0.826 | 0.0 ± 0.0 | 2.334 ± 1.533 | 2.334 ± 1.199 | 2.334 ± 1.664 | 2.334 ± 1.866 | 2.334 ± 1.533 | 0.0 ± 0.0 | 3.501 ± 2.193 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Glu | 1.167 ± 0.933 | 1.167 ± 0.832 | 2.334 ± 1.651 | 2.334 ± 1.664 | 0.0 ± 0.0 | 3.501 ± 1.554 | 1.167 ± 1.277 | 4.667 ± 2.263 | 1.167 ± 1.338 | 1.167 ± 1.338 | 0.0 ± 0.0 | 4.667 ± 3.002 | 1.167 ± 0.933 | 3.501 ± 1.878 | 3.501 ± 2.496 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.334 ± 1.664 | 1.167 ± 0.933 | 0.0 ± 0.0 | 0.0 ± 0.0
Phe | 1.167 ± 1.277 | 1.167 ± 0.933 | 1.167 ± 0.933 | 1.167 ± 0.933 | 0.0 ± 0.0 | 1.167 ± 0.933 | 2.334 ± 1.199 | 2.334 ± 2.676 | 3.501 ± 2.333 | 1.167 ± 0.832 | 0.0 ± 0.0 | 4.667 ± 1.259 | 2.334 ± 1.327 | 1.167 ± 1.338 | 1.167 ± 1.277 | 1.167 ± 0.933 | 1.167 ± 1.338 | 1.167 ± 0.832 | 2.334 ± 1.866 | 2.334 ± 1.866 | 0.0 ± 0.0
Gly | 1.167 ± 0.933 | 1.167 ± 0.933 | 0.0 ± 0.0 | 2.334 ± 1.651 | 1.167 ± 0.832 | 2.334 ± 1.866 | 1.167 ± 0.832 | 3.501 ± 1.878 | 9.335 ± 3.817 | 1.167 ± 1.277 | 0.0 ± 0.0 | 8.168 ± 2.163 | 3.501 ± 1.554 | 4.667 ± 1.653 | 5.834 ± 3.011 | 4.667 ± 2.42 | 2.334 ± 1.533 | 2.334 ± 2.555 | 0.0 ± 0.0 | 1.167 ± 1.338 | 0.0 ± 0.0
His | 1.167 ± 1.338 | 0.0 ± 0.0 | 1.167 ± 0.933 | 1.167 ± 1.277 | 1.167 ± 0.832 | 2.334 ± 1.327 | 1.167 ± 1.338 | 4.667 ± 3.566 | 2.334 ± 1.533 | 3.501 ± 1.346 | 0.0 ± 0.0 | 2.334 ± 2.555 | 0.0 ± 0.0 | 1.167 ± 0.933 | 4.667 ± 2.158 | 2.334 ± 1.327 | 4.667 ± 1.318 | 2.334 ± 0.826 | 0.0 ± 0.0 | 1.167 ± 1.277 | 1.167 ± 0.933
Ile | 1.167 ± 0.832 | 2.334 ± 0.826 | 1.167 ± 1.277 | 0.0 ± 0.0 | 1.167 ± 1.338 | 2.334 ± 1.199 | 4.667 ± 2.525 | 3.501 ± 1.621 | 2.334 ± 1.533 | 7.001 ± 3.712 | 2.334 ± 1.236 | 4.667 ± 1.259 | 3.501 ± 1.346 | 3.501 ± 2.496 | 4.667 ± 2.573 | 14.002 ± 3.108 | 7.001 ± 2.119 | 1.167 ± 0.933 | 3.501 ± 2.632 | 4.667 ± 1.69 | 0.0 ± 0.0
Lys | 4.667 ± 3.066 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.334 ± 1.327 | 3.501 ± 1.604 | 2.334 ± 1.327 | 3.501 ± 1.621 | 4.667 ± 1.259 | 3.501 ± 2.496 | 7.001 ± 1.651 | 1.167 ± 0.832 | 3.501 ± 1.554 | 1.167 ± 0.933 | 0.0 ± 0.0 | 7.001 ± 2.523 | 3.501 ± 0.937 | 4.667 ± 3.328 | 4.667 ± 3.732 | 0.0 ± 0.0 | 2.334 ± 1.337 | 0.0 ± 0.0
Leu | 1.167 ± 1.277 | 0.0 ± 0.0 | 2.334 ± 1.651 | 3.501 ± 1.129 | 0.0 ± 0.0 | 5.834 ± 0.604 | 3.501 ± 1.346 | 5.834 ± 3.16 | 8.168 ± 1.943 | 7.001 ± 1.921 | 1.167 ± 1.338 | 4.667 ± 2.398 | 4.667 ± 2.42 | 5.834 ± 2.033 | 8.168 ± 3.33 | 7.001 ± 3.531 | 4.667 ± 2.42 | 3.501 ± 1.129 | 1.167 ± 0.832 | 5.834 ± 1.984 | 0.0 ± 0.0
Met | 3.501 ± 0.937 | 0.0 ± 0.0 | 3.501 ± 2.193 | 1.167 ± 0.832 | 3.501 ± 1.878 | 1.167 ± 0.832 | 2.334 ± 1.337 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.167 ± 0.832 | 0.0 ± 0.0 | 1.167 ± 0.832 | 2.334 ± 1.664 | 0.0 ± 0.0 | 1.167 ± 0.832 | 2.334 ± 0.826 | 0.0 ± 0.0 | 1.167 ± 0.832 | 0.0 ± 0.0 | 2.334 ± 1.866 | 0.0 ± 0.0
Asn | 2.334 ± 0.826 | 0.0 ± 0.0 | 1.167 ± 0.933 | 2.334 ± 1.866 | 2.334 ± 1.533 | 4.667 ± 1.69 | 7.001 ± 3.119 | 7.001 ± 1.761 | 1.167 ± 0.933 | 1.167 ± 0.832 | 3.501 ± 1.432 | 2.334 ± 1.199 | 4.667 ± 1.259 | 0.0 ± 0.0 | 2.334 ± 1.533 | 7.001 ± 3.712 | 4.667 ± 2.401 | 4.667 ± 2.398 | 0.0 ± 0.0 | 3.501 ± 2.496 | 0.0 ± 0.0
Pro | 1.167 ± 0.832 | 1.167 ± 0.933 | 1.167 ± 0.933 | 1.167 ± 0.832 | 1.167 ± 1.338 | 4.667 ± 2.654 | 2.334 ± 1.327 | 1.167 ± 0.832 | 4.667 ± 2.42 | 4.667 ± 1.51 | 2.334 ± 1.866 | 3.501 ± 1.621 | 2.334 ± 1.651 | 2.334 ± 1.199 | 7.001 ± 2.946 | 3.501 ± 1.371 | 3.501 ± 1.766 | 1.167 ± 0.933 | 0.0 ± 0.0 | 2.334 ± 1.533 | 0.0 ± 0.0
Gln | 2.334 ± 1.533 | 0.0 ± 0.0 | 1.167 ± 0.832 | 2.334 ± 0.826 | 2.334 ± 1.327 | 1.167 ± 1.338 | 1.167 ± 0.832 | 3.501 ± 1.346 | 1.167 ± 1.338 | 5.834 ± 3.011 | 3.501 ± 2.496 | 3.501 ± 1.621 | 1.167 ± 0.832 | 0.0 ± 0.0 | 3.501 ± 2.507 | 5.834 ± 1.793 | 0.0 ± 0.0 | 2.334 ± 1.866 | 0.0 ± 0.0 | 2.334 ± 0.826 | 0.0 ± 0.0
Arg | 2.334 ± 1.199 | 2.334 ± 1.664 | 7.001 ± 1.521 | 1.167 ± 0.832 | 7.001 ± 3.677 | 7.001 ± 2.946 | 3.501 ± 2.507 | 4.667 ± 2.039 | 4.667 ± 1.259 | 4.667 ± 2.401 | 3.501 ± 0.937 | 1.167 ± 0.832 | 7.001 ± 1.875 | 2.334 ± 1.327 | 7.001 ± 2.743 | 8.168 ± 2.088 | 2.334 ± 1.533 | 3.501 ± 1.554 | 1.167 ± 1.338 | 3.501 ± 1.604 | 0.0 ± 0.0
Ser | 4.667 ± 2.42 | 1.167 ± 0.832 | 1.167 ± 0.933 | 3.501 ± 2.532 | 3.501 ± 1.371 | 4.667 ± 2.401 | 1.167 ± 0.933 | 8.168 ± 2.243 | 7.001 ± 1.521 | 10.502 ± 4.95 | 1.167 ± 0.832 | 7.001 ± 1.747 | 3.501 ± 1.371 | 1.167 ± 1.338 | 3.501 ± 2.721 | 11.669 ± 6.319 | 14.002 ± 5.796 | 3.501 ± 1.554 | 1.167 ± 0.832 | 0.0 ± 0.0 | 0.0 ± 0.0
Thr | 3.501 ± 2.663 | 0.0 ± 0.0 | 1.167 ± 1.277 | 3.501 ± 0.937 | 1.167 ± 0.832 | 4.667 ± 1.259 | 1.167 ± 0.933 | 2.334 ± 1.664 | 2.334 ± 1.664 | 8.168 ± 3.386 | 1.167 ± 0.744 | 2.334 ± 1.866 | 4.667 ± 1.318 | 3.501 ± 1.766 | 8.168 ± 2.859 | 5.834 ± 3.647 | 9.335 ± 3.405 | 1.167 ± 0.933 | 4.667 ± 1.164 | 2.334 ± 0.826 | 0.0 ± 0.0
Val | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.167 ± 1.277 | 1.167 ± 0.933 | 1.167 ± 0.933 | 1.167 ± 1.277 | 5.834 ± 2.134 | 3.501 ± 2.799 | 5.834 ± 2.727 | 2.334 ± 1.866 | 2.334 ± 1.866 | 3.501 ± 1.371 | 3.501 ± 0.937 | 2.334 ± 1.533 | 4.667 ± 1.164 | 2.334 ± 1.866 | 3.501 ± 1.371 | 1.167 ± 0.933 | 3.501 ± 2.799 | 0.0 ± 0.0
Trp | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.167 ± 1.277 | 0.0 ± 0.0 | 1.167 ± 0.832 | 0.0 ± 0.0 | 1.167 ± 1.338 | 1.167 ± 0.832 | 1.167 ± 0.933 | 1.167 ± 0.933 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.667 ± 1.318 | 0.0 ± 0.0 | 2.334 ± 1.199 | 2.334 ± 1.337 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 4.667 ± 3.732 | 0.0 ± 0.0 | 1.167 ± 0.933 | 1.167 ± 0.933 | 2.334 ± 1.533 | 2.334 ± 1.337 | 1.167 ± 1.277 | 4.667 ± 2.525 | 0.0 ± 0.0 | 5.834 ± 3.483 | 1.167 ± 1.296 | 1.167 ± 0.933 | 1.167 ± 1.338 | 3.501 ± 0.937 | 5.834 ± 2.308 | 1.167 ± 0.832 | 2.334 ± 1.199 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.167 ± 0.933 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 4 proteins (858 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
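Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names, the one-letter codes, the fixed seed, and the use of the population standard deviation are assumptions of this sketch, not details confirmed by the page.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency over n_boot protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)

# Toy sequences, not the real proteome.
toy = ["MKKLA", "MAAK", "MKKKK"]
err = bootstrap_error(toy, "KK")
```

A dipeptide that never occurs has the same frequency in every replicate, so its error under this scheme is 0, which matches the `0.0 ± 0.0` entries in the table.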

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski