Amino acid dipeptide frequency for Temperate fruit decay-associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
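As a rough illustration of how such frequencies can be obtained, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the use of overlapping windows, the choice not to count pairs across protein boundaries, and the mapping of every non-standard letter to Xaa are all assumptions made for this example.

```python
from collections import Counter

# Three-letter codes for the 20 standard amino acids; anything else maps to Xaa.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Hypothetical helper: pairs are counted within each protein only
    (an assumption, the database may treat boundaries differently).
    """
    counts = Counter()
    for seq in proteins:
        codes = [THREE_LETTER.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping windows: residues i and i+1 form one dipeptide.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not sequences from this proteome.
freqs = dipeptide_frequencies(["MKKLA", "AKKB"])
```

In the toy input the letter B is not a standard amino acid, so the last pair of the second sequence is counted as LysXaa.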

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
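The arithmetic behind the expected value can be sketched in Python as follows. The helper names `expected_per_mille` and `classify` are made up for this example, and whether the page compares against the 400-dipeptide or the 441-dipeptide expectation when colouring cells is not stated, so the threshold used here is an assumption.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(value, expected):
    """Label an observed per-mille frequency relative to the expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


expected_20 = expected_per_mille(20)  # 2.5 per mille, i.e. 0.25 %
expected_21 = expected_per_mille(21)  # about 2.268 per mille when Xaa is included

print(classify(16.529, expected_20))  # ArgArg value from the table
print(classify(2.361, expected_20))   # AlaAla value from the table
```

The choice of alphabet matters for borderline cells: a value of 2.361‰ is below the 400-dipeptide expectation but above the 441-dipeptide one.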

All values are presented in per mille (‰); to obtain fractions, multiply them by 10^-3.
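For readers who prefer fractions or percentages, the unit conversion is a one-liner. The two function names below are hypothetical and exist only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value / 10.0


fraction = per_mille_to_fraction(16.529)  # ArgArg value from the table
percent = per_mille_to_percent(2.5)       # uniform expectation for 400 dipeptides
```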

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns: second residue (Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa). Each entry: frequency ± error, in ‰.
Ala
AlaAla: 2.361 ± 0.921
AlaCys: 0.0 ± 0.0
AlaAsp: 2.361 ± 1.237
AlaGlu: 0.0 ± 0.0
AlaPhe: 3.542 ± 2.588
AlaGly: 0.0 ± 0.0
AlaHis: 0.0 ± 0.0
AlaIle: 4.723 ± 2.349
AlaLys: 4.723 ± 2.136
AlaLeu: 2.361 ± 1.368
AlaMet: 0.0 ± 0.0
AlaAsn: 5.903 ± 3.182
AlaPro: 4.723 ± 1.415
AlaGln: 1.181 ± 1.428
AlaArg: 3.542 ± 1.038
AlaSer: 3.542 ± 0.908
AlaThr: 3.542 ± 1.564
AlaVal: 3.542 ± 1.872
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.361 ± 1.348
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.542 ± 1.385
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.181 ± 0.863
CysLys: 2.361 ± 1.348
CysLeu: 1.181 ± 0.864
CysMet: 0.0 ± 0.0
CysAsn: 1.181 ± 0.864
CysPro: 0.0 ± 0.0
CysGln: 2.361 ± 0.921
CysArg: 0.0 ± 0.0
CysSer: 1.181 ± 0.863
CysThr: 1.181 ± 0.864
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.181 ± 0.863
AspCys: 0.0 ± 0.0
AspAsp: 3.542 ± 2.588
AspGlu: 3.542 ± 2.588
AspPhe: 1.181 ± 0.863
AspGly: 2.361 ± 0.921
AspHis: 1.181 ± 1.428
AspIle: 3.542 ± 1.561
AspLys: 1.181 ± 0.864
AspLeu: 4.723 ± 2.987
AspMet: 0.0 ± 0.0
AspAsn: 2.361 ± 0.921
AspPro: 1.181 ± 1.165
AspGln: 1.181 ± 0.863
AspArg: 1.181 ± 0.864
AspSer: 1.181 ± 1.297
AspThr: 0.0 ± 0.0
AspVal: 2.361 ± 0.921
AspTrp: 0.0 ± 0.0
AspTyr: 1.181 ± 0.863
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.361 ± 0.921
GluCys: 1.181 ± 1.297
GluAsp: 1.181 ± 0.863
GluGlu: 3.542 ± 1.553
GluPhe: 1.181 ± 0.863
GluGly: 2.361 ± 1.068
GluHis: 3.542 ± 1.561
GluIle: 3.542 ± 1.645
GluLys: 0.0 ± 0.0
GluLeu: 5.903 ± 2.006
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 3.542 ± 2.588
GluGln: 3.542 ± 2.588
GluArg: 3.542 ± 1.645
GluSer: 2.361 ± 0.921
GluThr: 2.361 ± 1.068
GluVal: 1.181 ± 0.863
GluTrp: 1.181 ± 0.863
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.903 ± 2.193
PheCys: 2.361 ± 1.348
PheAsp: 2.361 ± 0.921
PheGlu: 0.0 ± 0.0
PhePhe: 0.0 ± 0.0
PheGly: 1.181 ± 1.165
PheHis: 0.0 ± 0.0
PheIle: 2.361 ± 1.467
PheLys: 1.181 ± 0.864
PheLeu: 3.542 ± 1.786
PheMet: 1.181 ± 0.864
PheAsn: 2.361 ± 1.348
PhePro: 2.361 ± 0.921
PheGln: 1.181 ± 1.165
PheArg: 3.542 ± 1.038
PheSer: 1.181 ± 0.864
PheThr: 4.723 ± 2.386
PheVal: 5.903 ± 3.07
PheTrp: 0.0 ± 0.0
PheTyr: 1.181 ± 0.863
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.181 ± 0.863
GlyCys: 1.181 ± 1.165
GlyAsp: 1.181 ± 0.863
GlyGlu: 4.723 ± 2.275
GlyPhe: 1.181 ± 1.165
GlyGly: 4.723 ± 1.842
GlyHis: 0.0 ± 0.0
GlyIle: 4.723 ± 1.022
GlyLys: 4.723 ± 2.349
GlyLeu: 5.903 ± 4.555
GlyMet: 1.181 ± 0.767
GlyAsn: 2.361 ± 1.297
GlyPro: 2.361 ± 0.921
GlyGln: 3.542 ± 1.673
GlyArg: 1.181 ± 0.863
GlySer: 5.903 ± 1.89
GlyThr: 3.542 ± 1.564
GlyVal: 3.542 ± 1.038
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.361 ± 1.348
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.181 ± 1.165
HisCys: 1.181 ± 0.863
HisAsp: 1.181 ± 0.864
HisGlu: 1.181 ± 0.863
HisPhe: 0.0 ± 0.0
HisGly: 1.181 ± 0.863
HisHis: 0.0 ± 0.0
HisIle: 4.723 ± 2.462
HisLys: 1.181 ± 0.863
HisLeu: 5.903 ± 2.886
HisMet: 2.361 ± 1.068
HisAsn: 2.361 ± 0.921
HisPro: 1.181 ± 0.863
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 2.361 ± 1.725
HisThr: 1.181 ± 0.864
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.181 ± 1.165
IleCys: 0.0 ± 0.0
IleAsp: 0.0 ± 0.0
IleGlu: 2.361 ± 1.725
IlePhe: 3.542 ± 1.564
IleGly: 0.0 ± 0.0
IleHis: 4.723 ± 2.697
IleIle: 8.264 ± 1.882
IleLys: 3.542 ± 1.693
IleLeu: 14.168 ± 6.613
IleMet: 2.361 ± 1.368
IleAsn: 4.723 ± 2.349
IlePro: 7.084 ± 2.642
IleGln: 3.542 ± 2.383
IleArg: 4.723 ± 1.501
IleSer: 7.084 ± 2.961
IleThr: 2.361 ± 1.297
IleVal: 7.084 ± 2.761
IleTrp: 1.181 ± 0.863
IleTyr: 4.723 ± 2.779
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.361 ± 0.921
LysCys: 1.181 ± 0.864
LysAsp: 2.361 ± 1.725
LysGlu: 1.181 ± 0.863
LysPhe: 3.542 ± 1.756
LysGly: 3.542 ± 1.553
LysHis: 1.181 ± 0.863
LysIle: 3.542 ± 1.756
LysLys: 8.264 ± 3.725
LysLeu: 7.084 ± 2.702
LysMet: 1.181 ± 1.165
LysAsn: 2.361 ± 1.467
LysPro: 2.361 ± 1.297
LysGln: 0.0 ± 0.0
LysArg: 7.084 ± 2.944
LysSer: 2.361 ± 0.921
LysThr: 1.181 ± 0.863
LysVal: 2.361 ± 2.329
LysTrp: 2.361 ± 0.921
LysTyr: 3.542 ± 1.561
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.181 ± 1.428
LeuCys: 1.181 ± 0.863
LeuAsp: 2.361 ± 1.467
LeuGlu: 8.264 ± 4.135
LeuPhe: 5.903 ± 3.209
LeuGly: 4.723 ± 3.652
LeuHis: 3.542 ± 1.553
LeuIle: 8.264 ± 4.532
LeuLys: 7.084 ± 3.279
LeuLeu: 9.445 ± 5.517
LeuMet: 1.181 ± 1.237
LeuAsn: 3.542 ± 3.494
LeuPro: 4.723 ± 3.769
LeuGln: 3.542 ± 2.309
LeuArg: 2.361 ± 1.348
LeuSer: 12.987 ± 7.783
LeuThr: 3.542 ± 2.309
LeuVal: 5.903 ± 2.171
LeuTrp: 3.542 ± 1.893
LeuTyr: 3.542 ± 2.521
LeuXaa: 0.0 ± 0.0
Met
MetAla: 5.903 ± 2.414
MetCys: 0.0 ± 0.0
MetAsp: 2.361 ± 1.972
MetGlu: 1.181 ± 1.165
MetPhe: 0.0 ± 0.0
MetGly: 1.181 ± 0.864
MetHis: 0.0 ± 0.0
MetIle: 3.542 ± 1.673
MetLys: 0.0 ± 0.0
MetLeu: 1.181 ± 1.297
MetMet: 1.181 ± 1.165
MetAsn: 0.0 ± 0.0
MetPro: 1.181 ± 1.165
MetGln: 1.181 ± 0.864
MetArg: 0.0 ± 0.0
MetSer: 0.0 ± 0.0
MetThr: 1.181 ± 1.165
MetVal: 2.361 ± 1.729
MetTrp: 1.181 ± 0.864
MetTyr: 1.181 ± 1.297
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.361 ± 1.729
AsnCys: 1.181 ± 0.864
AsnAsp: 1.181 ± 0.864
AsnGlu: 0.0 ± 0.0
AsnPhe: 2.361 ± 1.068
AsnGly: 2.361 ± 1.068
AsnHis: 1.181 ± 0.863
AsnIle: 5.903 ± 3.441
AsnLys: 4.723 ± 1.515
AsnLeu: 3.542 ± 0.908
AsnMet: 0.0 ± 0.0
AsnAsn: 2.361 ± 0.921
AsnPro: 5.903 ± 3.175
AsnGln: 0.0 ± 0.0
AsnArg: 2.361 ± 1.297
AsnSer: 7.084 ± 1.981
AsnThr: 1.181 ± 0.864
AsnVal: 1.181 ± 1.297
AsnTrp: 2.361 ± 0.921
AsnTyr: 5.903 ± 1.411
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.0 ± 0.0
ProCys: 0.0 ± 0.0
ProAsp: 2.361 ± 1.725
ProGlu: 2.361 ± 1.237
ProPhe: 1.181 ± 1.297
ProGly: 4.723 ± 1.874
ProHis: 3.542 ± 2.588
ProIle: 1.181 ± 0.863
ProLys: 3.542 ± 1.135
ProLeu: 3.542 ± 2.593
ProMet: 2.361 ± 1.297
ProAsn: 2.361 ± 1.725
ProPro: 1.181 ± 0.863
ProGln: 2.361 ± 1.729
ProArg: 3.542 ± 0.908
ProSer: 5.903 ± 3.888
ProThr: 2.361 ± 0.921
ProVal: 3.542 ± 1.038
ProTrp: 2.361 ± 0.921
ProTyr: 3.542 ± 1.786
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.361 ± 1.237
GlnCys: 0.0 ± 0.0
GlnAsp: 1.181 ± 0.863
GlnGlu: 2.361 ± 1.297
GlnPhe: 1.181 ± 1.165
GlnGly: 3.542 ± 1.693
GlnHis: 2.361 ± 1.725
GlnIle: 3.542 ± 2.588
GlnLys: 0.0 ± 0.0
GlnLeu: 1.181 ± 0.863
GlnMet: 1.181 ± 1.297
GlnAsn: 2.361 ± 1.741
GlnPro: 0.0 ± 0.0
GlnGln: 2.361 ± 1.297
GlnArg: 2.361 ± 1.972
GlnSer: 3.542 ± 1.893
GlnThr: 3.542 ± 1.885
GlnVal: 2.361 ± 1.297
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.181 ± 0.864
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.361 ± 1.237
ArgCys: 1.181 ± 0.863
ArgAsp: 1.181 ± 0.863
ArgGlu: 3.542 ± 1.756
ArgPhe: 4.723 ± 1.45
ArgGly: 4.723 ± 2.386
ArgHis: 1.181 ± 0.863
ArgIle: 3.542 ± 1.893
ArgLys: 7.084 ± 2.005
ArgLeu: 4.723 ± 2.184
ArgMet: 1.181 ± 1.127
ArgAsn: 1.181 ± 0.863
ArgPro: 2.361 ± 1.729
ArgGln: 3.542 ± 2.907
ArgArg: 16.529 ± 5.205
ArgSer: 3.542 ± 0.908
ArgThr: 2.361 ± 0.921
ArgVal: 4.723 ± 1.022
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.361 ± 1.729
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.361 ± 1.297
SerCys: 1.181 ± 0.864
SerAsp: 3.542 ± 1.965
SerGlu: 2.361 ± 1.068
SerPhe: 1.181 ± 0.864
SerGly: 7.084 ± 2.761
SerHis: 1.181 ± 1.297
SerIle: 4.723 ± 2.015
SerLys: 2.361 ± 1.348
SerLeu: 9.445 ± 5.648
SerMet: 3.542 ± 2.176
SerAsn: 8.264 ± 1.493
SerPro: 3.542 ± 1.038
SerGln: 2.361 ± 2.594
SerArg: 9.445 ± 1.991
SerSer: 11.806 ± 3.976
SerThr: 4.723 ± 2.349
SerVal: 5.903 ± 3.41
SerTrp: 1.181 ± 0.863
SerTyr: 1.181 ± 0.863
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.264 ± 4.877
ThrCys: 1.181 ± 0.864
ThrAsp: 1.181 ± 0.863
ThrGlu: 0.0 ± 0.0
ThrPhe: 2.361 ± 1.725
ThrGly: 5.903 ± 4.322
ThrHis: 0.0 ± 0.0
ThrIle: 5.903 ± 2.522
ThrLys: 2.361 ± 0.921
ThrLeu: 2.361 ± 1.368
ThrMet: 0.0 ± 0.0
ThrAsn: 1.181 ± 1.165
ThrPro: 1.181 ± 0.863
ThrGln: 1.181 ± 0.863
ThrArg: 4.723 ± 2.275
ThrSer: 2.361 ± 1.368
ThrThr: 3.542 ± 1.561
ThrVal: 1.181 ± 0.864
ThrTrp: 2.361 ± 1.297
ThrTyr: 2.361 ± 0.921
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.361 ± 0.921
ValCys: 0.0 ± 0.0
ValAsp: 1.181 ± 0.863
ValGlu: 2.361 ± 1.729
ValPhe: 4.723 ± 2.349
ValGly: 4.723 ± 2.462
ValHis: 1.181 ± 1.165
ValIle: 5.903 ± 2.171
ValLys: 1.181 ± 0.863
ValLeu: 7.084 ± 3.102
ValMet: 4.723 ± 2.354
ValAsn: 3.542 ± 2.061
ValPro: 4.723 ± 2.462
ValGln: 1.181 ± 0.863
ValArg: 3.542 ± 1.561
ValSer: 7.084 ± 1.263
ValThr: 3.542 ± 2.593
ValVal: 1.181 ± 0.863
ValTrp: 0.0 ± 0.0
ValTyr: 1.181 ± 0.864
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.181 ± 0.863
TrpPhe: 2.361 ± 0.921
TrpGly: 0.0 ± 0.0
TrpHis: 2.361 ± 1.467
TrpIle: 0.0 ± 0.0
TrpLys: 2.361 ± 0.921
TrpLeu: 1.181 ± 1.165
TrpMet: 0.0 ± 0.0
TrpAsn: 1.181 ± 0.864
TrpPro: 0.0 ± 0.0
TrpGln: 2.361 ± 1.725
TrpArg: 2.361 ± 1.297
TrpSer: 0.0 ± 0.0
TrpThr: 1.181 ± 0.864
TrpVal: 4.723 ± 1.45
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 1.181 ± 0.863
TyrAsp: 2.361 ± 0.921
TyrGlu: 2.361 ± 0.921
TyrPhe: 2.361 ± 1.467
TyrGly: 1.181 ± 0.863
TyrHis: 0.0 ± 0.0
TyrIle: 3.542 ± 0.908
TyrLys: 1.181 ± 1.165
TyrLeu: 3.542 ± 1.645
TyrMet: 0.0 ± 0.0
TyrAsn: 3.542 ± 2.383
TyrPro: 2.361 ± 0.921
TyrGln: 0.0 ± 0.0
TyrArg: 0.0 ± 0.0
TyrSer: 5.903 ± 3.385
TyrThr: 2.361 ± 1.725
TyrVal: 2.361 ± 1.237
TyrTrp: 3.542 ± 1.965
TyrTyr: 2.361 ± 1.297
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (848 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
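One plausible reading of protein-level bootstrapping is sketched below in Python: whole proteins are resampled with replacement, the frequency is recomputed for each resampled set, and the spread of those estimates is reported. The exact procedure used by the database is not described here, so the function names, the use of the sample standard deviation, the fixed seed, and the one-letter sequence input are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per-mille frequency of a two-letter pair (one-letter codes) in proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the pair frequency over resampled protein sets."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the set size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not proteins from this proteome.
toy_proteins = ["MKKLA", "ARRS", "GKKRR"]
error_kk = bootstrap_error(toy_proteins, "KK")
```

With only a handful of proteins, as here, each resample can differ a lot from the original set, which is consistent with the relatively large errors in the table.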

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski