Amino acid dipeptide frequency for Apricot vein clearing associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteins.
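As a minimal sketch of how such a frequency could be computed, the Python snippet below counts overlapping residue pairs. The function name `dipeptide_permille`, the one-letter toy sequences, and the choice not to count pairs across protein boundaries are assumptions made for illustration; this page does not state how Proteome-pI handles these details.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille, pooled over all sequences.

    Assumption: pairs are counted with an overlapping window (residues i and
    i+1) and never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (hypothetical, not the viral proteome).
freqs = dipeptide_permille(["MKLLA", "AKL"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The two toy sequences contain six dipeptides in total, so a pair seen twice (here "KL") gets one third of the total.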

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random with equal probability, each of the 400 would therefore appear with a frequency of 0.25% (2.5‰), or about 0.23% if all 441 combinations are counted. This is not the case in nature, so for better visibility the overrepresented dipeptides are marked in red and the underrepresented ones in blue in the table.
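The uniform expectation and the over/under comparison can be sketched in a few lines of Python. The helper names `expected_permille` and `classify` are hypothetical, and the rule used here (a plain comparison with the uniform expectation, ignoring the error estimates) is an assumption; the exact rule behind the page's colouring is not given.

```python
def expected_permille(alphabet_size=20):
    """Frequency (in per mille) of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a value as over- or underrepresented relative to the uniform expectation."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


print(expected_permille(20))  # 400 possible dipeptides
print(expected_permille(21))  # 441 possible dipeptides, Xaa included
print(classify(9.442))        # LeuLeu value from the table
print(classify(0.429))        # CysCys value from the table
```

With 20 amino acids the expectation is 2.5‰, so the LeuLeu value of 9.442‰ falls above it and the CysCys value of 0.429‰ falls below it.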

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
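For readers who prefer fractions or percentages, the unit conversion is plain arithmetic. The short Python sketch below uses two hypothetical helper names and one value from the table.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


leu_gly = 9.871  # LeuGly value from the table, in per mille
print(f"{permille_to_fraction(leu_gly):.6f}")  # as a fraction
print(f"{permille_to_percent(leu_gly):.4f}%")  # as a percentage
```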

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.575 ± 1.496
AlaCys: 0.858 ± 0.445
AlaAsp: 3.004 ± 2.339
AlaGlu: 2.146 ± 1.539
AlaPhe: 2.146 ± 0.72
AlaGly: 5.15 ± 1.232
AlaHis: 0.858 ± 0.445
AlaIle: 3.863 ± 1.638
AlaLys: 5.15 ± 1.391
AlaLeu: 6.438 ± 2.413
AlaMet: 1.717 ± 1.279
AlaAsn: 3.433 ± 1.264
AlaPro: 2.146 ± 1.321
AlaGln: 2.146 ± 0.824
AlaArg: 3.433 ± 2.282
AlaSer: 3.433 ± 0.509
AlaThr: 1.288 ± 0.748
AlaVal: 3.433 ± 1.264
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.575 ± 0.473
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.004 ± 0.438
CysCys: 0.429 ± 0.222
CysAsp: 0.429 ± 1.057
CysGlu: 0.858 ± 0.703
CysPhe: 0.858 ± 0.445
CysGly: 0.858 ± 0.922
CysHis: 0.858 ± 0.922
CysIle: 2.146 ± 1.112
CysLys: 1.717 ± 0.641
CysLeu: 3.004 ± 1.001
CysMet: 1.288 ± 0.852
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.858 ± 0.445
CysArg: 2.575 ± 1.334
CysSer: 1.288 ± 0.635
CysThr: 0.858 ± 0.805
CysVal: 1.717 ± 0.641
CysTrp: 0.0 ± 0.0
CysTyr: 1.717 ± 1.845
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.717 ± 0.755
AspCys: 0.858 ± 0.445
AspAsp: 4.292 ± 2.223
AspGlu: 4.721 ± 1.891
AspPhe: 3.863 ± 1.564
AspGly: 1.717 ± 0.641
AspHis: 2.146 ± 0.595
AspIle: 3.863 ± 2.106
AspLys: 2.146 ± 0.942
AspLeu: 7.296 ± 2.407
AspMet: 1.717 ± 0.889
AspAsn: 2.575 ± 0.88
AspPro: 1.717 ± 1.611
AspGln: 2.146 ± 2.51
AspArg: 2.146 ± 0.824
AspSer: 3.863 ± 1.325
AspThr: 1.717 ± 0.764
AspVal: 2.575 ± 1.334
AspTrp: 1.717 ± 0.755
AspTyr: 2.575 ± 0.941
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.433 ± 1.51
GluCys: 1.288 ± 0.667
GluAsp: 3.433 ± 1.155
GluGlu: 3.004 ± 1.001
GluPhe: 2.146 ± 0.72
GluGly: 3.863 ± 1.079
GluHis: 2.146 ± 1.112
GluIle: 5.579 ± 1.573
GluLys: 4.721 ± 0.804
GluLeu: 5.579 ± 1.656
GluMet: 0.858 ± 0.445
GluAsn: 3.863 ± 1.45
GluPro: 1.717 ± 0.889
GluGln: 3.433 ± 0.793
GluArg: 5.15 ± 1.151
GluSer: 5.15 ± 2.54
GluThr: 3.433 ± 0.747
GluVal: 6.009 ± 0.4
GluTrp: 0.858 ± 0.445
GluTyr: 1.288 ± 1.519
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.579 ± 1.639
PheCys: 1.717 ± 0.889
PheAsp: 3.863 ± 1.45
PheGlu: 6.009 ± 1.74
PhePhe: 4.292 ± 0.831
PheGly: 3.004 ± 1.753
PheHis: 0.858 ± 0.445
PheIle: 3.004 ± 0.988
PheLys: 4.292 ± 1.58
PheLeu: 6.438 ± 1.87
PheMet: 2.146 ± 1.112
PheAsn: 3.004 ± 1.556
PhePro: 1.288 ± 0.667
PheGln: 1.717 ± 1.406
PheArg: 4.292 ± 0.48
PheSer: 3.433 ± 1.779
PheThr: 3.004 ± 0.755
PheVal: 3.863 ± 1.217
PheTrp: 0.0 ± 0.0
PheTyr: 0.429 ± 1.057
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.146 ± 1.539
GlyCys: 0.429 ± 0.222
GlyAsp: 3.433 ± 1.779
GlyGlu: 3.433 ± 1.196
GlyPhe: 3.433 ± 1.282
GlyGly: 0.858 ± 1.654
GlyHis: 3.004 ± 1.008
GlyIle: 1.717 ± 0.889
GlyLys: 6.009 ± 2.081
GlyLeu: 7.296 ± 2.137
GlyMet: 0.858 ± 0.703
GlyAsn: 3.863 ± 0.833
GlyPro: 2.575 ± 1.06
GlyGln: 1.288 ± 0.667
GlyArg: 2.146 ± 0.72
GlySer: 6.438 ± 2.529
GlyThr: 2.146 ± 1.561
GlyVal: 4.292 ± 4.11
GlyTrp: 1.288 ± 0.667
GlyTyr: 1.288 ± 0.667
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.717 ± 0.889
HisCys: 0.858 ± 0.445
HisAsp: 0.858 ± 0.445
HisGlu: 1.717 ± 0.641
HisPhe: 2.575 ± 1.653
HisGly: 0.429 ± 0.222
HisHis: 1.717 ± 0.764
HisIle: 1.717 ± 0.755
HisLys: 1.717 ± 0.764
HisLeu: 3.863 ± 2.566
HisMet: 1.288 ± 0.667
HisAsn: 0.858 ± 0.445
HisPro: 0.858 ± 0.445
HisGln: 1.288 ± 0.748
HisArg: 1.717 ± 0.889
HisSer: 2.146 ± 1.112
HisThr: 0.858 ± 0.445
HisVal: 1.717 ± 0.784
HisTrp: 0.429 ± 0.222
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.858 ± 0.805
IleCys: 2.575 ± 0.941
IleAsp: 4.721 ± 1.478
IleGlu: 2.146 ± 2.055
IlePhe: 3.433 ± 1.282
IleGly: 2.575 ± 1.91
IleHis: 2.146 ± 0.824
IleIle: 2.575 ± 0.941
IleLys: 4.292 ± 1.943
IleLeu: 7.296 ± 1.696
IleMet: 1.288 ± 0.667
IleAsn: 3.863 ± 2.659
IlePro: 2.146 ± 0.595
IleGln: 2.146 ± 1.153
IleArg: 3.433 ± 1.264
IleSer: 3.433 ± 1.779
IleThr: 1.288 ± 0.635
IleVal: 4.292 ± 1.592
IleTrp: 0.429 ± 0.222
IleTyr: 1.717 ± 0.889
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.575 ± 1.06
LysCys: 0.429 ± 0.222
LysAsp: 3.433 ± 1.282
LysGlu: 3.863 ± 0.804
LysPhe: 3.004 ± 1.443
LysGly: 6.009 ± 2.028
LysHis: 2.146 ± 0.803
LysIle: 3.004 ± 1.014
LysLys: 7.296 ± 3.779
LysLeu: 8.584 ± 3.344
LysMet: 2.146 ± 1.539
LysAsn: 3.863 ± 1.536
LysPro: 2.146 ± 0.942
LysGln: 1.717 ± 0.889
LysArg: 5.579 ± 2.767
LysSer: 6.009 ± 1.655
LysThr: 4.721 ± 2.308
LysVal: 3.433 ± 1.153
LysTrp: 0.429 ± 0.222
LysTyr: 0.858 ± 0.445
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.438 ± 2.352
LeuCys: 3.004 ± 0.755
LeuAsp: 4.721 ± 0.856
LeuGlu: 5.579 ± 0.613
LeuPhe: 9.013 ± 2.614
LeuGly: 9.871 ± 2.784
LeuHis: 2.146 ± 1.321
LeuIle: 6.438 ± 1.989
LeuLys: 8.155 ± 2.019
LeuLeu: 9.442 ± 4.0
LeuMet: 2.575 ± 1.193
LeuAsn: 5.579 ± 0.997
LeuPro: 2.575 ± 1.591
LeuGln: 3.863 ± 1.905
LeuArg: 5.15 ± 1.096
LeuSer: 8.584 ± 4.154
LeuThr: 5.579 ± 1.573
LeuVal: 6.009 ± 1.473
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.288 ± 1.266
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.433 ± 0.509
MetCys: 1.717 ± 0.889
MetAsp: 0.429 ± 0.222
MetGlu: 2.146 ± 1.112
MetPhe: 1.288 ± 1.709
MetGly: 0.858 ± 0.445
MetHis: 0.858 ± 0.922
MetIle: 1.717 ± 0.641
MetLys: 2.146 ± 0.824
MetLeu: 2.146 ± 1.153
MetMet: 0.858 ± 0.445
MetAsn: 1.288 ± 0.667
MetPro: 0.429 ± 0.915
MetGln: 0.429 ± 0.222
MetArg: 1.717 ± 1.279
MetSer: 2.146 ± 2.133
MetThr: 1.288 ± 0.748
MetVal: 1.288 ± 0.667
MetTrp: 0.0 ± 0.0
MetTyr: 1.288 ± 0.667
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.575 ± 1.91
AsnCys: 1.717 ± 0.764
AsnAsp: 2.575 ± 1.27
AsnGlu: 5.579 ± 2.027
AsnPhe: 2.575 ± 1.334
AsnGly: 2.575 ± 0.88
AsnHis: 2.146 ± 1.539
AsnIle: 0.429 ± 0.222
AsnLys: 3.004 ± 1.014
AsnLeu: 8.584 ± 0.6
AsnMet: 0.429 ± 0.222
AsnAsn: 0.858 ± 0.445
AsnPro: 2.146 ± 0.942
AsnGln: 2.146 ± 0.824
AsnArg: 3.004 ± 1.486
AsnSer: 2.146 ± 0.72
AsnThr: 0.858 ± 0.445
AsnVal: 3.004 ± 1.066
AsnTrp: 1.288 ± 0.748
AsnTyr: 3.004 ± 1.092
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.429 ± 0.915
ProCys: 0.0 ± 0.0
ProAsp: 3.433 ± 1.264
ProGlu: 2.575 ± 1.334
ProPhe: 1.288 ± 0.635
ProGly: 1.717 ± 0.889
ProHis: 0.0 ± 0.0
ProIle: 4.721 ± 1.743
ProLys: 2.146 ± 1.754
ProLeu: 2.146 ± 0.803
ProMet: 0.858 ± 0.922
ProAsn: 2.146 ± 0.824
ProPro: 0.429 ± 0.915
ProGln: 1.288 ± 0.748
ProArg: 1.717 ± 0.641
ProSer: 2.575 ± 1.27
ProThr: 2.575 ± 2.655
ProVal: 1.717 ± 0.764
ProTrp: 0.0 ± 0.0
ProTyr: 0.858 ± 0.445
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.433 ± 1.482
GlnCys: 1.717 ± 0.889
GlnAsp: 1.717 ± 0.641
GlnGlu: 1.717 ± 1.279
GlnPhe: 1.717 ± 0.755
GlnGly: 1.288 ± 0.667
GlnHis: 1.288 ± 0.667
GlnIle: 1.288 ± 0.635
GlnLys: 2.146 ± 0.824
GlnLeu: 3.004 ± 1.014
GlnMet: 0.0 ± 0.0
GlnAsn: 1.717 ± 1.611
GlnPro: 0.429 ± 0.827
GlnGln: 0.0 ± 0.0
GlnArg: 2.146 ± 0.595
GlnSer: 2.575 ± 1.228
GlnThr: 1.717 ± 0.755
GlnVal: 3.004 ± 0.438
GlnTrp: 0.858 ± 1.829
GlnTyr: 1.288 ± 0.667
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.579 ± 1.409
ArgCys: 2.575 ± 3.079
ArgAsp: 2.575 ± 3.105
ArgGlu: 4.292 ± 1.045
ArgPhe: 6.867 ± 1.674
ArgGly: 2.146 ± 0.824
ArgHis: 2.146 ± 0.824
ArgIle: 2.575 ± 1.334
ArgLys: 2.575 ± 1.192
ArgLeu: 4.721 ± 1.506
ArgMet: 2.146 ± 1.003
ArgAsn: 2.575 ± 2.416
ArgPro: 1.717 ± 0.641
ArgGln: 0.858 ± 0.805
ArgArg: 2.575 ± 2.1
ArgSer: 4.721 ± 2.726
ArgThr: 2.146 ± 1.112
ArgVal: 3.004 ± 1.809
ArgTrp: 0.858 ± 0.445
ArgTyr: 2.146 ± 0.803
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.004 ± 1.014
SerCys: 0.858 ± 0.703
SerAsp: 3.433 ± 1.779
SerGlu: 5.579 ± 3.405
SerPhe: 4.721 ± 1.67
SerGly: 6.438 ± 1.676
SerHis: 1.288 ± 0.667
SerIle: 5.15 ± 1.924
SerLys: 4.721 ± 1.798
SerLeu: 9.013 ± 6.806
SerMet: 0.858 ± 0.445
SerAsn: 3.004 ± 1.556
SerPro: 2.575 ± 0.941
SerGln: 3.004 ± 1.066
SerArg: 5.579 ± 2.293
SerSer: 3.863 ± 1.345
SerThr: 1.717 ± 1.094
SerVal: 3.863 ± 1.905
SerTrp: 0.858 ± 0.703
SerTyr: 1.288 ± 0.748
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.717 ± 1.733
ThrCys: 1.288 ± 1.266
ThrAsp: 2.146 ± 0.824
ThrGlu: 2.575 ± 0.473
ThrPhe: 3.863 ± 1.325
ThrGly: 6.009 ± 2.967
ThrHis: 0.858 ± 0.445
ThrIle: 3.004 ± 1.014
ThrLys: 3.004 ± 1.008
ThrLeu: 1.717 ± 0.889
ThrMet: 3.004 ± 1.486
ThrAsn: 1.288 ± 0.635
ThrPro: 0.858 ± 0.445
ThrGln: 0.429 ± 0.222
ThrArg: 1.288 ± 1.428
ThrSer: 3.004 ± 2.354
ThrThr: 3.863 ± 1.996
ThrVal: 3.004 ± 1.266
ThrTrp: 1.288 ± 0.748
ThrTyr: 0.429 ± 0.222
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.863 ± 1.564
ValCys: 1.717 ± 1.845
ValAsp: 4.721 ± 1.027
ValGlu: 5.579 ± 1.441
ValPhe: 3.433 ± 1.196
ValGly: 1.717 ± 1.595
ValHis: 1.288 ± 0.667
ValIle: 2.146 ± 1.112
ValLys: 4.292 ± 1.592
ValLeu: 5.15 ± 2.309
ValMet: 1.717 ± 1.319
ValAsn: 4.292 ± 1.592
ValPro: 3.863 ± 1.351
ValGln: 3.004 ± 1.257
ValArg: 2.575 ± 1.653
ValSer: 3.433 ± 3.286
ValThr: 4.292 ± 2.223
ValVal: 2.146 ± 0.72
ValTrp: 0.0 ± 0.0
ValTyr: 1.717 ± 0.889
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.429 ± 0.915
TrpCys: 0.0 ± 0.0
TrpAsp: 0.858 ± 0.805
TrpGlu: 0.429 ± 0.222
TrpPhe: 0.429 ± 0.222
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.858 ± 0.445
TrpLeu: 1.717 ± 0.889
TrpMet: 0.429 ± 0.222
TrpAsn: 0.858 ± 0.703
TrpPro: 0.429 ± 0.915
TrpGln: 0.0 ± 0.0
TrpArg: 0.858 ± 0.805
TrpSer: 0.429 ± 0.222
TrpThr: 0.858 ± 0.445
TrpVal: 1.717 ± 0.755
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.288 ± 0.667
TyrCys: 0.429 ± 0.222
TyrAsp: 1.288 ± 0.667
TyrGlu: 3.004 ± 1.092
TyrPhe: 1.288 ± 1.428
TyrGly: 0.858 ± 0.445
TyrHis: 0.429 ± 0.222
TyrIle: 1.717 ± 0.889
TyrLys: 0.858 ± 0.922
TyrLeu: 2.575 ± 1.27
TyrMet: 0.858 ± 0.445
TyrAsn: 1.717 ± 0.641
TyrPro: 2.575 ± 0.851
TyrGln: 1.288 ± 0.748
TyrArg: 1.717 ± 1.845
TyrSer: 2.146 ± 0.824
TyrThr: 0.429 ± 0.222
TyrVal: 1.288 ± 0.827
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2331 amino acids)

Note: The error (the ± value) was estimated by bootstrapping (×100) at the protein level.
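A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values serves as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are all assumptions; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences):
    """Dipeptide frequencies in per mille (overlapping pairs within each protein)."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_sd(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy proteome in one-letter code (hypothetical, not the viral sequences).
proteins = ["MKLLA", "AKLGG", "LLLKA"]
print(bootstrap_sd(proteins, "KL"))
print(bootstrap_sd(proteins, "WW"))  # absent from every protein, so no spread
```

A dipeptide that never occurs has a frequency of zero in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.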

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski