Amino acid dipeptide frequency for Bhendi yellow vein mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
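As an illustration of the calculation described above, the following minimal sketch counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_permille`, the one-letter alphabet, and the choice to count pairs only within each protein are assumptions made for this example; the page does not state how the database windows or normalizes its counts, so the numbers need not match the table exactly.

```python
from collections import Counter
from itertools import product

# One-letter codes in the table's column order (Ala, Cys, Asp, ...);
# X stands for Xaa, i.e. any non-standard or unknown residue.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # Overlapping windows of length 2 within one protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    # Report all 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome on this page.
    freqs = dipeptide_permille(["MAAK", "AAL"])
    print(freqs["AA"])  # 2 of the 5 observed pairs are AA
```

Multiplying a value from this dictionary by 10⁻³ gives the plain fraction, which can then be compared with the uniform expectation of 1/400.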

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 6.354 ± 1.695 | 0.794 ± 0.668 | 1.589 ± 0.938 | 0.794 ± 0.755 | 0.794 ± 0.6 | 1.589 ± 0.938 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.383 ± 1.047 | 5.56 ± 2.1 | 0.0 ± 0.0 | 1.589 ± 0.809 | 1.589 ± 0.651 | 2.383 ± 1.308 | 3.177 ± 1.402 | 4.766 ± 1.97 | 3.177 ± 2.044 | 0.794 ± 0.711 | 2.383 ± 1.058 | 3.177 ± 2.434 | 0.0 ± 0.0 |
| Cys | 0.0 ± 0.0 | 1.589 ± 1.563 | 0.794 ± 0.755 | 0.794 ± 0.668 | 0.794 ± 0.711 | 1.589 ± 0.963 | 0.794 ± 0.944 | 1.589 ± 0.904 | 0.794 ± 0.668 | 1.589 ± 1.04 | 1.589 ± 1.113 | 0.794 ± 0.6 | 2.383 ± 1.771 | 0.794 ± 0.6 | 2.383 ± 1.131 | 3.971 ± 2.682 | 1.589 ± 1.042 | 0.794 ± 0.668 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 2.383 ± 1.208 | 0.0 ± 0.0 | 2.383 ± 1.082 | 2.383 ± 0.86 | 0.794 ± 0.668 | 1.589 ± 1.199 | 0.0 ± 0.0 | 3.971 ± 1.811 | 2.383 ± 0.786 | 3.971 ± 1.879 | 0.0 ± 0.0 | 2.383 ± 1.029 | 3.971 ± 1.879 | 1.589 ± 0.865 | 2.383 ± 1.175 | 3.177 ± 0.803 | 2.383 ± 1.565 | 5.56 ± 1.706 | 1.589 ± 0.963 | 1.589 ± 0.891 | 0.0 ± 0.0 |
| Glu | 3.971 ± 0.956 | 0.794 ± 0.944 | 0.0 ± 0.0 | 5.56 ± 4.197 | 2.383 ± 1.302 | 2.383 ± 0.86 | 1.589 ± 1.085 | 0.794 ± 0.755 | 1.589 ± 0.939 | 3.971 ± 2.149 | 0.0 ± 0.0 | 4.766 ± 1.666 | 3.177 ± 1.003 | 1.589 ± 0.904 | 0.0 ± 0.0 | 3.177 ± 0.848 | 2.383 ± 1.073 | 2.383 ± 0.763 | 2.383 ± 1.297 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Phe | 0.0 ± 0.0 | 1.589 ± 0.938 | 3.177 ± 1.592 | 0.794 ± 0.668 | 2.383 ± 0.763 | 1.589 ± 1.337 | 1.589 ± 1.199 | 1.589 ± 0.939 | 3.177 ± 1.379 | 8.737 ± 2.679 | 1.589 ± 0.819 | 3.971 ± 1.408 | 2.383 ± 1.31 | 2.383 ± 1.81 | 2.383 ± 1.214 | 3.177 ± 2.795 | 2.383 ± 1.1 | 1.589 ± 0.651 | 0.0 ± 0.0 | 1.589 ± 0.904 | 0.0 ± 0.0 |
| Gly | 2.383 ± 1.308 | 3.177 ± 1.239 | 1.589 ± 1.199 | 1.589 ± 0.651 | 2.383 ± 1.365 | 3.177 ± 1.003 | 0.794 ± 0.944 | 2.383 ± 1.047 | 5.56 ± 2.407 | 3.177 ± 1.831 | 1.589 ± 1.563 | 1.589 ± 0.963 | 2.383 ± 1.058 | 1.589 ± 0.938 | 1.589 ± 1.04 | 5.56 ± 2.007 | 2.383 ± 1.082 | 3.177 ± 1.499 | 0.0 ± 0.0 | 0.794 ± 0.782 | 0.0 ± 0.0 |
| His | 1.589 ± 1.337 | 1.589 ± 1.134 | 2.383 ± 1.148 | 0.794 ± 0.6 | 2.383 ± 1.086 | 3.177 ± 1.947 | 1.589 ± 1.125 | 1.589 ± 1.033 | 1.589 ± 1.055 | 1.589 ± 0.939 | 0.0 ± 0.0 | 3.971 ± 1.903 | 3.177 ± 1.04 | 1.589 ± 0.891 | 4.766 ± 2.093 | 1.589 ± 1.187 | 2.383 ± 1.175 | 2.383 ± 1.1 | 0.0 ± 0.0 | 1.589 ± 0.809 | 0.0 ± 0.0 |
| Ile | 0.794 ± 0.755 | 2.383 ± 1.131 | 1.589 ± 1.199 | 0.0 ± 0.0 | 1.589 ± 1.199 | 2.383 ± 1.443 | 3.177 ± 1.693 | 2.383 ± 1.1 | 6.354 ± 1.817 | 4.766 ± 2.591 | 0.0 ± 0.0 | 2.383 ± 1.573 | 1.589 ± 0.939 | 3.177 ± 1.776 | 6.354 ± 2.081 | 2.383 ± 1.475 | 2.383 ± 1.602 | 3.971 ± 1.947 | 2.383 ± 1.536 | 1.589 ± 0.651 | 0.0 ± 0.0 |
| Lys | 2.383 ± 1.465 | 2.383 ± 1.308 | 1.589 ± 1.199 | 4.766 ± 1.451 | 3.177 ± 1.085 | 2.383 ± 1.07 | 2.383 ± 1.654 | 3.971 ± 1.427 | 1.589 ± 0.651 | 0.794 ± 0.755 | 0.0 ± 0.0 | 4.766 ± 1.666 | 2.383 ± 1.175 | 1.589 ± 0.904 | 3.971 ± 1.19 | 4.766 ± 1.353 | 3.177 ± 0.975 | 5.56 ± 1.617 | 0.794 ± 0.668 | 5.56 ± 1.055 | 0.0 ± 0.0 |
| Leu | 0.794 ± 0.6 | 1.589 ± 1.199 | 3.971 ± 1.761 | 5.56 ± 2.469 | 3.177 ± 1.259 | 4.766 ± 1.617 | 3.971 ± 1.889 | 3.177 ± 1.694 | 4.766 ± 1.445 | 3.971 ± 2.311 | 0.794 ± 0.668 | 5.56 ± 1.462 | 0.794 ± 0.944 | 3.177 ± 1.204 | 7.149 ± 2.289 | 6.354 ± 2.382 | 9.531 ± 2.421 | 7.943 ± 4.04 | 0.794 ± 0.816 | 5.56 ± 1.847 | 0.0 ± 0.0 |
| Met | 0.794 ± 0.668 | 0.794 ± 0.668 | 3.971 ± 1.514 | 0.0 ± 0.0 | 1.589 ± 1.337 | 1.589 ± 0.809 | 1.589 ± 1.033 | 0.794 ± 0.816 | 0.0 ± 0.0 | 1.589 ± 0.904 | 0.794 ± 0.776 | 1.589 ± 0.938 | 1.589 ± 0.939 | 0.794 ± 0.944 | 0.0 ± 0.0 | 0.794 ± 0.668 | 0.0 ± 0.0 | 0.794 ± 0.816 | 1.589 ± 0.891 | 2.383 ± 1.477 | 0.0 ± 0.0 |
| Asn | 3.177 ± 1.592 | 0.794 ± 0.944 | 2.383 ± 1.446 | 2.383 ± 1.145 | 1.589 ± 0.651 | 3.177 ± 1.04 | 3.177 ± 1.534 | 5.56 ± 2.017 | 2.383 ± 1.654 | 7.943 ± 3.346 | 4.766 ± 1.928 | 3.971 ± 1.243 | 5.56 ± 1.219 | 3.971 ± 1.27 | 3.177 ± 1.534 | 1.589 ± 0.651 | 3.177 ± 1.862 | 2.383 ± 1.086 | 0.0 ± 0.0 | 4.766 ± 1.014 | 0.0 ± 0.0 |
| Pro | 3.177 ± 1.716 | 2.383 ± 1.145 | 3.177 ± 2.286 | 1.589 ± 0.809 | 3.177 ± 1.127 | 0.794 ± 0.6 | 3.177 ± 1.821 | 3.177 ± 1.239 | 2.383 ± 1.799 | 5.56 ± 2.203 | 1.589 ± 1.033 | 4.766 ± 1.811 | 0.794 ± 0.6 | 3.971 ± 1.872 | 3.177 ± 1.446 | 4.766 ± 2.538 | 3.971 ± 0.972 | 4.766 ± 2.81 | 0.794 ± 0.755 | 2.383 ± 1.161 | 0.0 ± 0.0 |
| Gln | 3.971 ± 1.548 | 0.0 ± 0.0 | 3.177 ± 1.857 | 2.383 ± 0.786 | 1.589 ± 1.199 | 1.589 ± 1.199 | 2.383 ± 1.91 | 3.971 ± 1.688 | 0.794 ± 0.782 | 1.589 ± 1.134 | 0.0 ± 0.0 | 3.177 ± 1.313 | 3.971 ± 3.027 | 2.383 ± 1.475 | 2.383 ± 1.348 | 3.177 ± 1.055 | 3.177 ± 1.793 | 3.177 ± 0.925 | 0.0 ± 0.0 | 1.589 ± 0.947 | 0.0 ± 0.0 |
| Arg | 1.589 ± 1.337 | 2.383 ± 1.935 | 3.177 ± 1.156 | 3.177 ± 1.285 | 3.177 ± 1.003 | 3.177 ± 1.555 | 3.177 ± 1.722 | 7.149 ± 2.433 | 2.383 ± 1.477 | 3.971 ± 1.735 | 2.383 ± 1.537 | 3.177 ± 2.434 | 4.766 ± 1.156 | 1.589 ± 1.134 | 7.943 ± 3.92 | 4.766 ± 1.591 | 3.971 ± 1.447 | 5.56 ± 1.63 | 0.0 ± 0.0 | 1.589 ± 0.904 | 0.0 ± 0.0 |
| Ser | 0.794 ± 0.6 | 1.589 ± 1.113 | 3.177 ± 0.925 | 3.177 ± 1.821 | 3.971 ± 2.592 | 3.177 ± 1.204 | 0.794 ± 0.944 | 3.177 ± 1.912 | 7.149 ± 1.691 | 3.177 ± 1.727 | 0.0 ± 0.609 | 3.971 ± 1.076 | 7.943 ± 2.158 | 2.383 ± 1.07 | 7.149 ± 0.754 | 8.737 ± 3.622 | 5.56 ± 1.653 | 5.56 ± 1.995 | 0.0 ± 0.0 | 2.383 ± 1.297 | 0.0 ± 0.0 |
| Thr | 3.177 ± 0.803 | 0.0 ± 0.0 | 1.589 ± 0.938 | 2.383 ± 0.86 | 1.589 ± 0.896 | 5.56 ± 1.462 | 3.971 ± 1.19 | 0.794 ± 0.6 | 4.766 ± 1.258 | 4.766 ± 1.403 | 2.383 ± 1.082 | 6.354 ± 1.627 | 4.766 ± 0.899 | 3.971 ± 2.016 | 4.766 ± 2.349 | 3.177 ± 1.663 | 2.383 ± 1.621 | 2.383 ± 1.384 | 0.0 ± 0.0 | 0.794 ± 0.6 | 0.0 ± 0.0 |
| Val | 0.794 ± 0.755 | 0.794 ± 0.755 | 3.177 ± 1.034 | 2.383 ± 1.716 | 3.971 ± 0.992 | 0.794 ± 0.668 | 4.766 ± 2.417 | 3.971 ± 1.569 | 4.766 ± 1.701 | 8.737 ± 2.526 | 2.383 ± 1.175 | 3.177 ± 1.647 | 4.766 ± 0.96 | 3.971 ± 0.941 | 3.177 ± 2.673 | 3.971 ± 1.693 | 3.177 ± 2.673 | 3.177 ± 1.643 | 0.794 ± 0.755 | 4.766 ± 1.809 | 0.0 ± 0.0 |
| Trp | 2.383 ± 1.208 | 0.0 ± 0.0 | 0.794 ± 0.782 | 0.0 ± 0.0 | 1.589 ± 0.939 | 0.794 ± 0.6 | 0.794 ± 0.668 | 0.0 ± 0.0 | 0.794 ± 0.711 | 0.0 ± 0.0 | 0.794 ± 0.668 | 0.794 ± 0.755 | 0.0 ± 0.0 | 0.794 ± 0.6 | 0.794 ± 0.944 | 0.794 ± 0.944 | 1.589 ± 0.947 | 0.794 ± 0.6 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 2.383 ± 1.384 | 0.0 ± 0.0 | 0.794 ± 0.668 | 2.383 ± 1.552 | 3.971 ± 0.954 | 1.589 ± 0.809 | 0.0 ± 0.0 | 1.589 ± 0.865 | 2.383 ± 1.208 | 7.943 ± 2.268 | 1.589 ± 0.906 | 3.177 ± 1.499 | 1.589 ± 0.809 | 0.794 ± 0.668 | 2.383 ± 1.384 | 3.971 ± 1.863 | 0.794 ± 0.668 | 4.766 ± 1.278 | 0.0 ± 0.0 | 0.794 ± 0.944 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 7 proteins (1260 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
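To make the note concrete, here is a minimal sketch of a protein-level bootstrap: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of those estimates is reported. The helper names `pair_permille` and `bootstrap_error`, the within-protein pair counting, and the use of the population standard deviation as the error are all assumptions for illustration; the page does not say which statistic the database reports.

```python
import random
from collections import Counter
from statistics import pstdev


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency estimate under protein-level resampling.

    Assumption: the error is the standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return pstdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome on this page.
    toy = ["MAAK", "AAL", "MKLV", "AALL"]
    print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only a handful of proteins, as in the 7-protein proteome summarized here, each resample can differ substantially from the original set, which is one plausible reason the errors in the table are large relative to the frequencies.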

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski