Amino acid dipepetide frequency for Bhendi yellow vein Haryana virus [2003:Karnal]

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.039AlaAla: 4.039 ± 1.746
0.808AlaCys: 0.808 ± 0.69
0.808AlaAsp: 0.808 ± 0.69
1.616AlaGlu: 1.616 ± 1.129
0.808AlaPhe: 0.808 ± 0.589
0.808AlaGly: 0.808 ± 0.69
1.616AlaHis: 1.616 ± 0.862
1.616AlaIle: 1.616 ± 1.178
4.039AlaLys: 4.039 ± 1.084
4.847AlaLeu: 4.847 ± 1.269
0.0AlaMet: 0.0 ± 0.0
0.808AlaAsn: 0.808 ± 0.589
3.231AlaPro: 3.231 ± 1.615
2.423AlaGln: 2.423 ± 1.19
4.847AlaArg: 4.847 ± 1.629
5.654AlaSer: 5.654 ± 2.284
2.423AlaThr: 2.423 ± 2.07
1.616AlaVal: 1.616 ± 1.052
1.616AlaTrp: 1.616 ± 0.703
2.423AlaTyr: 2.423 ± 1.19
0.0AlaXaa: 0.0 ± 0.0
Cys
1.616CysAla: 1.616 ± 0.862
1.616CysCys: 1.616 ± 1.748
0.0CysAsp: 0.0 ± 0.0
0.808CysGlu: 0.808 ± 0.69
1.616CysPhe: 1.616 ± 1.295
2.423CysGly: 2.423 ± 1.411
0.808CysHis: 0.808 ± 0.681
1.616CysIle: 1.616 ± 1.077
1.616CysLys: 1.616 ± 0.703
0.808CysLeu: 0.808 ± 0.589
1.616CysMet: 1.616 ± 1.328
1.616CysAsn: 1.616 ± 0.84
1.616CysPro: 1.616 ± 1.275
0.0CysGln: 0.0 ± 0.0
0.808CysArg: 0.808 ± 0.589
2.423CysSer: 2.423 ± 0.983
0.808CysThr: 0.808 ± 0.69
0.808CysVal: 0.808 ± 0.69
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.423AspAla: 2.423 ± 1.766
0.0AspCys: 0.0 ± 0.0
3.231AspAsp: 3.231 ± 1.33
1.616AspGlu: 1.616 ± 0.703
0.808AspPhe: 0.808 ± 0.69
2.423AspGly: 2.423 ± 1.766
0.808AspHis: 0.808 ± 0.681
2.423AspIle: 2.423 ± 1.98
2.423AspLys: 2.423 ± 0.937
5.654AspLeu: 5.654 ± 2.018
0.0AspMet: 0.0 ± 0.0
0.808AspAsn: 0.808 ± 0.69
1.616AspPro: 1.616 ± 0.862
1.616AspGln: 1.616 ± 1.295
4.039AspArg: 4.039 ± 1.372
4.847AspSer: 4.847 ± 1.812
2.423AspThr: 2.423 ± 2.018
6.462AspVal: 6.462 ± 1.775
2.423AspTrp: 2.423 ± 1.28
0.808AspTyr: 0.808 ± 0.589
0.0AspXaa: 0.0 ± 0.0
Glu
4.039GluAla: 4.039 ± 1.664
0.0GluCys: 0.0 ± 0.0
0.808GluAsp: 0.808 ± 0.589
6.462GluGlu: 6.462 ± 4.71
2.423GluPhe: 2.423 ± 1.19
2.423GluGly: 2.423 ± 1.098
1.616GluHis: 1.616 ± 1.052
2.423GluIle: 2.423 ± 1.228
1.616GluLys: 1.616 ± 0.994
3.231GluLeu: 3.231 ± 1.277
0.0GluMet: 0.0 ± 0.0
4.039GluAsn: 4.039 ± 1.846
1.616GluPro: 1.616 ± 0.703
1.616GluGln: 1.616 ± 1.108
0.0GluArg: 0.0 ± 0.0
3.231GluSer: 3.231 ± 1.28
2.423GluThr: 2.423 ± 1.077
0.808GluVal: 0.808 ± 0.69
1.616GluTrp: 1.616 ± 0.84
1.616GluTyr: 1.616 ± 1.052
0.0GluXaa: 0.0 ± 0.0
Phe
0.808PheAla: 0.808 ± 0.589
0.808PheCys: 0.808 ± 0.69
2.423PheAsp: 2.423 ± 1.098
2.423PheGlu: 2.423 ± 0.83
1.616PhePhe: 1.616 ± 0.703
1.616PheGly: 1.616 ± 1.38
1.616PheHis: 1.616 ± 1.178
2.423PheIle: 2.423 ± 1.298
4.847PheLys: 4.847 ± 3.944
7.27PheLeu: 7.27 ± 2.497
0.808PheMet: 0.808 ± 0.589
1.616PheAsn: 1.616 ± 1.09
1.616PhePro: 1.616 ± 1.328
4.039PheGln: 4.039 ± 2.083
3.231PheArg: 3.231 ± 1.277
3.231PheSer: 3.231 ± 1.397
1.616PheThr: 1.616 ± 1.157
0.808PheVal: 0.808 ± 0.69
0.0PheTrp: 0.0 ± 0.0
1.616PheTyr: 1.616 ± 1.108
0.0PheXaa: 0.0 ± 0.0
Gly
1.616GlyAla: 1.616 ± 1.178
3.231GlyCys: 3.231 ± 1.41
1.616GlyAsp: 1.616 ± 1.178
2.423GlyGlu: 2.423 ± 1.294
1.616GlyPhe: 1.616 ± 1.045
2.423GlyGly: 2.423 ± 1.098
1.616GlyHis: 1.616 ± 0.911
2.423GlyIle: 2.423 ± 1.411
5.654GlyLys: 5.654 ± 2.605
4.039GlyLeu: 4.039 ± 2.447
1.616GlyMet: 1.616 ± 1.748
1.616GlyAsn: 1.616 ± 0.84
2.423GlyPro: 2.423 ± 1.098
2.423GlyGln: 2.423 ± 0.859
2.423GlyArg: 2.423 ± 1.298
2.423GlySer: 2.423 ± 0.83
0.808GlyThr: 0.808 ± 0.681
4.847GlyVal: 4.847 ± 1.867
0.0GlyTrp: 0.0 ± 0.0
0.808GlyTyr: 0.808 ± 0.874
0.0GlyXaa: 0.0 ± 0.0
His
1.616HisAla: 1.616 ± 1.38
1.616HisCys: 1.616 ± 1.045
4.039HisAsp: 4.039 ± 1.941
0.808HisGlu: 0.808 ± 0.589
2.423HisPhe: 2.423 ± 1.19
2.423HisGly: 2.423 ± 1.408
0.808HisHis: 0.808 ± 0.681
3.231HisIle: 3.231 ± 1.157
1.616HisLys: 1.616 ± 1.228
1.616HisLeu: 1.616 ± 0.994
0.808HisMet: 0.808 ± 0.898
3.231HisAsn: 3.231 ± 1.395
2.423HisPro: 2.423 ± 1.196
1.616HisGln: 1.616 ± 1.328
4.039HisArg: 4.039 ± 2.062
2.423HisSer: 2.423 ± 1.411
1.616HisThr: 1.616 ± 1.38
3.231HisVal: 3.231 ± 1.929
0.0HisTrp: 0.0 ± 0.0
0.808HisTyr: 0.808 ± 0.589
0.0HisXaa: 0.0 ± 0.0
Ile
0.808IleAla: 0.808 ± 0.589
0.808IleCys: 0.808 ± 0.589
2.423IleAsp: 2.423 ± 1.766
0.808IleGlu: 0.808 ± 0.589
1.616IlePhe: 1.616 ± 1.178
0.808IleGly: 0.808 ± 0.69
3.231IleHis: 3.231 ± 1.231
0.808IleIle: 0.808 ± 1.006
7.27IleLys: 7.27 ± 2.125
5.654IleLeu: 5.654 ± 3.793
0.0IleMet: 0.0 ± 0.0
3.231IleAsn: 3.231 ± 1.395
2.423IlePro: 2.423 ± 1.112
4.039IleGln: 4.039 ± 2.068
4.039IleArg: 4.039 ± 1.466
4.847IleSer: 4.847 ± 1.792
5.654IleThr: 5.654 ± 2.916
1.616IleVal: 1.616 ± 1.38
1.616IleTrp: 1.616 ± 1.09
3.231IleTyr: 3.231 ± 0.961
0.0IleXaa: 0.0 ± 0.0
Lys
2.423LysAla: 2.423 ± 1.972
1.616LysCys: 1.616 ± 1.052
2.423LysAsp: 2.423 ± 1.766
4.847LysGlu: 4.847 ± 2.006
3.231LysPhe: 3.231 ± 1.189
3.231LysGly: 3.231 ± 1.238
4.039LysHis: 4.039 ± 2.823
4.847LysIle: 4.847 ± 1.83
4.039LysLys: 4.039 ± 1.966
1.616LysLeu: 1.616 ± 1.295
0.808LysMet: 0.808 ± 0.898
5.654LysAsn: 5.654 ± 2.022
2.423LysPro: 2.423 ± 1.262
3.231LysGln: 3.231 ± 1.085
2.423LysArg: 2.423 ± 1.494
4.039LysSer: 4.039 ± 1.388
5.654LysThr: 5.654 ± 0.884
4.039LysVal: 4.039 ± 1.923
0.808LysTrp: 0.808 ± 0.69
4.847LysTyr: 4.847 ± 1.262
0.0LysXaa: 0.0 ± 0.0
Leu
0.0LeuAla: 0.0 ± 0.0
1.616LeuCys: 1.616 ± 1.178
4.847LeuAsp: 4.847 ± 2.22
1.616LeuGlu: 1.616 ± 1.178
0.808LeuPhe: 0.808 ± 1.006
5.654LeuGly: 5.654 ± 2.097
3.231LeuHis: 3.231 ± 1.974
4.039LeuIle: 4.039 ± 1.818
6.462LeuLys: 6.462 ± 1.879
5.654LeuLeu: 5.654 ± 4.567
0.808LeuMet: 0.808 ± 0.69
4.039LeuAsn: 4.039 ± 2.145
2.423LeuPro: 2.423 ± 2.044
2.423LeuGln: 2.423 ± 1.19
4.847LeuArg: 4.847 ± 2.53
7.27LeuSer: 7.27 ± 2.909
7.27LeuThr: 7.27 ± 1.749
8.885LeuVal: 8.885 ± 3.422
0.808LeuTrp: 0.808 ± 0.991
4.039LeuTyr: 4.039 ± 2.818
0.0LeuXaa: 0.0 ± 0.0
Met
2.423MetAla: 2.423 ± 1.262
0.808MetCys: 0.808 ± 0.69
2.423MetAsp: 2.423 ± 1.36
0.0MetGlu: 0.0 ± 0.0
2.423MetPhe: 2.423 ± 1.494
1.616MetGly: 1.616 ± 0.911
2.423MetHis: 2.423 ± 1.409
1.616MetIle: 1.616 ± 1.052
0.0MetLys: 0.0 ± 0.0
2.423MetLeu: 2.423 ± 1.36
1.616MetMet: 1.616 ± 1.219
0.808MetAsn: 0.808 ± 0.69
1.616MetPro: 1.616 ± 1.431
0.808MetGln: 0.808 ± 0.681
0.0MetArg: 0.0 ± 0.0
0.808MetSer: 0.808 ± 0.991
0.0MetThr: 0.0 ± 0.0
0.808MetVal: 0.808 ± 0.991
1.616MetTrp: 1.616 ± 0.862
2.423MetTyr: 2.423 ± 1.522
0.0MetXaa: 0.0 ± 0.0
Asn
4.039AsnAla: 4.039 ± 1.486
0.808AsnCys: 0.808 ± 0.681
2.423AsnAsp: 2.423 ± 1.28
1.616AsnGlu: 1.616 ± 1.108
2.423AsnPhe: 2.423 ± 1.417
0.808AsnGly: 0.808 ± 0.991
3.231AsnHis: 3.231 ± 1.501
2.423AsnIle: 2.423 ± 1.098
3.231AsnLys: 3.231 ± 1.987
5.654AsnLeu: 5.654 ± 3.034
2.423AsnMet: 2.423 ± 2.051
4.039AsnAsn: 4.039 ± 0.931
3.231AsnPro: 3.231 ± 1.244
2.423AsnGln: 2.423 ± 1.293
2.423AsnArg: 2.423 ± 1.262
4.039AsnSer: 4.039 ± 1.653
3.231AsnThr: 3.231 ± 1.679
4.039AsnVal: 4.039 ± 1.288
0.0AsnTrp: 0.0 ± 0.0
4.039AsnTyr: 4.039 ± 1.664
0.0AsnXaa: 0.0 ± 0.0
Pro
2.423ProAla: 2.423 ± 1.417
2.423ProCys: 2.423 ± 1.181
3.231ProAsp: 3.231 ± 2.705
1.616ProGlu: 1.616 ± 1.178
3.231ProPhe: 3.231 ± 1.37
0.808ProGly: 0.808 ± 0.681
2.423ProHis: 2.423 ± 1.766
3.231ProIle: 3.231 ± 1.113
2.423ProLys: 2.423 ± 1.766
3.231ProLeu: 3.231 ± 1.301
1.616ProMet: 1.616 ± 1.077
3.231ProAsn: 3.231 ± 1.353
3.231ProPro: 3.231 ± 1.837
4.847ProGln: 4.847 ± 2.294
3.231ProArg: 3.231 ± 1.402
4.039ProSer: 4.039 ± 1.52
2.423ProThr: 2.423 ± 1.376
4.039ProVal: 4.039 ± 1.608
0.0ProTrp: 0.0 ± 0.0
1.616ProTyr: 1.616 ± 1.077
0.0ProXaa: 0.0 ± 0.0
Gln
3.231GlnAla: 3.231 ± 1.402
1.616GlnCys: 1.616 ± 0.994
0.808GlnAsp: 0.808 ± 0.69
2.423GlnGlu: 2.423 ± 1.36
2.423GlnPhe: 2.423 ± 1.376
0.808GlnGly: 0.808 ± 0.589
3.231GlnHis: 3.231 ± 2.294
1.616GlnIle: 1.616 ± 1.052
2.423GlnLys: 2.423 ± 0.983
1.616GlnLeu: 1.616 ± 0.862
0.0GlnMet: 0.0 ± 0.0
4.039GlnAsn: 4.039 ± 2.533
4.847GlnPro: 4.847 ± 2.39
3.231GlnGln: 3.231 ± 1.53
0.808GlnArg: 0.808 ± 0.589
4.039GlnSer: 4.039 ± 1.392
2.423GlnThr: 2.423 ± 1.228
4.039GlnVal: 4.039 ± 1.657
0.808GlnTrp: 0.808 ± 0.589
1.616GlnTyr: 1.616 ± 1.09
0.0GlnXaa: 0.0 ± 0.0
Arg
2.423ArgAla: 2.423 ± 1.294
2.423ArgCys: 2.423 ± 0.983
3.231ArgAsp: 3.231 ± 1.402
4.039ArgGlu: 4.039 ± 1.875
2.423ArgPhe: 2.423 ± 1.098
3.231ArgGly: 3.231 ± 1.275
3.231ArgHis: 3.231 ± 1.881
5.654ArgIle: 5.654 ± 1.761
3.231ArgLys: 3.231 ± 1.601
1.616ArgLeu: 1.616 ± 1.021
1.616ArgMet: 1.616 ± 1.38
1.616ArgAsn: 1.616 ± 1.129
4.039ArgPro: 4.039 ± 1.372
1.616ArgGln: 1.616 ± 1.045
6.462ArgArg: 6.462 ± 2.649
4.847ArgSer: 4.847 ± 1.458
4.847ArgThr: 4.847 ± 2.873
6.462ArgVal: 6.462 ± 2.38
0.0ArgTrp: 0.0 ± 0.0
1.616ArgTyr: 1.616 ± 1.108
0.0ArgXaa: 0.0 ± 0.0
Ser
3.231SerAla: 3.231 ± 1.668
0.808SerCys: 0.808 ± 0.991
3.231SerAsp: 3.231 ± 0.974
2.423SerGlu: 2.423 ± 1.19
2.423SerPhe: 2.423 ± 0.83
3.231SerGly: 3.231 ± 1.301
1.616SerHis: 1.616 ± 1.157
4.847SerIle: 4.847 ± 2.275
4.847SerLys: 4.847 ± 1.376
4.039SerLeu: 4.039 ± 1.453
4.039SerMet: 4.039 ± 2.64
7.27SerAsn: 7.27 ± 2.039
8.078SerPro: 8.078 ± 2.369
3.231SerGln: 3.231 ± 1.455
10.501SerArg: 10.501 ± 2.016
6.462SerSer: 6.462 ± 2.22
5.654SerThr: 5.654 ± 2.971
4.039SerVal: 4.039 ± 2.001
0.0SerTrp: 0.0 ± 0.0
1.616SerTyr: 1.616 ± 0.84
0.0SerXaa: 0.0 ± 0.0
Thr
2.423ThrAla: 2.423 ± 0.83
0.0ThrCys: 0.0 ± 0.0
1.616ThrAsp: 1.616 ± 1.318
1.616ThrGlu: 1.616 ± 1.021
1.616ThrPhe: 1.616 ± 1.318
5.654ThrGly: 5.654 ± 1.586
2.423ThrHis: 2.423 ± 1.429
1.616ThrIle: 1.616 ± 1.052
4.847ThrLys: 4.847 ± 1.272
3.231ThrLeu: 3.231 ± 1.189
3.231ThrMet: 3.231 ± 1.278
4.039ThrAsn: 4.039 ± 1.84
2.423ThrPro: 2.423 ± 0.83
2.423ThrGln: 2.423 ± 2.165
2.423ThrArg: 2.423 ± 1.163
8.885ThrSer: 8.885 ± 3.592
1.616ThrThr: 1.616 ± 1.228
5.654ThrVal: 5.654 ± 2.111
1.616ThrTrp: 1.616 ± 1.318
0.808ThrTyr: 0.808 ± 0.589
0.0ThrXaa: 0.0 ± 0.0
Val
1.616ValAla: 1.616 ± 0.882
0.808ValCys: 0.808 ± 0.589
4.847ValAsp: 4.847 ± 1.826
2.423ValGlu: 2.423 ± 1.112
6.462ValPhe: 6.462 ± 1.276
3.231ValGly: 3.231 ± 1.74
1.616ValHis: 1.616 ± 1.108
6.462ValIle: 6.462 ± 2.823
4.847ValLys: 4.847 ± 1.941
8.885ValLeu: 8.885 ± 2.372
2.423ValMet: 2.423 ± 1.262
2.423ValAsn: 2.423 ± 1.951
3.231ValPro: 3.231 ± 1.113
2.423ValGln: 2.423 ± 1.181
4.039ValArg: 4.039 ± 2.732
4.039ValSer: 4.039 ± 1.765
3.231ValThr: 3.231 ± 2.76
8.078ValVal: 8.078 ± 2.985
0.0ValTrp: 0.0 ± 0.0
4.039ValTyr: 4.039 ± 1.872
0.0ValXaa: 0.0 ± 0.0
Trp
1.616TrpAla: 1.616 ± 1.178
0.808TrpCys: 0.808 ± 0.898
0.808TrpAsp: 0.808 ± 0.874
0.808TrpGlu: 0.808 ± 1.006
0.808TrpPhe: 0.808 ± 0.991
0.808TrpGly: 0.808 ± 0.589
0.808TrpHis: 0.808 ± 0.69
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.808TrpMet: 0.808 ± 0.69
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.808TrpGln: 0.808 ± 0.589
0.808TrpArg: 0.808 ± 0.681
0.808TrpSer: 0.808 ± 0.681
1.616TrpThr: 1.616 ± 1.09
0.808TrpVal: 0.808 ± 0.589
0.0TrpTrp: 0.0 ± 0.0
0.808TrpTyr: 0.808 ± 0.589
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.231TyrAla: 3.231 ± 1.402
0.0TyrCys: 0.0 ± 0.0
1.616TyrAsp: 1.616 ± 1.108
1.616TyrGlu: 1.616 ± 1.108
3.231TyrPhe: 3.231 ± 0.99
1.616TyrGly: 1.616 ± 0.862
0.0TyrHis: 0.0 ± 0.0
1.616TyrIle: 1.616 ± 1.052
0.808TyrLys: 0.808 ± 0.589
5.654TyrLeu: 5.654 ± 1.46
1.616TyrMet: 1.616 ± 1.129
2.423TyrAsn: 2.423 ± 0.977
0.808TyrPro: 0.808 ± 0.589
0.808TyrGln: 0.808 ± 0.69
3.231TyrArg: 3.231 ± 1.746
3.231TyrSer: 3.231 ± 1.287
2.423TyrThr: 2.423 ± 1.135
4.039TyrVal: 4.039 ± 1.372
0.0TyrTrp: 0.0 ± 0.0
0.808TyrTyr: 0.808 ± 0.681
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (1239 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski