Amino acid dipepetide frequency for Bhendi yellow vein Delhi virus [2004:New Delhi]

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.055AlaAla: 4.055 ± 2.411
0.811AlaCys: 0.811 ± 0.699
1.622AlaAsp: 1.622 ± 0.917
0.811AlaGlu: 0.811 ± 0.644
0.0AlaPhe: 0.0 ± 0.0
2.433AlaGly: 2.433 ± 0.976
0.811AlaHis: 0.811 ± 0.924
1.622AlaIle: 1.622 ± 0.75
4.866AlaLys: 4.866 ± 1.267
6.488AlaLeu: 6.488 ± 2.71
0.0AlaMet: 0.0 ± 0.0
3.244AlaAsn: 3.244 ± 1.841
1.622AlaPro: 1.622 ± 1.288
3.244AlaGln: 3.244 ± 1.236
6.488AlaArg: 6.488 ± 1.934
2.433AlaSer: 2.433 ± 2.096
3.244AlaThr: 3.244 ± 2.038
3.244AlaVal: 3.244 ± 1.5
1.622AlaTrp: 1.622 ± 0.75
0.811AlaTyr: 0.811 ± 0.644
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.622CysGlu: 1.622 ± 1.397
1.622CysPhe: 1.622 ± 1.237
1.622CysGly: 1.622 ± 0.886
0.811CysHis: 0.811 ± 0.903
0.0CysIle: 0.0 ± 0.0
0.811CysLys: 0.811 ± 0.699
0.811CysLeu: 0.811 ± 0.699
2.433CysMet: 2.433 ± 1.179
1.622CysAsn: 1.622 ± 0.886
1.622CysPro: 1.622 ± 1.612
0.811CysGln: 0.811 ± 0.644
1.622CysArg: 1.622 ± 0.75
3.244CysSer: 3.244 ± 1.265
3.244CysThr: 3.244 ± 1.992
0.811CysVal: 0.811 ± 0.699
0.0CysTrp: 0.0 ± 0.0
0.811CysTyr: 0.811 ± 0.644
0.0CysXaa: 0.0 ± 0.0
Asp
2.433AspAla: 2.433 ± 1.932
0.0AspCys: 0.0 ± 0.0
2.433AspAsp: 2.433 ± 1.081
3.244AspGlu: 3.244 ± 0.954
0.811AspPhe: 0.811 ± 0.699
1.622AspGly: 1.622 ± 1.288
0.0AspHis: 0.0 ± 0.0
3.244AspIle: 3.244 ± 1.882
2.433AspLys: 2.433 ± 0.845
5.677AspLeu: 5.677 ± 2.154
0.0AspMet: 0.0 ± 0.0
0.811AspAsn: 0.811 ± 0.699
2.433AspPro: 2.433 ± 1.1
4.055AspGln: 4.055 ± 1.449
2.433AspArg: 2.433 ± 1.299
4.866AspSer: 4.866 ± 2.094
1.622AspThr: 1.622 ± 0.982
4.055AspVal: 4.055 ± 1.345
1.622AspTrp: 1.622 ± 0.886
0.811AspTyr: 0.811 ± 0.644
0.0AspXaa: 0.0 ± 0.0
Glu
4.866GluAla: 4.866 ± 1.267
0.811GluCys: 0.811 ± 0.903
0.811GluAsp: 0.811 ± 0.903
4.866GluGlu: 4.866 ± 3.865
3.244GluPhe: 3.244 ± 2.02
4.055GluGly: 4.055 ± 1.017
1.622GluHis: 1.622 ± 1.262
0.811GluIle: 0.811 ± 0.699
2.433GluLys: 2.433 ± 1.227
3.244GluLeu: 3.244 ± 1.841
0.0GluMet: 0.0 ± 0.0
4.055GluAsn: 4.055 ± 2.003
3.244GluPro: 3.244 ± 1.272
0.811GluGln: 0.811 ± 0.699
0.811GluArg: 0.811 ± 0.924
4.055GluSer: 4.055 ± 1.073
2.433GluThr: 2.433 ± 1.228
1.622GluVal: 1.622 ± 0.75
2.433GluTrp: 2.433 ± 1.259
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
3.244PheAsp: 3.244 ± 1.5
0.811PheGlu: 0.811 ± 0.644
1.622PhePhe: 1.622 ± 0.75
0.811PheGly: 0.811 ± 0.699
1.622PheHis: 1.622 ± 1.288
3.244PheIle: 3.244 ± 1.323
3.244PheLys: 3.244 ± 2.616
8.11PheLeu: 8.11 ± 1.966
0.811PheMet: 0.811 ± 0.644
3.244PheAsn: 3.244 ± 1.341
1.622PhePro: 1.622 ± 1.221
1.622PheGln: 1.622 ± 0.886
3.244PheArg: 3.244 ± 2.009
3.244PheSer: 3.244 ± 1.777
1.622PheThr: 1.622 ± 0.921
1.622PheVal: 1.622 ± 0.97
0.0PheTrp: 0.0 ± 0.0
0.811PheTyr: 0.811 ± 0.699
0.0PheXaa: 0.0 ± 0.0
Gly
3.244GlyAla: 3.244 ± 1.789
2.433GlyCys: 2.433 ± 1.253
2.433GlyAsp: 2.433 ± 1.325
4.866GlyGlu: 4.866 ± 1.143
2.433GlyPhe: 2.433 ± 1.93
2.433GlyGly: 2.433 ± 1.211
1.622GlyHis: 1.622 ± 0.886
2.433GlyIle: 2.433 ± 1.669
4.055GlyLys: 4.055 ± 1.909
2.433GlyLeu: 2.433 ± 1.443
1.622GlyMet: 1.622 ± 1.221
0.811GlyAsn: 0.811 ± 0.989
3.244GlyPro: 3.244 ± 1.147
2.433GlyGln: 2.433 ± 0.976
1.622GlyArg: 1.622 ± 0.927
5.677GlySer: 5.677 ± 1.639
1.622GlyThr: 1.622 ± 1.235
2.433GlyVal: 2.433 ± 2.773
0.0GlyTrp: 0.0 ± 0.0
0.811GlyTyr: 0.811 ± 0.806
0.0GlyXaa: 0.0 ± 0.0
His
1.622HisAla: 1.622 ± 1.397
2.433HisCys: 2.433 ± 1.914
0.811HisAsp: 0.811 ± 0.903
1.622HisGlu: 1.622 ± 0.982
3.244HisPhe: 3.244 ± 2.02
2.433HisGly: 2.433 ± 1.465
2.433HisHis: 2.433 ± 1.465
2.433HisIle: 2.433 ± 2.59
1.622HisLys: 1.622 ± 1.145
3.244HisLeu: 3.244 ± 1.423
0.0HisMet: 0.0 ± 0.0
2.433HisAsn: 2.433 ± 1.259
3.244HisPro: 3.244 ± 1.217
0.811HisGln: 0.811 ± 0.863
3.244HisArg: 3.244 ± 1.956
0.811HisSer: 0.811 ± 0.924
2.433HisThr: 2.433 ± 2.096
2.433HisVal: 2.433 ± 1.73
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.811IleAla: 0.811 ± 0.699
2.433IleCys: 2.433 ± 0.976
2.433IleAsp: 2.433 ± 1.932
1.622IleGlu: 1.622 ± 1.612
2.433IlePhe: 2.433 ± 1.932
0.0IleGly: 0.0 ± 0.0
0.811IleHis: 0.811 ± 0.903
2.433IleIle: 2.433 ± 1.453
6.488IleLys: 6.488 ± 1.827
7.299IleLeu: 7.299 ± 4.528
0.0IleMet: 0.0 ± 0.0
3.244IleAsn: 3.244 ± 1.791
1.622IlePro: 1.622 ± 0.927
4.055IleGln: 4.055 ± 1.731
4.055IleArg: 4.055 ± 1.121
5.677IleSer: 5.677 ± 2.812
2.433IleThr: 2.433 ± 1.107
1.622IleVal: 1.622 ± 1.727
2.433IleTrp: 2.433 ± 1.891
1.622IleTyr: 1.622 ± 0.97
0.0IleXaa: 0.0 ± 0.0
Lys
3.244LysAla: 3.244 ± 2.17
2.433LysCys: 2.433 ± 1.294
1.622LysAsp: 1.622 ± 1.288
4.055LysGlu: 4.055 ± 1.659
2.433LysPhe: 2.433 ± 0.912
2.433LysGly: 2.433 ± 1.1
3.244LysHis: 3.244 ± 2.495
4.055LysIle: 4.055 ± 1.257
1.622LysLys: 1.622 ± 0.75
1.622LysLeu: 1.622 ± 1.237
0.0LysMet: 0.0 ± 0.0
6.488LysAsn: 6.488 ± 2.544
2.433LysPro: 2.433 ± 1.299
0.811LysGln: 0.811 ± 0.989
1.622LysArg: 1.622 ± 1.073
5.677LysSer: 5.677 ± 1.348
3.244LysThr: 3.244 ± 0.943
4.866LysVal: 4.866 ± 1.216
0.811LysTrp: 0.811 ± 0.699
4.866LysTyr: 4.866 ± 1.073
0.0LysXaa: 0.0 ± 0.0
Leu
2.433LeuAla: 2.433 ± 1.452
2.433LeuCys: 2.433 ± 1.211
4.866LeuAsp: 4.866 ± 2.079
4.866LeuGlu: 4.866 ± 2.517
2.433LeuPhe: 2.433 ± 1.494
8.11LeuGly: 8.11 ± 3.078
4.866LeuHis: 4.866 ± 1.936
4.055LeuIle: 4.055 ± 1.258
6.488LeuLys: 6.488 ± 1.603
3.244LeuLeu: 3.244 ± 2.249
0.811LeuMet: 0.811 ± 0.699
4.866LeuAsn: 4.866 ± 0.94
0.811LeuPro: 0.811 ± 0.903
4.866LeuGln: 4.866 ± 1.716
4.866LeuArg: 4.866 ± 2.214
4.055LeuSer: 4.055 ± 1.621
8.921LeuThr: 8.921 ± 3.891
8.11LeuVal: 8.11 ± 2.464
0.811LeuTrp: 0.811 ± 0.806
4.866LeuTyr: 4.866 ± 2.546
0.0LeuXaa: 0.0 ± 0.0
Met
2.433MetAla: 2.433 ± 1.299
0.0MetCys: 0.0 ± 0.0
4.055MetAsp: 4.055 ± 2.129
0.0MetGlu: 0.0 ± 0.0
1.622MetPhe: 1.622 ± 1.397
2.433MetGly: 2.433 ± 1.234
1.622MetHis: 1.622 ± 0.97
0.811MetIle: 0.811 ± 0.863
0.0MetLys: 0.0 ± 0.0
2.433MetLeu: 2.433 ± 1.418
0.0MetMet: 0.0 ± 0.0
0.811MetAsn: 0.811 ± 0.699
0.811MetPro: 0.811 ± 0.863
0.811MetGln: 0.811 ± 0.903
0.811MetArg: 0.811 ± 0.924
0.811MetSer: 0.811 ± 0.989
0.0MetThr: 0.0 ± 0.0
0.811MetVal: 0.811 ± 0.863
1.622MetTrp: 1.622 ± 0.982
2.433MetTyr: 2.433 ± 1.574
0.0MetXaa: 0.0 ± 0.0
Asn
4.866AsnAla: 4.866 ± 2.391
0.811AsnCys: 0.811 ± 0.903
1.622AsnAsp: 1.622 ± 1.288
2.433AsnGlu: 2.433 ± 1.856
0.811AsnPhe: 0.811 ± 0.699
4.055AsnGly: 4.055 ± 2.029
2.433AsnHis: 2.433 ± 1.57
2.433AsnIle: 2.433 ± 0.845
1.622AsnLys: 1.622 ± 0.927
8.921AsnLeu: 8.921 ± 3.254
4.055AsnMet: 4.055 ± 2.106
3.244AsnAsn: 3.244 ± 1.453
5.677AsnPro: 5.677 ± 2.123
4.055AsnGln: 4.055 ± 1.159
1.622AsnArg: 1.622 ± 1.397
2.433AsnSer: 2.433 ± 1.211
1.622AsnThr: 1.622 ± 0.976
4.055AsnVal: 4.055 ± 1.866
0.811AsnTrp: 0.811 ± 0.644
4.866AsnTyr: 4.866 ± 1.59
0.0AsnXaa: 0.0 ± 0.0
Pro
1.622ProAla: 1.622 ± 1.397
0.811ProCys: 0.811 ± 0.699
2.433ProAsp: 2.433 ± 1.179
1.622ProGlu: 1.622 ± 0.982
2.433ProPhe: 2.433 ± 1.107
0.811ProGly: 0.811 ± 0.644
3.244ProHis: 3.244 ± 2.02
2.433ProIle: 2.433 ± 1.856
3.244ProLys: 3.244 ± 2.02
6.488ProLeu: 6.488 ± 2.434
1.622ProMet: 1.622 ± 0.97
5.677ProAsn: 5.677 ± 2.465
2.433ProPro: 2.433 ± 1.294
4.866ProGln: 4.866 ± 2.66
4.055ProArg: 4.055 ± 1.628
6.488ProSer: 6.488 ± 3.11
4.866ProThr: 4.866 ± 2.061
2.433ProVal: 2.433 ± 1.299
0.0ProTrp: 0.0 ± 0.0
0.811ProTyr: 0.811 ± 0.699
0.0ProXaa: 0.0 ± 0.0
Gln
4.055GlnAla: 4.055 ± 3.037
0.0GlnCys: 0.0 ± 0.0
1.622GlnAsp: 1.622 ± 0.917
3.244GlnGlu: 3.244 ± 1.19
2.433GlnPhe: 2.433 ± 1.294
1.622GlnGly: 1.622 ± 1.288
2.433GlnHis: 2.433 ± 1.573
2.433GlnIle: 2.433 ± 1.259
0.0GlnLys: 0.0 ± 0.0
2.433GlnLeu: 2.433 ± 1.1
0.811GlnMet: 0.811 ± 0.989
4.866GlnAsn: 4.866 ± 2.2
3.244GlnPro: 3.244 ± 2.083
3.244GlnGln: 3.244 ± 0.954
4.055GlnArg: 4.055 ± 1.327
4.866GlnSer: 4.866 ± 1.557
2.433GlnThr: 2.433 ± 1.294
3.244GlnVal: 3.244 ± 0.986
0.0GlnTrp: 0.0 ± 0.0
2.433GlnTyr: 2.433 ± 1.444
0.0GlnXaa: 0.0 ± 0.0
Arg
2.433ArgAla: 2.433 ± 1.295
2.433ArgCys: 2.433 ± 1.175
3.244ArgAsp: 3.244 ± 1.282
4.055ArgGlu: 4.055 ± 1.017
1.622ArgPhe: 1.622 ± 1.288
3.244ArgGly: 3.244 ± 1.325
0.811ArgHis: 0.811 ± 0.806
9.732ArgIle: 9.732 ± 3.02
2.433ArgLys: 2.433 ± 1.574
2.433ArgLeu: 2.433 ± 1.352
2.433ArgMet: 2.433 ± 1.453
0.811ArgAsn: 0.811 ± 0.863
4.055ArgPro: 4.055 ± 2.003
1.622ArgGln: 1.622 ± 1.158
6.488ArgArg: 6.488 ± 3.917
5.677ArgSer: 5.677 ± 1.234
4.866ArgThr: 4.866 ± 2.013
4.055ArgVal: 4.055 ± 2.231
0.0ArgTrp: 0.0 ± 0.0
1.622ArgTyr: 1.622 ± 0.917
0.0ArgXaa: 0.0 ± 0.0
Ser
3.244SerAla: 3.244 ± 1.236
4.866SerCys: 4.866 ± 1.717
4.055SerAsp: 4.055 ± 1.134
2.433SerGlu: 2.433 ± 1.259
3.244SerPhe: 3.244 ± 1.736
2.433SerGly: 2.433 ± 1.259
2.433SerHis: 2.433 ± 1.494
3.244SerIle: 3.244 ± 1.805
4.866SerLys: 4.866 ± 1.296
4.866SerLeu: 4.866 ± 1.544
1.622SerMet: 1.622 ± 1.721
4.055SerAsn: 4.055 ± 1.017
9.732SerPro: 9.732 ± 2.974
4.055SerGln: 4.055 ± 1.363
6.488SerArg: 6.488 ± 1.025
12.976SerSer: 12.976 ± 4.936
6.488SerThr: 6.488 ± 2.346
4.055SerVal: 4.055 ± 2.62
0.0SerTrp: 0.0 ± 0.0
0.811SerTyr: 0.811 ± 0.644
0.0SerXaa: 0.0 ± 0.0
Thr
4.055ThrAla: 4.055 ± 1.391
0.811ThrCys: 0.811 ± 0.989
0.811ThrAsp: 0.811 ± 0.644
1.622ThrGlu: 1.622 ± 1.148
1.622ThrPhe: 1.622 ± 1.382
4.055ThrGly: 4.055 ± 1.68
2.433ThrHis: 2.433 ± 1.57
1.622ThrIle: 1.622 ± 0.927
4.055ThrLys: 4.055 ± 1.345
4.866ThrLeu: 4.866 ± 1.143
3.244ThrMet: 3.244 ± 1.8
5.677ThrAsn: 5.677 ± 2.191
5.677ThrPro: 5.677 ± 1.681
2.433ThrGln: 2.433 ± 1.54
4.055ThrArg: 4.055 ± 2.098
4.866ThrSer: 4.866 ± 2.651
3.244ThrThr: 3.244 ± 2.299
2.433ThrVal: 2.433 ± 1.418
0.811ThrTrp: 0.811 ± 0.989
1.622ThrTyr: 1.622 ± 0.886
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
0.0ValCys: 0.0 ± 0.0
4.055ValAsp: 4.055 ± 1.006
1.622ValGlu: 1.622 ± 1.085
4.055ValPhe: 4.055 ± 2.089
0.811ValGly: 0.811 ± 0.806
2.433ValHis: 2.433 ± 1.107
4.866ValIle: 4.866 ± 2.478
4.866ValLys: 4.866 ± 1.763
7.299ValLeu: 7.299 ± 1.985
1.622ValMet: 1.622 ± 0.876
3.244ValAsn: 3.244 ± 1.853
4.055ValPro: 4.055 ± 1.334
3.244ValGln: 3.244 ± 0.954
3.244ValArg: 3.244 ± 2.038
3.244ValSer: 3.244 ± 1.841
4.055ValThr: 4.055 ± 2.717
3.244ValVal: 3.244 ± 1.243
0.0ValTrp: 0.0 ± 0.0
3.244ValTyr: 3.244 ± 1.5
0.0ValXaa: 0.0 ± 0.0
Trp
1.622TrpAla: 1.622 ± 1.288
0.0TrpCys: 0.0 ± 0.0
0.811TrpAsp: 0.811 ± 0.806
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.811TrpGly: 0.811 ± 0.644
0.811TrpHis: 0.811 ± 0.699
0.0TrpIle: 0.0 ± 0.0
0.811TrpLys: 0.811 ± 0.924
0.0TrpLeu: 0.0 ± 0.0
0.811TrpMet: 0.811 ± 0.699
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.811TrpGln: 0.811 ± 0.644
0.811TrpArg: 0.811 ± 0.903
2.433TrpSer: 2.433 ± 1.474
1.622TrpThr: 1.622 ± 1.085
0.811TrpVal: 0.811 ± 0.644
0.0TrpTrp: 0.0 ± 0.0
0.811TrpTyr: 0.811 ± 0.644
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.433TyrAla: 2.433 ± 1.299
0.0TyrCys: 0.0 ± 0.0
1.622TyrAsp: 1.622 ± 0.917
0.811TyrGlu: 0.811 ± 0.699
2.433TyrPhe: 2.433 ± 0.912
2.433TyrGly: 2.433 ± 0.903
0.811TyrHis: 0.811 ± 0.806
1.622TyrIle: 1.622 ± 0.976
1.622TyrLys: 1.622 ± 1.288
4.866TyrLeu: 4.866 ± 1.89
1.622TyrMet: 1.622 ± 1.02
3.244TyrAsn: 3.244 ± 1.676
0.811TyrPro: 0.811 ± 0.644
0.811TyrGln: 0.811 ± 0.699
2.433TyrArg: 2.433 ± 1.453
3.244TyrSer: 3.244 ± 1.388
0.0TyrThr: 0.0 ± 0.0
3.244TyrVal: 3.244 ± 1.147
0.0TyrTrp: 0.0 ± 0.0
0.811TyrTyr: 0.811 ± 0.903
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (1234 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski