Amino acid dipeptide frequency for Ganda bee virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
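As a minimal sketch of how such frequencies could be computed (not necessarily the way Proteome-pI does it), the snippet below counts overlapping residue pairs and reports them in per mille. The function name dipeptide_frequencies and the toy sequences are hypothetical. It assumes one-letter residue codes and that pairs are not counted across protein boundaries; the page does not say how boundaries are handled.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs (0,1), (1,2), ...; assumption: never across proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not the real proteome).
freqs = dipeptide_frequencies(["MKLLK", "AKL"])
```

Because each pair is ordered, "KL" and "LK" are counted separately, which is why the table has distinct LysLeu and LeuLys entries.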

There are 400 possible dipeptides (441 if Xaa is counted as a 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly and uniformly in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
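The expectation above can be sketched in a few lines. This is an illustration only: the page does not state the exact rule used for colouring, so the plain comparison against the uniform expectation in the hypothetical function classify is an assumption.

```python
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa

# Uniform expectation: every ordered pair is equally likely.
expected_fraction = 1 / N_STANDARD**2          # 1/400 = 0.25%
expected_permille = 1000 * expected_fraction   # 2.5 per mille

def classify(freq_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"    # would be red on the original page
    if freq_permille < expected:
        return "under"   # would be blue on the original page
    return "expected"

# Two values taken from the table: LeuLys 11.397, AlaAla 1.303 (per mille).
labels = {"LeuLys": classify(11.397), "AlaAla": classify(1.303)}
```

With 21 symbols instead of 20, the same calculation gives 1000/441, roughly 2.27‰.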

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
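For illustration, the unit conversion could look like this. The helper names are hypothetical, and the example value is the LeuLys entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10

leu_lys_fraction = permille_to_fraction(11.397)  # about 0.011397
leu_lys_percent = permille_to_percent(11.397)    # about 1.1397 %
```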

For more information, see the sequence space article on Wikipedia.

Residues: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by first residue and listed as dipeptide: frequency ± error (‰).
Ala
AlaAla: 1.303 ± 1.715
AlaCys: 2.279 ± 0.421
AlaAsp: 2.605 ± 0.944
AlaGlu: 1.628 ± 1.875
AlaPhe: 1.303 ± 0.938
AlaGly: 1.628 ± 0.496
AlaHis: 0.651 ± 0.588
AlaIle: 4.559 ± 0.472
AlaLys: 3.256 ± 0.733
AlaLeu: 2.931 ± 0.949
AlaMet: 0.977 ± 0.518
AlaAsn: 2.605 ± 0.988
AlaPro: 0.651 ± 0.247
AlaGln: 0.977 ± 0.518
AlaArg: 1.954 ± 1.099
AlaSer: 1.954 ± 0.282
AlaThr: 3.908 ± 2.044
AlaVal: 2.279 ± 2.065
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.605 ± 0.826
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 2.931 ± 1.589
CysGlu: 0.326 ± 0.363
CysPhe: 0.977 ± 0.694
CysGly: 0.651 ± 0.725
CysHis: 0.651 ± 0.725
CysIle: 1.303 ± 0.314
CysLys: 1.628 ± 0.766
CysLeu: 0.977 ± 0.518
CysMet: 0.0 ± 0.319
CysAsn: 1.303 ± 0.691
CysPro: 1.303 ± 0.314
CysGln: 0.326 ± 0.363
CysArg: 0.977 ± 0.596
CysSer: 2.931 ± 0.924
CysThr: 1.628 ± 0.837
CysVal: 0.326 ± 0.173
CysTrp: 0.326 ± 0.363
CysTyr: 1.628 ± 0.837
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.605 ± 0.826
AspCys: 0.651 ± 0.725
AspAsp: 2.931 ± 0.463
AspGlu: 3.582 ± 1.286
AspPhe: 3.582 ± 0.662
AspGly: 1.303 ± 0.314
AspHis: 0.977 ± 0.224
AspIle: 6.187 ± 0.463
AspLys: 5.21 ± 0.664
AspLeu: 7.164 ± 0.995
AspMet: 1.954 ± 0.579
AspAsn: 6.838 ± 1.975
AspPro: 0.977 ± 0.518
AspGln: 1.303 ± 0.691
AspArg: 3.582 ± 0.265
AspSer: 0.977 ± 0.518
AspThr: 1.954 ± 0.448
AspVal: 2.605 ± 0.937
AspTrp: 0.0 ± 0.0
AspTyr: 4.233 ± 0.506
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.977 ± 0.224
GluCys: 1.303 ± 0.955
GluAsp: 5.21 ± 0.664
GluGlu: 4.233 ± 0.618
GluPhe: 4.559 ± 1.315
GluGly: 2.931 ± 1.112
GluHis: 1.628 ± 0.864
GluIle: 3.908 ± 0.895
GluLys: 4.559 ± 0.79
GluLeu: 7.489 ± 1.218
GluMet: 2.605 ± 0.446
GluAsn: 1.954 ± 1.939
GluPro: 1.954 ± 0.613
GluGln: 3.582 ± 0.72
GluArg: 0.977 ± 0.224
GluSer: 3.256 ± 0.785
GluThr: 3.582 ± 0.662
GluVal: 3.256 ± 0.733
GluTrp: 0.326 ± 0.173
GluTyr: 2.931 ± 0.082
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.279 ± 0.255
PheCys: 1.628 ± 1.081
PheAsp: 2.605 ± 0.944
PheGlu: 2.279 ± 1.209
PhePhe: 0.651 ± 0.247
PheGly: 1.628 ± 0.837
PheHis: 0.326 ± 0.173
PheIle: 5.536 ± 0.328
PheLys: 3.908 ± 2.371
PheLeu: 2.279 ± 0.255
PheMet: 2.279 ± 0.676
PheAsn: 5.21 ± 1.101
PhePro: 0.651 ± 0.345
PheGln: 0.651 ± 0.247
PheArg: 1.954 ± 0.448
PheSer: 4.559 ± 0.79
PheThr: 2.279 ± 0.777
PheVal: 1.303 ± 0.314
PheTrp: 0.326 ± 0.173
PheTyr: 1.303 ± 0.494
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.628 ± 0.393
GlyCys: 2.605 ± 0.988
GlyAsp: 1.628 ± 0.393
GlyGlu: 2.931 ± 0.463
GlyPhe: 2.605 ± 1.432
GlyGly: 2.279 ± 1.667
GlyHis: 2.279 ± 0.975
GlyIle: 3.582 ± 1.572
GlyLys: 3.256 ± 0.877
GlyLeu: 3.582 ± 1.162
GlyMet: 1.628 ± 0.496
GlyAsn: 3.256 ± 0.733
GlyPro: 1.303 ± 0.537
GlyGln: 1.303 ± 0.314
GlyArg: 1.628 ± 0.393
GlySer: 3.582 ± 1.286
GlyThr: 1.954 ± 0.593
GlyVal: 4.559 ± 1.149
GlyTrp: 0.651 ± 0.345
GlyTyr: 1.954 ± 0.282
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.651 ± 0.588
HisAsp: 1.628 ± 0.455
HisGlu: 0.977 ± 0.503
HisPhe: 0.326 ± 0.173
HisGly: 1.303 ± 0.494
HisHis: 0.326 ± 0.363
HisIle: 1.303 ± 0.314
HisLys: 2.279 ± 1.551
HisLeu: 1.954 ± 1.036
HisMet: 0.326 ± 0.173
HisAsn: 1.303 ± 0.691
HisPro: 0.326 ± 0.363
HisGln: 0.651 ± 0.345
HisArg: 0.651 ± 0.857
HisSer: 1.954 ± 0.613
HisThr: 1.628 ± 1.081
HisVal: 0.651 ± 0.345
HisTrp: 0.326 ± 0.173
HisTyr: 1.628 ± 1.235
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.21 ± 1.008
IleCys: 2.279 ± 0.902
IleAsp: 4.559 ± 1.779
IleGlu: 6.513 ± 1.821
IlePhe: 3.582 ± 0.938
IleGly: 5.21 ± 1.593
IleHis: 0.977 ± 0.518
IleIle: 6.838 ± 1.546
IleLys: 5.21 ± 1.135
IleLeu: 6.187 ± 0.463
IleMet: 0.977 ± 0.518
IleAsn: 6.187 ± 0.67
IlePro: 2.279 ± 0.676
IleGln: 3.582 ± 0.869
IleArg: 3.908 ± 1.624
IleSer: 9.443 ± 1.163
IleThr: 4.884 ± 0.401
IleVal: 5.861 ± 0.163
IleTrp: 0.326 ± 0.363
IleTyr: 4.559 ± 0.861
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.605 ± 0.332
LysCys: 1.303 ± 0.314
LysAsp: 3.256 ± 0.733
LysGlu: 6.513 ± 1.019
LysPhe: 3.908 ± 0.445
LysGly: 1.954 ± 0.593
LysHis: 1.954 ± 0.593
LysIle: 7.815 ± 0.45
LysLys: 4.233 ± 0.51
LysLeu: 5.861 ± 1.481
LysMet: 2.931 ± 0.568
LysAsn: 6.187 ± 0.463
LysPro: 2.605 ± 0.868
LysGln: 2.931 ± 1.562
LysArg: 3.582 ± 0.274
LysSer: 3.582 ± 0.938
LysThr: 6.513 ± 1.375
LysVal: 5.536 ± 1.756
LysTrp: 0.977 ± 0.224
LysTyr: 6.838 ± 1.701
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.908 ± 1.151
LeuCys: 1.303 ± 0.314
LeuAsp: 6.513 ± 0.984
LeuGlu: 2.279 ± 0.255
LeuPhe: 2.605 ± 0.651
LeuGly: 3.908 ± 0.184
LeuHis: 2.605 ± 0.944
LeuIle: 6.513 ± 1.231
LeuLys: 11.397 ± 1.711
LeuLeu: 8.466 ± 0.866
LeuMet: 2.279 ± 0.255
LeuAsn: 6.513 ± 1.572
LeuPro: 2.605 ± 0.629
LeuGln: 1.303 ± 0.468
LeuArg: 3.582 ± 0.274
LeuSer: 7.164 ± 0.168
LeuThr: 6.187 ± 0.857
LeuVal: 5.861 ± 1.005
LeuTrp: 0.977 ± 0.518
LeuTyr: 3.256 ± 1.282
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.628 ± 1.235
MetCys: 0.651 ± 0.247
MetAsp: 2.279 ± 0.69
MetGlu: 2.605 ± 0.332
MetPhe: 0.977 ± 0.518
MetGly: 0.977 ± 0.518
MetHis: 0.651 ± 0.725
MetIle: 3.582 ± 0.72
MetLys: 2.279 ± 0.676
MetLeu: 2.931 ± 1.112
MetMet: 0.651 ± 0.247
MetAsn: 1.303 ± 0.314
MetPro: 1.954 ± 0.448
MetGln: 0.977 ± 0.224
MetArg: 1.303 ± 0.468
MetSer: 1.628 ± 0.455
MetThr: 1.954 ± 1.006
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.977 ± 0.518
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.628 ± 1.242
AsnCys: 1.954 ± 1.192
AsnAsp: 4.233 ± 1.087
AsnGlu: 3.582 ± 1.453
AsnPhe: 2.931 ± 0.763
AsnGly: 3.908 ± 0.895
AsnHis: 0.651 ± 0.247
AsnIle: 8.466 ± 2.429
AsnLys: 6.838 ± 2.475
AsnLeu: 7.489 ± 1.213
AsnMet: 2.931 ± 0.763
AsnAsn: 6.187 ± 1.671
AsnPro: 1.954 ± 0.448
AsnGln: 1.954 ± 0.282
AsnArg: 3.582 ± 0.869
AsnSer: 3.256 ± 0.408
AsnThr: 3.582 ± 0.662
AsnVal: 4.559 ± 1.311
AsnTrp: 1.303 ± 0.314
AsnTyr: 2.931 ± 1.112
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.977 ± 0.503
ProCys: 0.0 ± 0.0
ProAsp: 2.279 ± 0.421
ProGlu: 3.582 ± 0.938
ProPhe: 2.279 ± 0.777
ProGly: 1.628 ± 0.766
ProHis: 0.651 ± 0.345
ProIle: 2.931 ± 0.671
ProLys: 2.931 ± 1.112
ProLeu: 1.954 ± 0.576
ProMet: 0.977 ± 0.224
ProAsn: 1.628 ± 0.455
ProPro: 0.977 ± 0.596
ProGln: 1.303 ± 0.494
ProArg: 0.977 ± 0.518
ProSer: 1.954 ± 0.593
ProThr: 1.303 ± 0.537
ProVal: 1.628 ± 0.455
ProTrp: 0.0 ± 0.0
ProTyr: 0.977 ± 0.596
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.279 ± 0.421
GlnCys: 0.977 ± 0.224
GlnAsp: 1.954 ± 0.448
GlnGlu: 1.954 ± 1.939
GlnPhe: 1.303 ± 0.314
GlnGly: 2.279 ± 0.957
GlnHis: 0.651 ± 0.345
GlnIle: 3.908 ± 0.627
GlnLys: 0.651 ± 0.345
GlnLeu: 1.303 ± 0.691
GlnMet: 0.326 ± 0.173
GlnAsn: 2.605 ± 0.249
GlnPro: 0.651 ± 0.345
GlnGln: 1.628 ± 0.455
GlnArg: 1.303 ± 0.468
GlnSer: 3.582 ± 0.274
GlnThr: 1.954 ± 0.448
GlnVal: 1.303 ± 0.314
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.279 ± 0.255
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.954 ± 1.099
ArgCys: 0.0 ± 0.0
ArgAsp: 1.628 ± 0.439
ArgGlu: 2.931 ± 0.463
ArgPhe: 1.954 ± 1.036
ArgGly: 2.605 ± 1.382
ArgHis: 0.326 ± 0.173
ArgIle: 1.628 ± 0.496
ArgLys: 3.256 ± 1.456
ArgLeu: 4.233 ± 0.618
ArgMet: 1.303 ± 1.177
ArgAsn: 2.931 ± 0.763
ArgPro: 1.303 ± 0.468
ArgGln: 0.977 ± 0.596
ArgArg: 1.628 ± 0.455
ArgSer: 3.256 ± 0.733
ArgThr: 3.582 ± 0.265
ArgVal: 3.582 ± 0.827
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.303 ± 0.938
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.628 ± 1.235
SerCys: 1.303 ± 0.955
SerAsp: 5.536 ± 0.319
SerGlu: 4.233 ± 0.618
SerPhe: 2.931 ± 0.763
SerGly: 2.279 ± 0.421
SerHis: 1.303 ± 0.537
SerIle: 6.513 ± 0.816
SerLys: 7.164 ± 0.7
SerLeu: 7.815 ± 1.07
SerMet: 2.931 ± 0.763
SerAsn: 4.233 ± 0.239
SerPro: 0.977 ± 0.518
SerGln: 1.954 ± 1.036
SerArg: 2.279 ± 0.957
SerSer: 2.605 ± 0.651
SerThr: 3.582 ± 0.274
SerVal: 2.931 ± 1.078
SerTrp: 0.651 ± 0.345
SerTyr: 3.256 ± 1.104
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.954 ± 1.388
ThrCys: 0.651 ± 0.345
ThrAsp: 1.628 ± 0.864
ThrGlu: 2.931 ± 0.082
ThrPhe: 2.931 ± 0.918
ThrGly: 4.233 ± 1.46
ThrHis: 0.651 ± 0.588
ThrIle: 3.256 ± 0.911
ThrLys: 3.582 ± 0.768
ThrLeu: 5.861 ± 1.135
ThrMet: 1.303 ± 0.468
ThrAsn: 3.908 ± 1.624
ThrPro: 2.279 ± 0.421
ThrGln: 3.256 ± 1.456
ThrArg: 3.908 ± 0.942
ThrSer: 3.582 ± 1.162
ThrThr: 2.605 ± 2.244
ThrVal: 4.233 ± 1.259
ThrTrp: 0.326 ± 0.363
ThrTyr: 4.233 ± 0.618
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.931 ± 0.949
ValCys: 0.0 ± 0.0
ValAsp: 2.605 ± 0.826
ValGlu: 3.582 ± 1.359
ValPhe: 2.605 ± 0.332
ValGly: 2.931 ± 2.082
ValHis: 1.303 ± 0.537
ValIle: 6.513 ± 0.755
ValLys: 4.884 ± 2.019
ValLeu: 6.513 ± 1.375
ValMet: 1.303 ± 0.691
ValAsn: 4.884 ± 0.401
ValPro: 4.559 ± 1.632
ValGln: 0.651 ± 0.345
ValArg: 1.303 ± 0.314
ValSer: 4.233 ± 0.51
ValThr: 1.303 ± 0.537
ValVal: 2.279 ± 1.667
ValTrp: 0.651 ± 0.588
ValTyr: 2.605 ± 0.826
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.326 ± 0.173
TrpCys: 0.0 ± 0.0
TrpAsp: 0.651 ± 0.345
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.326 ± 0.363
TrpGly: 0.651 ± 0.247
TrpHis: 0.326 ± 0.363
TrpIle: 0.651 ± 0.247
TrpLys: 0.326 ± 0.707
TrpLeu: 0.326 ± 0.173
TrpMet: 0.0 ± 0.0
TrpAsn: 0.651 ± 0.345
TrpPro: 0.326 ± 0.173
TrpGln: 0.326 ± 0.173
TrpArg: 0.326 ± 0.173
TrpSer: 0.326 ± 0.363
TrpThr: 0.651 ± 0.345
TrpVal: 0.977 ± 0.518
TrpTrp: 0.326 ± 0.363
TrpTyr: 0.651 ± 0.247
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.233 ± 0.618
TyrCys: 0.977 ± 1.088
TyrAsp: 1.954 ± 0.448
TyrGlu: 4.559 ± 1.311
TyrPhe: 1.303 ± 0.691
TyrGly: 3.908 ± 0.184
TyrHis: 1.303 ± 0.691
TyrIle: 3.582 ± 0.869
TyrLys: 4.233 ± 0.239
TyrLeu: 3.908 ± 0.942
TyrMet: 1.303 ± 1.177
TyrAsn: 4.233 ± 1.075
TyrPro: 1.303 ± 0.314
TyrGln: 3.256 ± 0.408
TyrArg: 0.977 ± 0.503
TyrSer: 2.279 ± 1.229
TyrThr: 2.279 ± 0.518
TyrVal: 3.908 ± 0.942
TyrTrp: 0.651 ± 0.247
TyrTyr: 2.279 ± 0.777
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3072 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
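A protein-level bootstrap of this kind might be sketched as follows. This is an illustration under stated assumptions, not the database's actual procedure: the function names, the toy sequences, the fixed seed, and the choice of the standard deviation across resamples as the reported error are all assumptions.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

proteins = ["MKLLKA", "AKLGG", "LLLKK"]  # hypothetical toy sequences
err = bootstrap_error(proteins, "KL")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only 3 proteins the error estimates can be large relative to the frequencies themselves.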

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski