Amino acid dipeptide frequency for Hubei diptera virus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
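As a rough illustration of the calculation, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It assumes one-letter amino acid codes, whereas the table on this page uses three-letter codes. The function name dipeptide_per_mille and the toy sequences are hypothetical. How the database itself treats protein boundaries is not stated here, so the choice to never count a pair across two proteins is an assumption.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: pairs are counted inside each protein only, never across
    the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of length 2 along the sequence (overlapping pairs).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the Hubei diptera virus 5 proteome.
freqs = dipeptide_per_mille(["MKKLL", "AKK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The frequencies always sum to 1000‰, so each value can be read as a share of all observed dipeptides.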

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
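The expected value and the red/blue split can be sketched as follows. This assumes the threshold is exactly the uniform expectation over the 20 standard amino acids. The precise colouring rule used by the page is not documented here, and the names EXPECTED_PER_MILLE and classify are hypothetical.

```python
N_STANDARD = 20

# Uniform expectation: 1 / 400 = 0.25 %, i.e. 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)

# With Xaa as a 21st letter there are 441 pairs instead of 400.
EXPECTED_PER_MILLE_WITH_XAA = 1000.0 / (21 ** 2)


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # would be marked red
    if freq_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"


# Two values copied from the table: LeuLeu and TrpTrp.
print(classify(11.301))
print(classify(0.0))
```

Under this assumption LeuLeu (11.301‰) is overrepresented and TrpTrp (0.0‰) is underrepresented.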

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
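A short unit conversion may help when comparing these values with percentages quoted elsewhere. The helper names are hypothetical; the example value is the LeuLeu entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


leu_leu = 11.301  # LeuLeu frequency in per mille, taken from the table
print(per_mille_to_fraction(leu_leu))
print(per_mille_to_percent(leu_leu))
```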

For more information, see the sequence space article on Wikipedia.

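Rows: first residue; columns: second residue. Each entry below gives the displayed value followed by its label in the form FirstSecond: frequency ± error (‰).
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa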
Ala
3.265AlaAla: 3.265 ± 3.79
0.753AlaCys: 0.753 ± 0.385
1.256AlaAsp: 1.256 ± 1.415
1.256AlaGlu: 1.256 ± 0.462
2.511AlaPhe: 2.511 ± 0.802
1.758AlaGly: 1.758 ± 0.861
0.251AlaHis: 0.251 ± 0.512
3.767AlaIle: 3.767 ± 0.316
2.26AlaLys: 2.26 ± 2.418
3.014AlaLeu: 3.014 ± 2.149
1.758AlaMet: 1.758 ± 1.194
2.009AlaAsn: 2.009 ± 0.564
1.256AlaPro: 1.256 ± 0.296
0.251AlaGln: 0.251 ± 0.156
1.758AlaArg: 1.758 ± 0.746
4.018AlaSer: 4.018 ± 0.416
4.269AlaThr: 4.269 ± 1.936
1.256AlaVal: 1.256 ± 0.341
0.502AlaTrp: 0.502 ± 0.524
1.507AlaTyr: 1.507 ± 1.377
0.0AlaXaa: 0.0 ± 0.0
Cys
0.251CysAla: 0.251 ± 0.512
0.0CysCys: 0.0 ± 0.0
1.256CysAsp: 1.256 ± 0.87
1.507CysGlu: 1.507 ± 0.809
1.507CysPhe: 1.507 ± 1.115
0.753CysGly: 0.753 ± 0.468
0.0CysHis: 0.0 ± 0.0
1.758CysIle: 1.758 ± 0.692
1.507CysLys: 1.507 ± 0.77
2.762CysLeu: 2.762 ± 0.531
0.251CysMet: 0.251 ± 0.156
2.26CysAsn: 2.26 ± 1.156
1.758CysPro: 1.758 ± 1.01
0.502CysGln: 0.502 ± 0.16
1.256CysArg: 1.256 ± 1.019
2.762CysSer: 2.762 ± 1.518
0.753CysThr: 0.753 ± 0.404
1.256CysVal: 1.256 ± 0.87
0.0CysTrp: 0.0 ± 0.0
0.251CysTyr: 0.251 ± 0.246
0.0CysXaa: 0.0 ± 0.0
Asp
1.507AspAla: 1.507 ± 0.481
2.26AspCys: 2.26 ± 1.558
4.771AspAsp: 4.771 ± 0.569
3.767AspGlu: 3.767 ± 0.764
4.52AspPhe: 4.52 ± 0.647
2.762AspGly: 2.762 ± 0.904
1.005AspHis: 1.005 ± 0.626
4.771AspIle: 4.771 ± 1.27
4.018AspLys: 4.018 ± 1.316
6.278AspLeu: 6.278 ± 1.739
0.251AspMet: 0.251 ± 0.156
2.26AspAsn: 2.26 ± 0.627
2.511AspPro: 2.511 ± 0.924
2.26AspGln: 2.26 ± 0.419
1.256AspArg: 1.256 ± 0.462
4.018AspSer: 4.018 ± 0.734
3.265AspThr: 3.265 ± 0.913
3.265AspVal: 3.265 ± 1.314
1.507AspTrp: 1.507 ± 0.35
3.516AspTyr: 3.516 ± 0.883
0.0AspXaa: 0.0 ± 0.0
Glu
2.762GluAla: 2.762 ± 1.207
1.256GluCys: 1.256 ± 0.536
5.023GluAsp: 5.023 ± 1.353
7.534GluGlu: 7.534 ± 2.876
3.014GluPhe: 3.014 ± 0.795
3.014GluGly: 3.014 ± 0.793
0.753GluHis: 0.753 ± 0.468
5.023GluIle: 5.023 ± 0.221
5.023GluLys: 5.023 ± 0.976
6.529GluLeu: 6.529 ± 1.019
1.005GluMet: 1.005 ± 0.488
2.762GluAsn: 2.762 ± 1.072
1.005GluPro: 1.005 ± 0.321
0.753GluGln: 0.753 ± 0.199
1.758GluArg: 1.758 ± 0.711
6.278GluSer: 6.278 ± 1.739
3.014GluThr: 3.014 ± 0.954
6.027GluVal: 6.027 ± 1.368
1.005GluTrp: 1.005 ± 0.624
1.758GluTyr: 1.758 ± 0.346
0.0GluXaa: 0.0 ± 0.0
Phe
3.014PheAla: 3.014 ± 1.536
1.005PheCys: 1.005 ± 0.321
3.265PheAsp: 3.265 ± 0.664
4.771PheGlu: 4.771 ± 1.225
1.256PhePhe: 1.256 ± 0.462
2.762PheGly: 2.762 ± 0.71
1.256PheHis: 1.256 ± 0.462
4.269PheIle: 4.269 ± 1.104
5.023PheLys: 5.023 ± 0.81
5.023PheLeu: 5.023 ± 0.713
2.511PheMet: 2.511 ± 1.098
1.758PheAsn: 1.758 ± 0.454
2.511PhePro: 2.511 ± 0.852
1.507PheGln: 1.507 ± 0.398
1.005PheArg: 1.005 ± 0.626
3.516PheSer: 3.516 ± 0.693
2.762PheThr: 2.762 ± 0.279
2.009PheVal: 2.009 ± 0.751
0.502PheTrp: 0.502 ± 0.16
2.511PheTyr: 2.511 ± 0.66
0.0PheXaa: 0.0 ± 0.0
Gly
1.005GlyAla: 1.005 ± 0.488
1.005GlyCys: 1.005 ± 0.626
2.511GlyAsp: 2.511 ± 0.652
3.516GlyGlu: 3.516 ± 0.906
4.018GlyPhe: 4.018 ± 0.714
2.009GlyGly: 2.009 ± 0.515
0.753GlyHis: 0.753 ± 0.199
3.265GlyIle: 3.265 ± 0.796
2.009GlyLys: 2.009 ± 0.208
3.516GlyLeu: 3.516 ± 3.213
1.005GlyMet: 1.005 ± 0.319
3.265GlyAsn: 3.265 ± 1.78
1.005GlyPro: 1.005 ± 0.909
1.256GlyGln: 1.256 ± 0.326
1.256GlyArg: 1.256 ± 0.87
4.018GlySer: 4.018 ± 0.724
2.009GlyThr: 2.009 ± 1.231
3.014GlyVal: 3.014 ± 0.795
0.251GlyTrp: 0.251 ± 0.156
1.005GlyTyr: 1.005 ± 0.624
0.0GlyXaa: 0.0 ± 0.0
His
0.251HisAla: 0.251 ± 0.156
0.753HisCys: 0.753 ± 0.739
1.758HisAsp: 1.758 ± 0.763
1.507HisGlu: 1.507 ± 0.398
1.005HisPhe: 1.005 ± 0.321
0.753HisGly: 0.753 ± 0.199
0.251HisHis: 0.251 ± 0.156
1.005HisIle: 1.005 ± 0.503
1.758HisLys: 1.758 ± 0.509
1.256HisLeu: 1.256 ± 0.829
0.502HisMet: 0.502 ± 0.16
0.753HisAsn: 0.753 ± 0.199
1.256HisPro: 1.256 ± 0.829
0.251HisGln: 0.251 ± 0.156
0.502HisArg: 0.502 ± 0.312
2.511HisSer: 2.511 ± 0.124
0.251HisThr: 0.251 ± 0.156
0.753HisVal: 0.753 ± 0.385
0.251HisTrp: 0.251 ± 0.246
1.005HisTyr: 1.005 ± 0.624
0.0HisXaa: 0.0 ± 0.0
Ile
2.26IleAla: 2.26 ± 0.826
1.758IleCys: 1.758 ± 0.346
4.52IleAsp: 4.52 ± 0.88
4.771IleGlu: 4.771 ± 0.899
3.516IlePhe: 3.516 ± 0.947
3.516IleGly: 3.516 ± 1.017
1.507IleHis: 1.507 ± 0.481
6.278IleIle: 6.278 ± 1.31
8.538IleLys: 8.538 ± 1.212
8.538IleLeu: 8.538 ± 1.042
2.26IleMet: 2.26 ± 1.322
5.274IleAsn: 5.274 ± 1.839
3.516IlePro: 3.516 ± 0.57
2.511IleGln: 2.511 ± 0.802
4.018IleArg: 4.018 ± 1.316
9.543IleSer: 9.543 ± 1.894
4.018IleThr: 4.018 ± 0.496
3.516IleVal: 3.516 ± 0.693
0.0IleTrp: 0.0 ± 0.0
2.762IleTyr: 2.762 ± 0.826
0.0IleXaa: 0.0 ± 0.0
Lys
3.516LysAla: 3.516 ± 1.533
1.256LysCys: 1.256 ± 0.87
4.771LysAsp: 4.771 ± 0.324
5.023LysGlu: 5.023 ± 1.247
3.265LysPhe: 3.265 ± 0.664
2.009LysGly: 2.009 ± 1.231
1.507LysHis: 1.507 ± 0.35
7.032LysIle: 7.032 ± 1.813
6.278LysLys: 6.278 ± 1.507
6.027LysLeu: 6.027 ± 1.546
3.516LysMet: 3.516 ± 0.553
4.269LysAsn: 4.269 ± 0.955
3.265LysPro: 3.265 ± 0.591
1.507LysGln: 1.507 ± 0.398
3.516LysArg: 3.516 ± 1.492
6.027LysSer: 6.027 ± 0.99
4.52LysThr: 4.52 ± 1.164
6.027LysVal: 6.027 ± 0.988
0.753LysTrp: 0.753 ± 0.468
2.511LysTyr: 2.511 ± 0.488
0.0LysXaa: 0.0 ± 0.0
Leu
2.762LeuAla: 2.762 ± 0.629
3.265LeuCys: 3.265 ± 0.913
5.525LeuAsp: 5.525 ± 0.758
5.274LeuGlu: 5.274 ± 1.362
5.525LeuPhe: 5.525 ± 0.378
4.52LeuGly: 4.52 ± 0.622
1.758LeuHis: 1.758 ± 0.746
9.292LeuIle: 9.292 ± 0.691
7.283LeuLys: 7.283 ± 1.395
11.301LeuLeu: 11.301 ± 0.946
2.009LeuMet: 2.009 ± 0.639
5.525LeuAsn: 5.525 ± 1.062
2.009LeuPro: 2.009 ± 0.515
2.26LeuGln: 2.26 ± 1.101
4.269LeuArg: 4.269 ± 0.632
8.287LeuSer: 8.287 ± 1.385
8.036LeuThr: 8.036 ± 3.288
4.018LeuVal: 4.018 ± 0.381
1.507LeuTrp: 1.507 ± 0.207
2.511LeuTyr: 2.511 ± 0.124
0.0LeuXaa: 0.0 ± 0.0
Met
2.009MetAla: 2.009 ± 0.636
0.0MetCys: 0.0 ± 0.0
2.26MetAsp: 2.26 ± 0.341
2.009MetGlu: 2.009 ± 0.636
2.009MetPhe: 2.009 ± 0.208
0.502MetGly: 0.502 ± 0.16
0.502MetHis: 0.502 ± 0.312
2.009MetIle: 2.009 ± 0.582
3.014MetLys: 3.014 ± 0.958
2.009MetLeu: 2.009 ± 0.564
1.256MetMet: 1.256 ± 0.78
1.507MetAsn: 1.507 ± 0.207
0.251MetPro: 0.251 ± 0.156
1.256MetGln: 1.256 ± 0.78
1.256MetArg: 1.256 ± 0.326
3.014MetSer: 3.014 ± 1.149
1.256MetThr: 1.256 ± 0.78
1.256MetVal: 1.256 ± 0.326
0.251MetTrp: 0.251 ± 0.156
0.502MetTyr: 0.502 ± 0.459
0.0MetXaa: 0.0 ± 0.0
Asn
1.005AsnAla: 1.005 ± 1.048
2.511AsnCys: 2.511 ± 0.66
2.762AsnAsp: 2.762 ± 1.518
3.516AsnGlu: 3.516 ± 1.525
2.26AsnPhe: 2.26 ± 0.596
2.009AsnGly: 2.009 ± 0.92
1.256AsnHis: 1.256 ± 0.326
4.018AsnIle: 4.018 ± 0.491
4.018AsnLys: 4.018 ± 1.031
5.274AsnLeu: 5.274 ± 1.995
1.005AsnMet: 1.005 ± 0.624
4.52AsnAsn: 4.52 ± 1.697
2.009AsnPro: 2.009 ± 0.642
1.507AsnGln: 1.507 ± 0.207
3.767AsnArg: 3.767 ± 0.977
5.776AsnSer: 5.776 ± 1.216
4.018AsnThr: 4.018 ± 0.751
2.762AsnVal: 2.762 ± 1.007
0.251AsnTrp: 0.251 ± 0.246
2.511AsnTyr: 2.511 ± 0.802
0.0AsnXaa: 0.0 ± 0.0
Pro
1.005ProAla: 1.005 ± 0.318
0.0ProCys: 0.0 ± 0.0
2.009ProAsp: 2.009 ± 0.208
2.762ProGlu: 2.762 ± 1.207
1.758ProPhe: 1.758 ± 0.763
1.256ProGly: 1.256 ± 0.462
0.753ProHis: 0.753 ± 0.385
3.767ProIle: 3.767 ± 0.662
1.758ProLys: 1.758 ± 1.01
3.265ProLeu: 3.265 ± 1.097
0.753ProMet: 0.753 ± 0.739
1.758ProAsn: 1.758 ± 0.137
0.753ProPro: 0.753 ± 0.739
1.005ProGln: 1.005 ± 0.503
1.758ProArg: 1.758 ± 0.473
1.758ProSer: 1.758 ± 0.714
2.26ProThr: 2.26 ± 0.611
1.256ProVal: 1.256 ± 0.341
0.502ProTrp: 0.502 ± 0.312
1.507ProTyr: 1.507 ± 0.936
0.0ProXaa: 0.0 ± 0.0
Gln
1.758GlnAla: 1.758 ± 0.454
0.251GlnCys: 0.251 ± 0.246
2.26GlnAsp: 2.26 ± 1.156
1.758GlnGlu: 1.758 ± 0.746
1.507GlnPhe: 1.507 ± 0.398
1.005GlnGly: 1.005 ± 0.321
0.502GlnHis: 0.502 ± 0.312
1.758GlnIle: 1.758 ± 0.346
1.507GlnLys: 1.507 ± 0.7
2.26GlnLeu: 2.26 ± 0.788
1.758GlnMet: 1.758 ± 0.454
2.009GlnAsn: 2.009 ± 0.639
0.251GlnPro: 0.251 ± 0.156
0.753GlnGln: 0.753 ± 0.456
0.502GlnArg: 0.502 ± 0.312
2.26GlnSer: 2.26 ± 0.627
1.758GlnThr: 1.758 ± 0.137
1.005GlnVal: 1.005 ± 0.909
0.0GlnTrp: 0.0 ± 0.0
1.256GlnTyr: 1.256 ± 0.462
0.0GlnXaa: 0.0 ± 0.0
Arg
2.009ArgAla: 2.009 ± 0.636
0.502ArgCys: 0.502 ± 0.16
3.265ArgAsp: 3.265 ± 0.796
2.511ArgGlu: 2.511 ± 0.488
3.767ArgPhe: 3.767 ± 1.407
2.26ArgGly: 2.26 ± 0.826
0.502ArgHis: 0.502 ± 0.459
4.52ArgIle: 4.52 ± 0.457
2.762ArgLys: 2.762 ± 0.279
3.516ArgLeu: 3.516 ± 0.57
1.256ArgMet: 1.256 ± 0.462
2.009ArgAsn: 2.009 ± 0.916
1.507ArgPro: 1.507 ± 0.773
1.005ArgGln: 1.005 ± 0.319
1.758ArgArg: 1.758 ± 0.454
3.265ArgSer: 3.265 ± 0.348
2.762ArgThr: 2.762 ± 0.448
3.516ArgVal: 3.516 ± 1.056
0.502ArgTrp: 0.502 ± 0.459
3.014ArgTyr: 3.014 ± 0.432
0.0ArgXaa: 0.0 ± 0.0
Ser
3.767SerAla: 3.767 ± 1.365
2.009SerCys: 2.009 ± 0.943
4.771SerAsp: 4.771 ± 1.294
5.023SerGlu: 5.023 ± 1.046
2.762SerPhe: 2.762 ± 1.007
4.269SerGly: 4.269 ± 0.17
2.009SerHis: 2.009 ± 0.639
9.292SerIle: 9.292 ± 2.947
8.79SerLys: 8.79 ± 1.105
8.538SerLeu: 8.538 ± 0.479
1.507SerMet: 1.507 ± 0.398
5.776SerAsn: 5.776 ± 1.573
2.26SerPro: 2.26 ± 0.779
1.005SerGln: 1.005 ± 0.624
7.283SerArg: 7.283 ± 1.986
10.296SerSer: 10.296 ± 1.481
4.771SerThr: 4.771 ± 1.008
5.776SerVal: 5.776 ± 0.391
1.005SerTrp: 1.005 ± 0.318
4.269SerTyr: 4.269 ± 0.669
0.0SerXaa: 0.0 ± 0.0
Thr
2.26ThrAla: 2.26 ± 1.732
0.251ThrCys: 0.251 ± 0.156
3.265ThrAsp: 3.265 ± 0.471
2.511ThrGlu: 2.511 ± 1.316
2.009ThrPhe: 2.009 ± 0.564
3.265ThrGly: 3.265 ± 1.465
1.507ThrHis: 1.507 ± 0.7
3.265ThrIle: 3.265 ± 0.942
4.52ThrLys: 4.52 ± 1.371
8.287ThrLeu: 8.287 ± 2.483
2.009ThrMet: 2.009 ± 0.751
3.014ThrAsn: 3.014 ± 0.415
1.507ThrPro: 1.507 ± 0.481
2.26ThrGln: 2.26 ± 0.419
1.758ThrArg: 1.758 ± 0.763
7.283ThrSer: 7.283 ± 1.945
3.516ThrThr: 3.516 ± 0.937
3.014ThrVal: 3.014 ± 0.954
0.753ThrTrp: 0.753 ± 0.404
2.26ThrTyr: 2.26 ± 0.594
0.0ThrXaa: 0.0 ± 0.0
Val
3.516ValAla: 3.516 ± 1.421
2.511ValCys: 2.511 ± 1.418
2.762ValAsp: 2.762 ± 1.518
4.018ValGlu: 4.018 ± 0.496
3.516ValPhe: 3.516 ± 0.553
2.009ValGly: 2.009 ± 0.73
1.256ValHis: 1.256 ± 0.326
3.516ValIle: 3.516 ± 0.906
3.265ValLys: 3.265 ± 0.942
5.023ValLeu: 5.023 ± 1.922
1.256ValMet: 1.256 ± 0.589
3.767ValAsn: 3.767 ± 1.1
1.758ValPro: 1.758 ± 0.454
2.009ValGln: 2.009 ± 1.818
3.516ValArg: 3.516 ± 0.747
4.52ValSer: 4.52 ± 0.302
2.762ValThr: 2.762 ± 1.13
2.26ValVal: 2.26 ± 1.147
0.502ValTrp: 0.502 ± 0.312
1.507ValTyr: 1.507 ± 0.35
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.251TrpGlu: 0.251 ± 0.246
0.502TrpPhe: 0.502 ± 0.459
0.251TrpGly: 0.251 ± 0.246
0.251TrpHis: 0.251 ± 0.246
1.758TrpIle: 1.758 ± 0.746
0.251TrpLys: 0.251 ± 0.156
1.507TrpLeu: 1.507 ± 0.7
0.251TrpMet: 0.251 ± 0.156
0.251TrpAsn: 0.251 ± 0.156
0.251TrpPro: 0.251 ± 0.156
0.502TrpGln: 0.502 ± 0.16
0.502TrpArg: 0.502 ± 0.312
2.511TrpSer: 2.511 ± 0.488
0.753TrpThr: 0.753 ± 0.199
0.753TrpVal: 0.753 ± 0.404
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.753TyrAla: 0.753 ± 0.385
1.005TyrCys: 1.005 ± 0.626
1.758TyrAsp: 1.758 ± 0.861
2.009TyrGlu: 2.009 ± 0.582
2.009TyrPhe: 2.009 ± 0.751
1.005TyrGly: 1.005 ± 0.319
0.753TyrHis: 0.753 ± 0.385
2.511TyrIle: 2.511 ± 0.802
3.014TyrLys: 3.014 ± 0.198
3.014TyrLeu: 3.014 ± 0.793
1.758TyrMet: 1.758 ± 0.826
2.009TyrAsn: 2.009 ± 0.515
1.005TyrPro: 1.005 ± 0.503
1.758TyrGln: 1.758 ± 0.746
3.516TyrArg: 3.516 ± 0.274
3.516TyrSer: 3.516 ± 0.747
1.758TyrThr: 1.758 ± 0.137
2.511TyrVal: 2.511 ± 0.591
0.502TyrTrp: 0.502 ± 0.312
1.005TyrTyr: 1.005 ± 0.319
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3983 amino acids)
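For readers who want to work with the table entries above as text, the following sketch parses one entry line such as "3.265AlaAla: 3.265 ± 3.79" into its parts. The regular expression and the function name parse_cell are assumptions based only on the line layout visible on this page; the CSV download mentioned below is likely a more convenient source.

```python
import re

# Layout assumed from this page: shown value, FirstSecond label, mean ± error.
CELL = re.compile(
    r"^(?P<shown>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}):\s*"
    r"(?P<mean>\d+(?:\.\d+)?)\s*±\s*(?P<err>\d+(?:\.\d+)?)$"
)


def parse_cell(line):
    """Return (first, second, mean, error) or None for non-entry lines."""
    m = CELL.match(line.strip())
    if m is None:
        return None  # e.g. a row label such as "Ala"
    return (m["first"], m["second"], float(m["mean"]), float(m["err"]))


print(parse_cell("3.265AlaAla: 3.265 ± 3.79"))
print(parse_cell("Ala"))
```

Row labels and the header line simply return None, so the parser can be mapped over every line of the table.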

Note: The error has been estimated by bootstrapping (×100) at the protein level.
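One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported. This is an illustration under assumptions, not the database's documented procedure. Using the standard deviation as the error measure, one-letter codes, and the names pair_per_mille and bootstrap_error are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over the given sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteins, not the sequences behind this page.
toy = ["AAAA", "CCCC"]
print(bootstrap_error(toy, "AA"))
print(bootstrap_error(toy, "WW"))
```

A dipeptide that never occurs has zero frequency in every resample and therefore zero error, which matches the 0.0 ± 0.0 entries in the table.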

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski