Amino acid dipepetide frequency for Kasokero virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.546AlaAla: 3.546 ± 4.038
2.026AlaCys: 2.026 ± 0.558
1.182AlaAsp: 1.182 ± 0.581
5.235AlaGlu: 5.235 ± 0.441
1.857AlaPhe: 1.857 ± 0.085
3.377AlaGly: 3.377 ± 2.303
0.675AlaHis: 0.675 ± 0.384
3.884AlaIle: 3.884 ± 0.356
2.871AlaLys: 2.871 ± 2.088
3.546AlaLeu: 3.546 ± 1.057
0.507AlaMet: 0.507 ± 0.416
3.546AlaAsn: 3.546 ± 1.242
2.195AlaPro: 2.195 ± 0.495
1.182AlaGln: 1.182 ± 0.241
2.195AlaArg: 2.195 ± 0.089
2.702AlaSer: 2.702 ± 2.041
2.364AlaThr: 2.364 ± 0.352
4.897AlaVal: 4.897 ± 0.853
1.182AlaTrp: 1.182 ± 0.822
2.026AlaTyr: 2.026 ± 0.027
0.0AlaXaa: 0.0 ± 0.0
Cys
1.857CysAla: 1.857 ± 0.624
1.013CysCys: 1.013 ± 0.341
0.844CysAsp: 0.844 ± 0.267
1.182CysGlu: 1.182 ± 0.617
1.013CysPhe: 1.013 ± 0.269
0.675CysGly: 0.675 ± 0.451
0.675CysHis: 0.675 ± 0.451
1.351CysIle: 1.351 ± 0.664
1.52CysLys: 1.52 ± 0.865
3.546CysLeu: 3.546 ± 0.553
0.675CysMet: 0.675 ± 0.265
1.52CysAsn: 1.52 ± 0.865
2.026CysPro: 2.026 ± 0.796
1.351CysGln: 1.351 ± 0.239
1.351CysArg: 1.351 ± 0.498
1.52CysSer: 1.52 ± 0.249
2.026CysThr: 2.026 ± 0.796
1.182CysVal: 1.182 ± 0.396
0.844CysTrp: 0.844 ± 0.417
0.507CysTyr: 0.507 ± 0.288
0.0CysXaa: 0.0 ± 0.0
Asp
3.04AspAla: 3.04 ± 0.988
1.689AspCys: 1.689 ± 0.534
2.871AspAsp: 2.871 ± 1.076
3.208AspGlu: 3.208 ± 1.399
1.689AspPhe: 1.689 ± 0.659
3.208AspGly: 3.208 ± 1.099
0.675AspHis: 0.675 ± 0.49
3.04AspIle: 3.04 ± 1.359
2.533AspLys: 2.533 ± 0.818
6.248AspLeu: 6.248 ± 0.232
1.52AspMet: 1.52 ± 1.228
2.364AspAsn: 2.364 ± 0.17
1.857AspPro: 1.857 ± 0.555
1.52AspGln: 1.52 ± 0.747
1.689AspArg: 1.689 ± 0.532
5.066AspSer: 5.066 ± 0.222
1.52AspThr: 1.52 ± 0.578
3.546AspVal: 3.546 ± 0.68
0.844AspTrp: 0.844 ± 0.266
1.52AspTyr: 1.52 ± 0.462
0.0AspXaa: 0.0 ± 0.0
Glu
3.715GluAla: 3.715 ± 0.682
1.013GluCys: 1.013 ± 0.269
4.053GluAsp: 4.053 ± 0.651
6.754GluGlu: 6.754 ± 0.873
2.871GluPhe: 2.871 ± 0.342
3.884GluGly: 3.884 ± 1.397
1.857GluHis: 1.857 ± 0.76
4.559GluIle: 4.559 ± 1.492
6.586GluLys: 6.586 ± 0.404
8.781GluLeu: 8.781 ± 2.536
1.689GluMet: 1.689 ± 0.558
2.871GluAsn: 2.871 ± 0.871
1.52GluPro: 1.52 ± 0.459
2.364GluGln: 2.364 ± 0.352
2.364GluArg: 2.364 ± 0.352
5.235GluSer: 5.235 ± 0.974
4.053GluThr: 4.053 ± 0.693
6.079GluVal: 6.079 ± 1.847
0.675GluTrp: 0.675 ± 0.265
1.182GluTyr: 1.182 ± 0.414
0.0GluXaa: 0.0 ± 0.0
Phe
1.351PheAla: 1.351 ± 0.331
1.013PheCys: 1.013 ± 0.398
1.52PheAsp: 1.52 ± 0.122
3.715PheGlu: 3.715 ± 0.517
2.364PhePhe: 2.364 ± 0.727
2.026PheGly: 2.026 ± 0.918
1.013PheHis: 1.013 ± 0.398
1.182PheIle: 1.182 ± 0.393
4.39PheLys: 4.39 ± 1.532
4.897PheLeu: 4.897 ± 1.084
0.675PheMet: 0.675 ± 0.332
2.871PheAsn: 2.871 ± 0.694
1.182PhePro: 1.182 ± 0.777
1.689PheGln: 1.689 ± 0.107
1.857PheArg: 1.857 ± 0.605
4.559PheSer: 4.559 ± 0.367
2.533PheThr: 2.533 ± 0.391
1.182PheVal: 1.182 ± 0.396
0.169PheTrp: 0.169 ± 0.083
1.182PheTyr: 1.182 ± 0.581
0.0PheXaa: 0.0 ± 0.0
Gly
3.208GlyAla: 3.208 ± 1.954
1.351GlyCys: 1.351 ± 0.902
3.208GlyAsp: 3.208 ± 0.241
3.04GlyGlu: 3.04 ± 1.136
1.351GlyPhe: 1.351 ± 0.331
2.702GlyGly: 2.702 ± 0.9
1.182GlyHis: 1.182 ± 0.339
5.066GlyIle: 5.066 ± 1.079
5.91GlyLys: 5.91 ± 1.343
3.546GlyLeu: 3.546 ± 0.587
1.52GlyMet: 1.52 ± 0.265
2.871GlyAsn: 2.871 ± 0.783
1.351GlyPro: 1.351 ± 0.531
1.182GlyGln: 1.182 ± 0.252
2.702GlyArg: 2.702 ± 0.36
6.079GlySer: 6.079 ± 0.994
4.222GlyThr: 4.222 ± 1.696
3.04GlyVal: 3.04 ± 0.497
0.338GlyTrp: 0.338 ± 0.46
0.844GlyTyr: 0.844 ± 0.58
0.0GlyXaa: 0.0 ± 0.0
His
1.013HisAla: 1.013 ± 0.341
0.675HisCys: 0.675 ± 0.265
0.338HisAsp: 0.338 ± 0.518
1.52HisGlu: 1.52 ± 0.679
2.026HisPhe: 2.026 ± 0.796
1.52HisGly: 1.52 ± 0.462
0.507HisHis: 0.507 ± 0.249
1.689HisIle: 1.689 ± 0.605
1.351HisLys: 1.351 ± 0.396
1.857HisLeu: 1.857 ± 0.555
0.507HisMet: 0.507 ± 0.416
1.013HisAsn: 1.013 ± 0.294
0.844HisPro: 0.844 ± 0.317
0.507HisGln: 0.507 ± 0.445
1.52HisArg: 1.52 ± 0.526
1.52HisSer: 1.52 ± 0.526
1.182HisThr: 1.182 ± 0.339
1.182HisVal: 1.182 ± 0.339
0.169HisTrp: 0.169 ± 0.083
0.844HisTyr: 0.844 ± 0.317
0.0HisXaa: 0.0 ± 0.0
Ile
3.546IleAla: 3.546 ± 0.099
1.52IleCys: 1.52 ± 0.462
2.533IleAsp: 2.533 ± 0.517
4.728IleGlu: 4.728 ± 0.344
2.195IlePhe: 2.195 ± 0.288
2.026IleGly: 2.026 ± 1.328
1.351IleHis: 1.351 ± 0.664
2.533IleIle: 2.533 ± 0.733
4.897IleLys: 4.897 ± 1.599
6.586IleLeu: 6.586 ± 1.402
1.52IleMet: 1.52 ± 0.578
2.871IleAsn: 2.871 ± 0.498
1.689IlePro: 1.689 ± 1.188
1.689IleGln: 1.689 ± 0.483
2.702IleArg: 2.702 ± 0.683
4.728IleSer: 4.728 ± 1.387
3.715IleThr: 3.715 ± 0.171
3.377IleVal: 3.377 ± 0.841
0.507IleTrp: 0.507 ± 0.445
2.026IleTyr: 2.026 ± 1.094
0.0IleXaa: 0.0 ± 0.0
Lys
3.715LysAla: 3.715 ± 0.832
0.844LysCys: 0.844 ± 0.417
4.897LysAsp: 4.897 ± 1.246
5.572LysGlu: 5.572 ± 0.695
3.546LysPhe: 3.546 ± 1.137
5.404LysGly: 5.404 ± 0.997
2.195LysHis: 2.195 ± 0.943
3.715LysIle: 3.715 ± 0.624
7.599LysLys: 7.599 ± 1.787
7.261LysLeu: 7.261 ± 1.408
1.52LysMet: 1.52 ± 1.632
3.208LysAsn: 3.208 ± 0.849
3.546LysPro: 3.546 ± 0.722
1.857LysGln: 1.857 ± 0.085
3.884LysArg: 3.884 ± 0.729
5.066LysSer: 5.066 ± 1.321
4.39LysThr: 4.39 ± 0.362
8.274LysVal: 8.274 ± 0.275
0.675LysTrp: 0.675 ± 0.332
1.857LysTyr: 1.857 ± 1.149
0.0LysXaa: 0.0 ± 0.0
Leu
4.053LeuAla: 4.053 ± 0.54
2.364LeuCys: 2.364 ± 0.791
5.572LeuAsp: 5.572 ± 1.568
6.586LeuGlu: 6.586 ± 0.992
3.715LeuPhe: 3.715 ± 0.081
6.079LeuGly: 6.079 ± 1.098
2.195LeuHis: 2.195 ± 0.187
5.572LeuIle: 5.572 ± 2.222
7.937LeuLys: 7.937 ± 1.298
9.963LeuLeu: 9.963 ± 2.249
1.857LeuMet: 1.857 ± 0.741
5.404LeuAsn: 5.404 ± 0.436
4.559LeuPro: 4.559 ± 0.443
4.222LeuGln: 4.222 ± 1.213
3.884LeuArg: 3.884 ± 0.619
10.638LeuSer: 10.638 ± 1.067
6.079LeuThr: 6.079 ± 1.374
6.754LeuVal: 6.754 ± 0.831
0.169LeuTrp: 0.169 ± 0.083
2.533LeuTyr: 2.533 ± 0.798
0.0LeuXaa: 0.0 ± 0.0
Met
1.857MetAla: 1.857 ± 0.085
1.013MetCys: 1.013 ± 0.269
1.351MetAsp: 1.351 ± 1.331
1.52MetGlu: 1.52 ± 0.265
0.675MetPhe: 0.675 ± 0.332
1.182MetGly: 1.182 ± 0.418
0.338MetHis: 0.338 ± 0.518
1.351MetIle: 1.351 ± 0.498
1.351MetLys: 1.351 ± 0.18
2.702MetLeu: 2.702 ± 0.935
1.182MetMet: 1.182 ± 0.581
1.182MetAsn: 1.182 ± 0.418
1.013MetPro: 1.013 ± 0.498
0.675MetGln: 0.675 ± 0.332
0.844MetArg: 0.844 ± 0.415
1.689MetSer: 1.689 ± 0.634
1.013MetThr: 1.013 ± 0.372
1.182MetVal: 1.182 ± 0.777
0.169MetTrp: 0.169 ± 0.083
0.338MetTyr: 0.338 ± 0.166
0.0MetXaa: 0.0 ± 0.0
Asn
1.857AsnAla: 1.857 ± 0.625
1.689AsnCys: 1.689 ± 0.534
2.026AsnAsp: 2.026 ± 0.241
2.026AsnGlu: 2.026 ± 0.213
1.689AsnPhe: 1.689 ± 0.312
2.026AsnGly: 2.026 ± 0.538
1.182AsnHis: 1.182 ± 0.339
2.533AsnIle: 2.533 ± 0.43
3.208AsnLys: 3.208 ± 1.332
6.754AsnLeu: 6.754 ± 0.52
1.013AsnMet: 1.013 ± 0.269
3.377AsnAsn: 3.377 ± 0.623
1.857AsnPro: 1.857 ± 1.076
0.844AsnGln: 0.844 ± 0.317
2.871AsnArg: 2.871 ± 0.946
6.079AsnSer: 6.079 ± 1.098
2.195AsnThr: 2.195 ± 1.064
3.208AsnVal: 3.208 ± 0.422
1.182AsnTrp: 1.182 ± 0.241
2.195AsnTyr: 2.195 ± 0.659
0.0AsnXaa: 0.0 ± 0.0
Pro
2.195ProAla: 2.195 ± 1.07
1.013ProCys: 1.013 ± 0.269
3.377ProAsp: 3.377 ± 0.436
2.364ProGlu: 2.364 ± 1.009
2.195ProPhe: 2.195 ± 1.607
2.364ProGly: 2.364 ± 0.159
0.338ProHis: 0.338 ± 0.133
1.857ProIle: 1.857 ± 1.149
2.871ProLys: 2.871 ± 0.282
1.857ProLeu: 1.857 ± 0.535
0.507ProMet: 0.507 ± 0.249
1.013ProAsn: 1.013 ± 0.372
1.182ProPro: 1.182 ± 0.252
0.507ProGln: 0.507 ± 0.288
1.689ProArg: 1.689 ± 0.532
2.533ProSer: 2.533 ± 0.172
2.702ProThr: 2.702 ± 0.683
3.208ProVal: 3.208 ± 1.179
0.338ProTrp: 0.338 ± 0.133
1.182ProTyr: 1.182 ± 0.252
0.0ProXaa: 0.0 ± 0.0
Gln
1.857GlnAla: 1.857 ± 0.371
0.844GlnCys: 0.844 ± 0.417
2.195GlnAsp: 2.195 ± 0.089
2.702GlnGlu: 2.702 ± 0.958
1.013GlnPhe: 1.013 ± 0.372
1.689GlnGly: 1.689 ± 0.605
0.675GlnHis: 0.675 ± 0.198
2.026GlnIle: 2.026 ± 0.692
2.026GlnLys: 2.026 ± 0.692
3.546GlnLeu: 3.546 ± 1.255
1.52GlnMet: 1.52 ± 0.44
1.351GlnAsn: 1.351 ± 0.755
0.338GlnPro: 0.338 ± 0.133
2.026GlnGln: 2.026 ± 0.608
2.364GlnArg: 2.364 ± 0.367
1.857GlnSer: 1.857 ± 0.147
2.026GlnThr: 2.026 ± 1.101
2.364GlnVal: 2.364 ± 0.727
0.338GlnTrp: 0.338 ± 0.46
1.013GlnTyr: 1.013 ± 0.341
0.0GlnXaa: 0.0 ± 0.0
Arg
1.857ArgAla: 1.857 ± 0.582
0.675ArgCys: 0.675 ± 0.265
2.195ArgAsp: 2.195 ± 0.905
2.195ArgGlu: 2.195 ± 0.187
1.52ArgPhe: 1.52 ± 0.462
2.364ArgGly: 2.364 ± 0.159
1.52ArgHis: 1.52 ± 0.462
2.364ArgIle: 2.364 ± 0.677
4.559ArgLys: 4.559 ± 0.931
5.404ArgLeu: 5.404 ± 0.152
1.52ArgMet: 1.52 ± 0.747
2.364ArgAsn: 2.364 ± 0.569
2.026ArgPro: 2.026 ± 0.692
1.857ArgGln: 1.857 ± 0.741
2.533ArgArg: 2.533 ± 0.734
5.572ArgSer: 5.572 ± 0.596
3.208ArgThr: 3.208 ± 0.965
2.364ArgVal: 2.364 ± 0.929
0.338ArgTrp: 0.338 ± 0.166
1.182ArgTyr: 1.182 ± 0.241
0.0ArgXaa: 0.0 ± 0.0
Ser
4.559SerAla: 4.559 ± 0.446
2.871SerCys: 2.871 ± 0.638
4.222SerAsp: 4.222 ± 1.051
7.937SerGlu: 7.937 ± 1.287
5.235SerPhe: 5.235 ± 1.846
5.404SerGly: 5.404 ± 1.019
1.52SerHis: 1.52 ± 0.44
6.248SerIle: 6.248 ± 0.782
7.092SerLys: 7.092 ± 1.137
7.937SerLeu: 7.937 ± 2.392
1.351SerMet: 1.351 ± 0.755
4.39SerAsn: 4.39 ± 0.576
2.702SerPro: 2.702 ± 0.36
3.04SerGln: 3.04 ± 0.972
4.39SerArg: 4.39 ± 0.818
10.807SerSer: 10.807 ± 2.728
4.559SerThr: 4.559 ± 0.948
5.572SerVal: 5.572 ± 1.072
1.857SerTrp: 1.857 ± 1.407
1.857SerTyr: 1.857 ± 0.741
0.0SerXaa: 0.0 ± 0.0
Thr
2.702ThrAla: 2.702 ± 1.815
2.026ThrCys: 2.026 ± 0.241
2.871ThrAsp: 2.871 ± 1.234
4.053ThrGlu: 4.053 ± 1.493
2.026ThrPhe: 2.026 ± 0.823
4.053ThrGly: 4.053 ± 0.482
1.351ThrHis: 1.351 ± 0.531
1.689ThrIle: 1.689 ± 0.532
3.208ThrLys: 3.208 ± 0.429
4.728ThrLeu: 4.728 ± 0.239
1.182ThrMet: 1.182 ± 0.393
1.182ThrAsn: 1.182 ± 0.396
3.377ThrPro: 3.377 ± 0.333
1.52ThrGln: 1.52 ± 0.679
2.702ThrArg: 2.702 ± 1.022
7.092ThrSer: 7.092 ± 1.365
2.871ThrThr: 2.871 ± 1.382
4.053ThrVal: 4.053 ± 0.482
1.52ThrTrp: 1.52 ± 0.898
2.026ThrTyr: 2.026 ± 0.594
0.0ThrXaa: 0.0 ± 0.0
Val
3.377ValAla: 3.377 ± 1.14
1.182ValCys: 1.182 ± 0.339
2.871ValAsp: 2.871 ± 0.297
4.053ValGlu: 4.053 ± 0.614
2.195ValPhe: 2.195 ± 0.089
2.533ValGly: 2.533 ± 0.734
1.52ValHis: 1.52 ± 0.462
3.715ValIle: 3.715 ± 0.392
6.248ValLys: 6.248 ± 1.464
7.43ValLeu: 7.43 ± 1.572
1.351ValMet: 1.351 ± 0.531
4.728ValAsn: 4.728 ± 1.445
1.013ValPro: 1.013 ± 0.269
3.377ValGln: 3.377 ± 1.064
3.208ValArg: 3.208 ± 0.993
7.43ValSer: 7.43 ± 1.572
3.377ValThr: 3.377 ± 0.164
5.404ValVal: 5.404 ± 0.336
1.013ValTrp: 1.013 ± 0.329
1.857ValTyr: 1.857 ± 0.085
0.0ValXaa: 0.0 ± 0.0
Trp
0.675TrpAla: 0.675 ± 1.529
0.338TrpCys: 0.338 ± 0.133
0.675TrpAsp: 0.675 ± 0.332
1.182TrpGlu: 1.182 ± 0.548
0.675TrpPhe: 0.675 ± 0.662
1.013TrpGly: 1.013 ± 0.329
0.169TrpHis: 0.169 ± 0.515
0.507TrpIle: 0.507 ± 0.249
1.351TrpLys: 1.351 ± 0.498
1.689TrpLeu: 1.689 ± 0.696
0.338TrpMet: 0.338 ± 0.518
0.169TrpAsn: 0.169 ± 0.083
0.338TrpPro: 0.338 ± 0.166
0.507TrpGln: 0.507 ± 0.288
1.013TrpArg: 1.013 ± 0.832
0.675TrpSer: 0.675 ± 0.265
0.675TrpThr: 0.675 ± 0.378
0.507TrpVal: 0.507 ± 0.249
0.169TrpTrp: 0.169 ± 0.083
0.338TrpTyr: 0.338 ± 0.46
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.013TyrAla: 1.013 ± 0.398
1.52TyrCys: 1.52 ± 0.679
1.013TyrAsp: 1.013 ± 0.294
2.533TyrGlu: 2.533 ± 0.954
1.52TyrPhe: 1.52 ± 0.526
1.182TyrGly: 1.182 ± 0.777
0.844TyrHis: 0.844 ± 0.267
1.857TyrIle: 1.857 ± 0.605
1.52TyrLys: 1.52 ± 0.578
1.857TyrLeu: 1.857 ± 0.597
0.507TyrMet: 0.507 ± 0.416
1.52TyrAsn: 1.52 ± 0.265
0.675TyrPro: 0.675 ± 0.198
1.857TyrGln: 1.857 ± 1.142
1.857TyrArg: 1.857 ± 0.555
2.702TyrSer: 2.702 ± 0.335
1.52TyrThr: 1.52 ± 0.249
0.507TyrVal: 0.507 ± 0.249
0.507TyrTrp: 0.507 ± 0.416
0.507TyrTyr: 0.507 ± 0.288
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5923 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski