Amino acid dipepetide frequency for Kwatta virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.156AlaAla: 3.156 ± 1.498
1.315AlaCys: 1.315 ± 0.662
0.789AlaAsp: 0.789 ± 0.269
2.63AlaGlu: 2.63 ± 1.068
2.367AlaPhe: 2.367 ± 0.829
2.893AlaGly: 2.893 ± 0.605
1.052AlaHis: 1.052 ± 0.358
2.63AlaIle: 2.63 ± 0.395
1.841AlaLys: 1.841 ± 0.48
3.945AlaLeu: 3.945 ± 1.94
0.789AlaMet: 0.789 ± 0.686
1.841AlaAsn: 1.841 ± 0.507
1.578AlaPro: 1.578 ± 0.725
2.104AlaGln: 2.104 ± 0.668
2.893AlaArg: 2.893 ± 1.291
4.471AlaSer: 4.471 ± 0.912
1.841AlaThr: 1.841 ± 0.719
1.578AlaVal: 1.578 ± 0.539
0.263AlaTrp: 0.263 ± 0.309
0.789AlaTyr: 0.789 ± 0.366
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.526CysCys: 0.526 ± 0.238
1.052CysAsp: 1.052 ± 0.268
0.526CysGlu: 0.526 ± 0.299
0.526CysPhe: 0.526 ± 0.391
1.052CysGly: 1.052 ± 0.597
0.263CysHis: 0.263 ± 0.149
1.052CysIle: 1.052 ± 0.268
1.315CysLys: 1.315 ± 0.273
3.945CysLeu: 3.945 ± 1.069
0.263CysMet: 0.263 ± 0.149
0.263CysAsn: 0.263 ± 0.388
1.315CysPro: 1.315 ± 1.398
0.789CysGln: 0.789 ± 0.414
0.789CysArg: 0.789 ± 0.691
1.841CysSer: 1.841 ± 0.513
2.104CysThr: 2.104 ± 0.536
1.052CysVal: 1.052 ± 0.361
0.526CysTrp: 0.526 ± 0.299
0.526CysTyr: 0.526 ± 0.488
0.0CysXaa: 0.0 ± 0.0
Asp
1.578AspAla: 1.578 ± 1.419
1.315AspCys: 1.315 ± 0.399
4.734AspAsp: 4.734 ± 1.617
3.156AspGlu: 3.156 ± 1.036
3.156AspPhe: 3.156 ± 0.834
3.156AspGly: 3.156 ± 1.275
1.315AspHis: 1.315 ± 0.423
3.156AspIle: 3.156 ± 0.684
3.419AspLys: 3.419 ± 0.788
7.102AspLeu: 7.102 ± 1.083
1.578AspMet: 1.578 ± 0.456
1.578AspAsn: 1.578 ± 0.35
6.049AspPro: 6.049 ± 1.565
1.315AspGln: 1.315 ± 0.347
2.63AspArg: 2.63 ± 0.774
4.471AspSer: 4.471 ± 1.24
2.104AspThr: 2.104 ± 0.48
2.893AspVal: 2.893 ± 0.747
0.789AspTrp: 0.789 ± 0.541
2.63AspTyr: 2.63 ± 0.619
0.0AspXaa: 0.0 ± 0.0
Glu
1.578GluAla: 1.578 ± 0.385
0.789GluCys: 0.789 ± 0.269
3.156GluAsp: 3.156 ± 1.244
3.682GluGlu: 3.682 ± 1.045
1.578GluPhe: 1.578 ± 0.647
2.367GluGly: 2.367 ± 0.674
1.841GluHis: 1.841 ± 0.245
7.102GluIle: 7.102 ± 1.005
4.471GluLys: 4.471 ± 1.226
6.575GluLeu: 6.575 ± 1.877
1.052GluMet: 1.052 ± 0.307
2.63GluAsn: 2.63 ± 0.707
2.63GluPro: 2.63 ± 1.013
3.156GluGln: 3.156 ± 0.742
2.893GluArg: 2.893 ± 0.385
4.471GluSer: 4.471 ± 0.693
3.419GluThr: 3.419 ± 1.709
3.419GluVal: 3.419 ± 0.575
0.789GluTrp: 0.789 ± 0.337
2.104GluTyr: 2.104 ± 0.488
0.0GluXaa: 0.0 ± 0.0
Phe
1.578PheAla: 1.578 ± 0.411
1.315PheCys: 1.315 ± 0.495
3.419PheAsp: 3.419 ± 1.194
2.893PheGlu: 2.893 ± 0.726
3.419PhePhe: 3.419 ± 0.79
2.893PheGly: 2.893 ± 1.01
0.526PheHis: 0.526 ± 0.299
3.419PheIle: 3.419 ± 0.734
4.208PheLys: 4.208 ± 1.072
4.997PheLeu: 4.997 ± 0.748
0.789PheMet: 0.789 ± 0.337
0.789PheAsn: 0.789 ± 0.267
2.367PhePro: 2.367 ± 0.707
0.789PheGln: 0.789 ± 0.448
2.367PheArg: 2.367 ± 0.794
1.578PheSer: 1.578 ± 0.534
2.893PheThr: 2.893 ± 0.625
3.682PheVal: 3.682 ± 0.519
1.052PheTrp: 1.052 ± 0.659
0.789PheTyr: 0.789 ± 0.448
0.0PheXaa: 0.0 ± 0.0
Gly
2.893GlyAla: 2.893 ± 0.873
0.789GlyCys: 0.789 ± 0.448
2.893GlyAsp: 2.893 ± 0.907
2.893GlyGlu: 2.893 ± 0.869
3.419GlyPhe: 3.419 ± 1.381
3.682GlyGly: 3.682 ± 0.786
0.789GlyHis: 0.789 ± 0.448
4.208GlyIle: 4.208 ± 0.553
3.945GlyLys: 3.945 ± 1.383
5.786GlyLeu: 5.786 ± 0.972
2.104GlyMet: 2.104 ± 0.313
2.63GlyAsn: 2.63 ± 0.684
2.104GlyPro: 2.104 ± 0.559
3.156GlyGln: 3.156 ± 0.721
3.419GlyArg: 3.419 ± 0.864
5.786GlySer: 5.786 ± 1.961
3.945GlyThr: 3.945 ± 0.849
4.734GlyVal: 4.734 ± 0.794
1.315GlyTrp: 1.315 ± 0.615
1.315GlyTyr: 1.315 ± 0.483
0.0GlyXaa: 0.0 ± 0.0
His
0.789HisAla: 0.789 ± 0.403
0.0HisCys: 0.0 ± 0.0
0.789HisAsp: 0.789 ± 0.688
1.315HisGlu: 1.315 ± 0.477
1.841HisPhe: 1.841 ± 0.436
1.052HisGly: 1.052 ± 0.544
0.263HisHis: 0.263 ± 0.295
1.315HisIle: 1.315 ± 0.483
0.789HisLys: 0.789 ± 0.267
1.578HisLeu: 1.578 ± 0.618
1.052HisMet: 1.052 ± 0.268
1.052HisAsn: 1.052 ± 0.512
1.315HisPro: 1.315 ± 0.483
1.315HisGln: 1.315 ± 0.778
2.367HisArg: 2.367 ± 1.061
1.841HisSer: 1.841 ± 0.668
0.263HisThr: 0.263 ± 0.149
1.052HisVal: 1.052 ± 0.361
0.0HisTrp: 0.0 ± 0.0
1.052HisTyr: 1.052 ± 0.358
0.0HisXaa: 0.0 ± 0.0
Ile
3.945IleAla: 3.945 ± 1.123
0.789IleCys: 0.789 ± 0.448
3.419IleAsp: 3.419 ± 0.641
2.63IleGlu: 2.63 ± 0.916
3.419IlePhe: 3.419 ± 0.692
2.63IleGly: 2.63 ± 0.955
1.841IleHis: 1.841 ± 0.779
3.945IleIle: 3.945 ± 1.063
6.312IleLys: 6.312 ± 1.652
6.575IleLeu: 6.575 ± 1.533
2.104IleMet: 2.104 ± 0.522
3.156IleAsn: 3.156 ± 1.159
4.997IlePro: 4.997 ± 0.741
2.367IleGln: 2.367 ± 0.746
4.734IleArg: 4.734 ± 0.985
3.682IleSer: 3.682 ± 1.124
5.786IleThr: 5.786 ± 1.281
3.419IleVal: 3.419 ± 0.834
0.789IleTrp: 0.789 ± 0.337
3.156IleTyr: 3.156 ± 0.487
0.0IleXaa: 0.0 ± 0.0
Lys
2.367LysAla: 2.367 ± 0.643
1.052LysCys: 1.052 ± 0.401
2.893LysAsp: 2.893 ± 1.086
6.312LysGlu: 6.312 ± 1.025
3.156LysPhe: 3.156 ± 0.699
5.26LysGly: 5.26 ± 0.959
1.578LysHis: 1.578 ± 0.663
5.786LysIle: 5.786 ± 1.129
5.523LysLys: 5.523 ± 0.754
6.839LysLeu: 6.839 ± 0.798
0.789LysMet: 0.789 ± 0.448
4.471LysAsn: 4.471 ± 1.117
1.315LysPro: 1.315 ± 0.519
1.841LysGln: 1.841 ± 0.633
4.734LysArg: 4.734 ± 1.315
5.26LysSer: 5.26 ± 1.079
4.734LysThr: 4.734 ± 1.161
3.945LysVal: 3.945 ± 0.733
1.052LysTrp: 1.052 ± 0.459
1.578LysTyr: 1.578 ± 0.539
0.0LysXaa: 0.0 ± 0.0
Leu
5.26LeuAla: 5.26 ± 0.943
2.367LeuCys: 2.367 ± 0.517
5.26LeuAsp: 5.26 ± 0.861
7.365LeuGlu: 7.365 ± 1.442
3.419LeuPhe: 3.419 ± 0.485
7.891LeuGly: 7.891 ± 1.364
0.789LeuHis: 0.789 ± 0.267
7.102LeuIle: 7.102 ± 1.567
7.628LeuLys: 7.628 ± 2.062
8.417LeuLeu: 8.417 ± 1.391
3.945LeuMet: 3.945 ± 0.937
6.312LeuAsn: 6.312 ± 1.315
4.734LeuPro: 4.734 ± 2.356
3.156LeuGln: 3.156 ± 1.285
6.575LeuArg: 6.575 ± 1.158
8.68LeuSer: 8.68 ± 1.54
6.839LeuThr: 6.839 ± 0.871
6.575LeuVal: 6.575 ± 1.46
0.789LeuTrp: 0.789 ± 0.448
1.578LeuTyr: 1.578 ± 0.556
0.0LeuXaa: 0.0 ± 0.0
Met
1.052MetAla: 1.052 ± 0.597
0.263MetCys: 0.263 ± 0.149
2.63MetAsp: 2.63 ± 1.113
2.104MetGlu: 2.104 ± 0.631
2.367MetPhe: 2.367 ± 0.515
1.841MetGly: 1.841 ± 0.605
0.263MetHis: 0.263 ± 0.295
2.104MetIle: 2.104 ± 1.194
1.315MetLys: 1.315 ± 0.669
1.841MetLeu: 1.841 ± 0.468
1.052MetMet: 1.052 ± 0.677
1.052MetAsn: 1.052 ± 0.361
1.315MetPro: 1.315 ± 1.075
0.0MetGln: 0.0 ± 0.0
0.789MetArg: 0.789 ± 0.448
1.841MetSer: 1.841 ± 0.468
1.841MetThr: 1.841 ± 0.724
0.526MetVal: 0.526 ± 0.249
0.526MetTrp: 0.526 ± 0.5
0.526MetTyr: 0.526 ± 0.618
0.0MetXaa: 0.0 ± 0.0
Asn
1.841AsnAla: 1.841 ± 1.358
1.052AsnCys: 1.052 ± 0.268
2.63AsnAsp: 2.63 ± 0.879
1.315AsnGlu: 1.315 ± 0.468
1.578AsnPhe: 1.578 ± 0.442
2.104AsnGly: 2.104 ± 0.743
1.315AsnHis: 1.315 ± 0.448
3.156AsnIle: 3.156 ± 0.787
2.367AsnLys: 2.367 ± 1.071
6.049AsnLeu: 6.049 ± 1.831
0.526AsnMet: 0.526 ± 0.348
2.367AsnAsn: 2.367 ± 0.756
4.734AsnPro: 4.734 ± 0.846
0.789AsnGln: 0.789 ± 0.366
1.841AsnArg: 1.841 ± 0.758
3.945AsnSer: 3.945 ± 1.144
2.367AsnThr: 2.367 ± 0.789
2.367AsnVal: 2.367 ± 0.775
0.0AsnTrp: 0.0 ± 0.0
1.578AsnTyr: 1.578 ± 0.35
0.0AsnXaa: 0.0 ± 0.0
Pro
1.841ProAla: 1.841 ± 0.972
1.315ProCys: 1.315 ± 0.893
4.208ProAsp: 4.208 ± 0.993
3.156ProGlu: 3.156 ± 2.034
0.263ProPhe: 0.263 ± 0.149
2.893ProGly: 2.893 ± 1.791
0.789ProHis: 0.789 ± 0.417
3.682ProIle: 3.682 ± 1.067
3.419ProLys: 3.419 ± 1.424
5.26ProLeu: 5.26 ± 0.871
0.263ProMet: 0.263 ± 0.646
2.893ProAsn: 2.893 ± 0.694
4.208ProPro: 4.208 ± 1.97
2.104ProGln: 2.104 ± 0.988
2.63ProArg: 2.63 ± 0.779
4.997ProSer: 4.997 ± 2.309
2.104ProThr: 2.104 ± 0.907
4.471ProVal: 4.471 ± 1.622
0.789ProTrp: 0.789 ± 0.267
1.052ProTyr: 1.052 ± 0.497
0.0ProXaa: 0.0 ± 0.0
Gln
1.578GlnAla: 1.578 ± 0.516
0.263GlnCys: 0.263 ± 0.486
3.682GlnAsp: 3.682 ± 1.379
3.419GlnGlu: 3.419 ± 1.183
0.263GlnPhe: 0.263 ± 0.309
2.104GlnGly: 2.104 ± 0.634
0.789GlnHis: 0.789 ± 0.337
2.63GlnIle: 2.63 ± 0.685
3.156GlnLys: 3.156 ± 0.957
4.471GlnLeu: 4.471 ± 1.059
0.263GlnMet: 0.263 ± 0.292
2.104GlnAsn: 2.104 ± 0.536
0.263GlnPro: 0.263 ± 0.391
1.315GlnGln: 1.315 ± 0.477
2.367GlnArg: 2.367 ± 0.494
2.893GlnSer: 2.893 ± 1.057
1.578GlnThr: 1.578 ± 0.411
1.578GlnVal: 1.578 ± 1.965
0.526GlnTrp: 0.526 ± 0.553
0.263GlnTyr: 0.263 ± 0.149
0.0GlnXaa: 0.0 ± 0.0
Arg
1.578ArgAla: 1.578 ± 0.951
1.578ArgCys: 1.578 ± 0.498
3.156ArgAsp: 3.156 ± 0.809
3.945ArgGlu: 3.945 ± 0.778
2.893ArgPhe: 2.893 ± 0.615
3.419ArgGly: 3.419 ± 1.413
1.052ArgHis: 1.052 ± 0.476
3.419ArgIle: 3.419 ± 2.237
3.945ArgLys: 3.945 ± 0.965
4.997ArgLeu: 4.997 ± 0.771
2.893ArgMet: 2.893 ± 0.655
1.315ArgAsn: 1.315 ± 0.346
2.104ArgPro: 2.104 ± 0.766
1.578ArgGln: 1.578 ± 0.516
2.104ArgArg: 2.104 ± 1.368
3.682ArgSer: 3.682 ± 0.938
3.156ArgThr: 3.156 ± 1.221
3.419ArgVal: 3.419 ± 0.747
1.841ArgTrp: 1.841 ± 0.605
1.315ArgTyr: 1.315 ± 0.483
0.0ArgXaa: 0.0 ± 0.0
Ser
3.156SerAla: 3.156 ± 0.97
0.789SerCys: 0.789 ± 0.515
4.734SerAsp: 4.734 ± 0.817
5.523SerGlu: 5.523 ± 0.405
3.945SerPhe: 3.945 ± 1.327
5.786SerGly: 5.786 ± 0.736
2.893SerHis: 2.893 ± 1.174
5.26SerIle: 5.26 ± 0.983
4.208SerLys: 4.208 ± 0.597
11.573SerLeu: 11.573 ± 1.12
1.315SerMet: 1.315 ± 0.468
3.156SerAsn: 3.156 ± 0.854
4.471SerPro: 4.471 ± 1.513
1.841SerGln: 1.841 ± 0.614
3.682SerArg: 3.682 ± 1.463
6.312SerSer: 6.312 ± 1.403
3.945SerThr: 3.945 ± 0.847
3.682SerVal: 3.682 ± 1.117
1.315SerTrp: 1.315 ± 0.346
1.578SerTyr: 1.578 ± 0.49
0.0SerXaa: 0.0 ± 0.0
Thr
1.315ThrAla: 1.315 ± 0.495
0.789ThrCys: 0.789 ± 0.267
1.578ThrAsp: 1.578 ± 0.409
2.367ThrGlu: 2.367 ± 1.114
2.893ThrPhe: 2.893 ± 0.841
2.893ThrGly: 2.893 ± 0.623
0.526ThrHis: 0.526 ± 0.299
4.208ThrIle: 4.208 ± 0.729
4.471ThrLys: 4.471 ± 1.321
5.523ThrLeu: 5.523 ± 1.061
2.367ThrMet: 2.367 ± 0.471
2.367ThrAsn: 2.367 ± 0.701
3.419ThrPro: 3.419 ± 1.465
4.471ThrGln: 4.471 ± 1.46
2.893ThrArg: 2.893 ± 0.594
7.891ThrSer: 7.891 ± 1.387
3.156ThrThr: 3.156 ± 0.805
2.367ThrVal: 2.367 ± 0.543
1.841ThrTrp: 1.841 ± 0.846
1.841ThrTyr: 1.841 ± 0.391
0.0ThrXaa: 0.0 ± 0.0
Val
3.419ValAla: 3.419 ± 1.82
2.367ValCys: 2.367 ± 0.567
4.997ValAsp: 4.997 ± 0.655
2.63ValGlu: 2.63 ± 0.712
2.63ValPhe: 2.63 ± 0.93
4.471ValGly: 4.471 ± 1.434
1.052ValHis: 1.052 ± 0.497
2.893ValIle: 2.893 ± 0.607
3.945ValLys: 3.945 ± 0.723
5.523ValLeu: 5.523 ± 1.184
0.789ValMet: 0.789 ± 0.356
2.367ValAsn: 2.367 ± 0.707
2.104ValPro: 2.104 ± 0.313
1.841ValGln: 1.841 ± 0.452
1.841ValArg: 1.841 ± 0.586
3.682ValSer: 3.682 ± 0.98
3.945ValThr: 3.945 ± 0.854
3.419ValVal: 3.419 ± 1.456
0.526ValTrp: 0.526 ± 0.299
2.104ValTyr: 2.104 ± 0.536
0.0ValXaa: 0.0 ± 0.0
Trp
0.526TrpAla: 0.526 ± 0.238
0.263TrpCys: 0.263 ± 0.149
0.526TrpAsp: 0.526 ± 0.238
1.052TrpGlu: 1.052 ± 0.437
0.526TrpPhe: 0.526 ± 0.249
2.104TrpGly: 2.104 ± 0.692
0.789TrpHis: 0.789 ± 0.529
1.315TrpIle: 1.315 ± 0.726
1.578TrpLys: 1.578 ± 0.617
1.052TrpLeu: 1.052 ± 0.437
0.263TrpMet: 0.263 ± 0.295
1.315TrpAsn: 1.315 ± 0.483
0.263TrpPro: 0.263 ± 0.149
0.789TrpGln: 0.789 ± 0.399
0.263TrpArg: 0.263 ± 0.149
0.526TrpSer: 0.526 ± 0.873
1.315TrpThr: 1.315 ± 0.606
0.263TrpVal: 0.263 ± 0.309
0.263TrpTrp: 0.263 ± 0.149
0.263TrpTyr: 0.263 ± 0.149
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.315TyrAla: 1.315 ± 0.746
0.789TyrCys: 0.789 ± 0.269
1.578TyrAsp: 1.578 ± 0.35
0.789TyrGlu: 0.789 ± 0.448
2.104TyrPhe: 2.104 ± 0.432
1.315TyrGly: 1.315 ± 0.483
1.315TyrHis: 1.315 ± 0.858
1.315TyrIle: 1.315 ± 0.477
2.63TyrLys: 2.63 ± 0.967
2.63TyrLeu: 2.63 ± 0.537
0.789TyrMet: 0.789 ± 0.319
0.263TyrAsn: 0.263 ± 0.149
1.315TyrPro: 1.315 ± 0.347
0.789TyrGln: 0.789 ± 0.403
1.578TyrArg: 1.578 ± 0.692
1.315TyrSer: 1.315 ± 0.477
1.841TyrThr: 1.841 ± 0.463
2.104TyrVal: 2.104 ± 1.449
0.263TyrTrp: 0.263 ± 0.486
1.578TyrTyr: 1.578 ± 1.029
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3803 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski