Amino acid dipepetide frequency for Mumps orthorubulavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.622AlaAla: 4.622 ± 1.492
0.804AlaCys: 0.804 ± 0.427
2.412AlaAsp: 2.412 ± 1.012
2.211AlaGlu: 2.211 ± 0.813
2.01AlaPhe: 2.01 ± 0.568
4.622AlaGly: 4.622 ± 1.042
1.005AlaHis: 1.005 ± 0.444
4.421AlaIle: 4.421 ± 1.001
3.416AlaLys: 3.416 ± 1.631
6.23AlaLeu: 6.23 ± 0.958
0.804AlaMet: 0.804 ± 0.607
3.818AlaAsn: 3.818 ± 0.78
2.613AlaPro: 2.613 ± 0.983
3.818AlaGln: 3.818 ± 1.7
4.622AlaArg: 4.622 ± 2.009
7.637AlaSer: 7.637 ± 1.555
5.024AlaThr: 5.024 ± 1.598
3.416AlaVal: 3.416 ± 1.541
0.402AlaTrp: 0.402 ± 0.324
2.412AlaTyr: 2.412 ± 0.637
0.0AlaXaa: 0.0 ± 0.0
Cys
1.005CysAla: 1.005 ± 0.419
0.603CysCys: 0.603 ± 0.256
0.402CysAsp: 0.402 ± 0.434
1.206CysGlu: 1.206 ± 0.448
1.005CysPhe: 1.005 ± 0.419
0.402CysGly: 0.402 ± 0.268
0.201CysHis: 0.201 ± 0.209
1.206CysIle: 1.206 ± 0.6
1.407CysLys: 1.407 ± 0.774
1.809CysLeu: 1.809 ± 0.598
0.402CysMet: 0.402 ± 0.336
0.603CysAsn: 0.603 ± 0.309
0.603CysPro: 0.603 ± 0.278
1.407CysGln: 1.407 ± 0.627
0.402CysArg: 0.402 ± 0.302
2.613CysSer: 2.613 ± 0.683
0.603CysThr: 0.603 ± 0.346
1.809CysVal: 1.809 ± 0.428
0.201CysTrp: 0.201 ± 0.206
1.206CysTyr: 1.206 ± 0.343
0.0CysXaa: 0.0 ± 0.0
Asp
2.01AspAla: 2.01 ± 0.406
0.603AspCys: 0.603 ± 0.27
3.014AspAsp: 3.014 ± 0.721
2.613AspGlu: 2.613 ± 0.964
1.608AspPhe: 1.608 ± 0.465
2.613AspGly: 2.613 ± 0.623
1.206AspHis: 1.206 ± 0.479
4.019AspIle: 4.019 ± 0.927
2.01AspLys: 2.01 ± 0.637
5.627AspLeu: 5.627 ± 0.859
0.402AspMet: 0.402 ± 0.207
2.814AspAsn: 2.814 ± 0.915
4.019AspPro: 4.019 ± 0.896
2.412AspGln: 2.412 ± 0.62
2.01AspArg: 2.01 ± 1.093
2.211AspSer: 2.211 ± 0.525
1.809AspThr: 1.809 ± 0.82
1.407AspVal: 1.407 ± 0.425
0.402AspTrp: 0.402 ± 0.212
2.412AspTyr: 2.412 ± 0.792
0.0AspXaa: 0.0 ± 0.0
Glu
1.809GluAla: 1.809 ± 0.845
0.603GluCys: 0.603 ± 0.379
2.211GluAsp: 2.211 ± 0.366
2.613GluGlu: 2.613 ± 0.767
1.809GluPhe: 1.809 ± 0.39
3.416GluGly: 3.416 ± 0.491
1.206GluHis: 1.206 ± 0.45
4.22GluIle: 4.22 ± 0.583
2.211GluLys: 2.211 ± 0.961
5.828GluLeu: 5.828 ± 0.802
1.407GluMet: 1.407 ± 0.47
2.412GluAsn: 2.412 ± 0.42
2.01GluPro: 2.01 ± 0.512
3.014GluGln: 3.014 ± 1.075
3.014GluArg: 3.014 ± 0.631
3.215GluSer: 3.215 ± 0.88
3.014GluThr: 3.014 ± 0.636
2.211GluVal: 2.211 ± 0.519
0.804GluTrp: 0.804 ± 0.536
1.407GluTyr: 1.407 ± 0.885
0.0GluXaa: 0.0 ± 0.0
Phe
2.412PheAla: 2.412 ± 0.594
0.804PheCys: 0.804 ± 0.435
1.005PheAsp: 1.005 ± 0.481
2.412PheGlu: 2.412 ± 0.622
2.01PhePhe: 2.01 ± 0.438
0.804PheGly: 0.804 ± 0.371
0.804PheHis: 0.804 ± 0.44
2.814PheIle: 2.814 ± 0.548
1.206PheLys: 1.206 ± 0.526
3.818PheLeu: 3.818 ± 0.672
0.603PheMet: 0.603 ± 0.225
2.01PheAsn: 2.01 ± 1.09
1.206PhePro: 1.206 ± 0.389
1.206PheGln: 1.206 ± 0.581
1.809PheArg: 1.809 ± 0.813
3.416PheSer: 3.416 ± 0.663
2.613PheThr: 2.613 ± 0.82
2.01PheVal: 2.01 ± 0.431
0.0PheTrp: 0.0 ± 0.0
1.608PheTyr: 1.608 ± 0.534
0.0PheXaa: 0.0 ± 0.0
Gly
3.617GlyAla: 3.617 ± 1.511
1.608GlyCys: 1.608 ± 0.519
3.818GlyAsp: 3.818 ± 0.691
3.014GlyGlu: 3.014 ± 0.8
1.608GlyPhe: 1.608 ± 0.396
3.617GlyGly: 3.617 ± 1.615
1.005GlyHis: 1.005 ± 0.312
4.019GlyIle: 4.019 ± 0.769
2.01GlyLys: 2.01 ± 0.299
4.22GlyLeu: 4.22 ± 0.786
1.608GlyMet: 1.608 ± 0.998
2.613GlyAsn: 2.613 ± 1.164
2.211GlyPro: 2.211 ± 0.856
2.211GlyGln: 2.211 ± 0.576
2.814GlyArg: 2.814 ± 0.494
5.225GlySer: 5.225 ± 1.341
4.019GlyThr: 4.019 ± 2.109
4.421GlyVal: 4.421 ± 0.831
0.402GlyTrp: 0.402 ± 0.207
1.206GlyTyr: 1.206 ± 0.431
0.0GlyXaa: 0.0 ± 0.0
His
1.809HisAla: 1.809 ± 0.412
0.201HisCys: 0.201 ± 0.126
0.603HisAsp: 0.603 ± 0.251
0.402HisGlu: 0.402 ± 0.253
0.402HisPhe: 0.402 ± 0.457
1.407HisGly: 1.407 ± 0.451
0.804HisHis: 0.804 ± 0.285
1.608HisIle: 1.608 ± 0.406
0.804HisLys: 0.804 ± 0.354
2.814HisLeu: 2.814 ± 1.012
0.402HisMet: 0.402 ± 0.207
0.603HisAsn: 0.603 ± 0.217
1.005HisPro: 1.005 ± 0.364
1.809HisGln: 1.809 ± 1.116
0.804HisArg: 0.804 ± 0.326
1.005HisSer: 1.005 ± 0.42
0.402HisThr: 0.402 ± 0.181
0.804HisVal: 0.804 ± 0.345
0.402HisTrp: 0.402 ± 0.329
0.201HisTyr: 0.201 ± 0.126
0.0HisXaa: 0.0 ± 0.0
Ile
5.627IleAla: 5.627 ± 1.53
1.005IleCys: 1.005 ± 0.401
2.814IleAsp: 2.814 ± 0.78
4.622IleGlu: 4.622 ± 0.951
1.608IlePhe: 1.608 ± 0.379
4.019IleGly: 4.019 ± 0.846
1.407IleHis: 1.407 ± 0.565
5.426IleIle: 5.426 ± 0.947
3.416IleLys: 3.416 ± 0.802
9.244IleLeu: 9.244 ± 2.936
2.01IleMet: 2.01 ± 0.661
6.029IleAsn: 6.029 ± 1.784
5.426IlePro: 5.426 ± 1.125
5.225IleGln: 5.225 ± 1.53
3.818IleArg: 3.818 ± 0.591
6.23IleSer: 6.23 ± 1.73
4.019IleThr: 4.019 ± 0.613
3.617IleVal: 3.617 ± 0.989
1.608IleTrp: 1.608 ± 1.011
1.608IleTyr: 1.608 ± 0.531
0.0IleXaa: 0.0 ± 0.0
Lys
2.814LysAla: 2.814 ± 0.609
1.206LysCys: 1.206 ± 0.58
1.809LysAsp: 1.809 ± 0.519
2.613LysGlu: 2.613 ± 0.62
1.206LysPhe: 1.206 ± 0.444
2.412LysGly: 2.412 ± 0.734
0.603LysHis: 0.603 ± 0.379
3.014LysIle: 3.014 ± 0.662
1.407LysLys: 1.407 ± 0.565
4.421LysLeu: 4.421 ± 1.106
0.804LysMet: 0.804 ± 0.267
1.608LysAsn: 1.608 ± 0.598
2.211LysPro: 2.211 ± 1.206
2.412LysGln: 2.412 ± 0.722
1.809LysArg: 1.809 ± 0.568
3.215LysSer: 3.215 ± 0.478
3.416LysThr: 3.416 ± 0.76
3.215LysVal: 3.215 ± 0.517
0.603LysTrp: 0.603 ± 0.225
2.01LysTyr: 2.01 ± 0.574
0.0LysXaa: 0.0 ± 0.0
Leu
8.24LeuAla: 8.24 ± 1.147
2.211LeuCys: 2.211 ± 0.793
5.627LeuAsp: 5.627 ± 1.276
5.828LeuGlu: 5.828 ± 1.568
3.416LeuPhe: 3.416 ± 0.256
3.416LeuGly: 3.416 ± 0.754
1.206LeuHis: 1.206 ± 0.758
8.039LeuIle: 8.039 ± 0.812
4.823LeuLys: 4.823 ± 1.507
10.852LeuLeu: 10.852 ± 1.49
2.814LeuMet: 2.814 ± 0.945
6.632LeuAsn: 6.632 ± 2.024
4.823LeuPro: 4.823 ± 0.879
3.014LeuGln: 3.014 ± 0.844
5.828LeuArg: 5.828 ± 1.085
9.847LeuSer: 9.847 ± 1.354
9.646LeuThr: 9.646 ± 2.039
5.426LeuVal: 5.426 ± 1.256
1.005LeuTrp: 1.005 ± 0.482
3.215LeuTyr: 3.215 ± 1.096
0.0LeuXaa: 0.0 ± 0.0
Met
1.608MetAla: 1.608 ± 0.595
0.402MetCys: 0.402 ± 0.253
1.407MetAsp: 1.407 ± 1.212
1.809MetGlu: 1.809 ± 0.334
0.201MetPhe: 0.201 ± 0.186
1.206MetGly: 1.206 ± 0.343
0.0MetHis: 0.0 ± 0.0
2.211MetIle: 2.211 ± 0.755
1.206MetLys: 1.206 ± 0.39
2.01MetLeu: 2.01 ± 0.751
0.603MetMet: 0.603 ± 0.452
1.608MetAsn: 1.608 ± 0.319
1.005MetPro: 1.005 ± 0.305
0.603MetGln: 0.603 ± 0.273
1.809MetArg: 1.809 ± 0.682
1.407MetSer: 1.407 ± 0.244
1.407MetThr: 1.407 ± 0.455
1.407MetVal: 1.407 ± 0.425
0.402MetTrp: 0.402 ± 0.207
0.804MetTyr: 0.804 ± 0.354
0.0MetXaa: 0.0 ± 0.0
Asn
3.215AsnAla: 3.215 ± 0.76
1.407AsnCys: 1.407 ± 0.844
2.412AsnAsp: 2.412 ± 0.508
2.412AsnGlu: 2.412 ± 0.501
1.407AsnPhe: 1.407 ± 0.283
2.613AsnGly: 2.613 ± 0.757
2.613AsnHis: 2.613 ± 0.756
4.421AsnIle: 4.421 ± 1.482
2.01AsnLys: 2.01 ± 0.607
4.823AsnLeu: 4.823 ± 1.061
1.005AsnMet: 1.005 ± 0.343
2.613AsnAsn: 2.613 ± 0.535
4.019AsnPro: 4.019 ± 1.121
3.215AsnGln: 3.215 ± 1.082
3.617AsnArg: 3.617 ± 0.555
4.019AsnSer: 4.019 ± 1.014
2.01AsnThr: 2.01 ± 0.825
2.01AsnVal: 2.01 ± 0.593
1.206AsnTrp: 1.206 ± 0.587
1.407AsnTyr: 1.407 ± 0.569
0.0AsnXaa: 0.0 ± 0.0
Pro
3.014ProAla: 3.014 ± 1.404
0.402ProCys: 0.402 ± 0.324
2.814ProAsp: 2.814 ± 0.726
3.014ProGlu: 3.014 ± 1.029
2.211ProPhe: 2.211 ± 0.639
3.416ProGly: 3.416 ± 1.162
0.402ProHis: 0.402 ± 0.253
5.024ProIle: 5.024 ± 1.223
1.809ProLys: 1.809 ± 0.629
5.627ProLeu: 5.627 ± 0.838
0.603ProMet: 0.603 ± 0.264
2.814ProAsn: 2.814 ± 1.011
3.215ProPro: 3.215 ± 0.645
2.613ProGln: 2.613 ± 0.61
1.608ProArg: 1.608 ± 0.76
4.421ProSer: 4.421 ± 1.188
5.225ProThr: 5.225 ± 1.369
3.416ProVal: 3.416 ± 1.453
0.201ProTrp: 0.201 ± 0.186
1.809ProTyr: 1.809 ± 0.662
0.0ProXaa: 0.0 ± 0.0
Gln
4.421GlnAla: 4.421 ± 0.854
0.201GlnCys: 0.201 ± 0.295
3.215GlnAsp: 3.215 ± 1.26
1.005GlnGlu: 1.005 ± 0.382
1.608GlnPhe: 1.608 ± 0.441
4.622GlnGly: 4.622 ± 1.864
0.402GlnHis: 0.402 ± 0.328
5.024GlnIle: 5.024 ± 1.129
1.809GlnLys: 1.809 ± 0.597
5.024GlnLeu: 5.024 ± 1.2
1.608GlnMet: 1.608 ± 0.478
2.01GlnAsn: 2.01 ± 1.598
3.416GlnPro: 3.416 ± 1.74
3.215GlnGln: 3.215 ± 2.066
2.01GlnArg: 2.01 ± 0.559
3.416GlnSer: 3.416 ± 0.557
3.014GlnThr: 3.014 ± 0.517
3.014GlnVal: 3.014 ± 0.608
0.201GlnTrp: 0.201 ± 0.206
2.211GlnTyr: 2.211 ± 0.348
0.0GlnXaa: 0.0 ± 0.0
Arg
2.613ArgAla: 2.613 ± 0.623
0.603ArgCys: 0.603 ± 0.466
1.608ArgAsp: 1.608 ± 0.441
1.608ArgGlu: 1.608 ± 0.519
2.01ArgPhe: 2.01 ± 0.862
2.211ArgGly: 2.211 ± 1.344
1.005ArgHis: 1.005 ± 0.524
4.823ArgIle: 4.823 ± 1.339
2.613ArgLys: 2.613 ± 0.716
6.431ArgLeu: 6.431 ± 1.049
1.206ArgMet: 1.206 ± 0.367
1.206ArgAsn: 1.206 ± 0.271
2.613ArgPro: 2.613 ± 0.465
2.01ArgGln: 2.01 ± 0.829
4.22ArgArg: 4.22 ± 0.7
5.627ArgSer: 5.627 ± 0.988
1.809ArgThr: 1.809 ± 0.372
3.818ArgVal: 3.818 ± 0.846
0.201ArgTrp: 0.201 ± 0.126
2.01ArgTyr: 2.01 ± 1.093
0.0ArgXaa: 0.0 ± 0.0
Ser
5.627SerAla: 5.627 ± 1.454
2.211SerCys: 2.211 ± 0.654
3.818SerAsp: 3.818 ± 1.057
3.617SerGlu: 3.617 ± 1.264
3.617SerPhe: 3.617 ± 0.813
5.225SerGly: 5.225 ± 1.671
1.809SerHis: 1.809 ± 0.364
4.622SerIle: 4.622 ± 1.561
3.617SerLys: 3.617 ± 1.209
9.646SerLeu: 9.646 ± 1.724
3.014SerMet: 3.014 ± 0.787
4.622SerAsn: 4.622 ± 1.084
4.421SerPro: 4.421 ± 0.876
4.22SerGln: 4.22 ± 0.893
1.809SerArg: 1.809 ± 0.536
7.838SerSer: 7.838 ± 1.253
5.627SerThr: 5.627 ± 1.346
4.22SerVal: 4.22 ± 0.5
1.608SerTrp: 1.608 ± 0.44
3.818SerTyr: 3.818 ± 0.711
0.0SerXaa: 0.0 ± 0.0
Thr
5.426ThrAla: 5.426 ± 0.922
1.608ThrCys: 1.608 ± 0.612
2.412ThrAsp: 2.412 ± 0.546
2.613ThrGlu: 2.613 ± 0.426
2.613ThrPhe: 2.613 ± 0.636
4.421ThrGly: 4.421 ± 2.065
0.804ThrHis: 0.804 ± 0.525
6.833ThrIle: 6.833 ± 1.137
2.412ThrLys: 2.412 ± 0.6
7.436ThrLeu: 7.436 ± 1.345
1.005ThrMet: 1.005 ± 0.506
2.412ThrAsn: 2.412 ± 0.585
3.617ThrPro: 3.617 ± 0.687
3.617ThrGln: 3.617 ± 0.808
3.416ThrArg: 3.416 ± 1.048
3.818ThrSer: 3.818 ± 0.774
4.421ThrThr: 4.421 ± 0.619
4.823ThrVal: 4.823 ± 0.993
0.603ThrTrp: 0.603 ± 0.379
1.407ThrTyr: 1.407 ± 0.545
0.0ThrXaa: 0.0 ± 0.0
Val
2.814ValAla: 2.814 ± 0.997
1.005ValCys: 1.005 ± 0.658
2.412ValAsp: 2.412 ± 0.408
3.014ValGlu: 3.014 ± 0.83
2.211ValPhe: 2.211 ± 0.732
2.613ValGly: 2.613 ± 1.169
1.809ValHis: 1.809 ± 0.695
3.215ValIle: 3.215 ± 0.725
2.814ValLys: 2.814 ± 0.842
5.426ValLeu: 5.426 ± 0.335
1.608ValMet: 1.608 ± 0.366
3.014ValAsn: 3.014 ± 0.703
2.412ValPro: 2.412 ± 0.605
2.613ValGln: 2.613 ± 0.594
2.814ValArg: 2.814 ± 0.453
5.426ValSer: 5.426 ± 1.369
5.426ValThr: 5.426 ± 1.902
3.014ValVal: 3.014 ± 0.872
0.603ValTrp: 0.603 ± 0.32
2.814ValTyr: 2.814 ± 0.599
0.0ValXaa: 0.0 ± 0.0
Trp
1.206TrpAla: 1.206 ± 0.415
0.603TrpCys: 0.603 ± 0.278
0.201TrpAsp: 0.201 ± 0.268
0.402TrpGlu: 0.402 ± 0.253
0.603TrpPhe: 0.603 ± 0.272
0.603TrpGly: 0.603 ± 0.272
0.0TrpHis: 0.0 ± 0.0
0.804TrpIle: 0.804 ± 0.299
0.603TrpLys: 0.603 ± 0.379
0.402TrpLeu: 0.402 ± 0.253
0.201TrpMet: 0.201 ± 0.126
0.603TrpAsn: 0.603 ± 0.251
0.804TrpPro: 0.804 ± 0.363
0.402TrpGln: 0.402 ± 0.253
0.402TrpArg: 0.402 ± 0.253
1.608TrpSer: 1.608 ± 0.59
0.402TrpThr: 0.402 ± 0.253
0.804TrpVal: 0.804 ± 0.391
0.201TrpTrp: 0.201 ± 0.186
0.402TrpTyr: 0.402 ± 0.253
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.412TyrAla: 2.412 ± 1.277
1.005TyrCys: 1.005 ± 0.412
1.206TyrAsp: 1.206 ± 0.364
1.407TyrGlu: 1.407 ± 0.389
1.407TyrPhe: 1.407 ± 0.778
1.608TyrGly: 1.608 ± 0.44
0.0TyrHis: 0.0 ± 0.0
3.014TyrIle: 3.014 ± 0.784
1.206TyrLys: 1.206 ± 0.47
4.22TyrLeu: 4.22 ± 0.922
0.804TyrMet: 0.804 ± 0.423
2.412TyrAsn: 2.412 ± 0.559
2.211TyrPro: 2.211 ± 0.468
2.412TyrGln: 2.412 ± 0.449
1.608TyrArg: 1.608 ± 0.501
2.814TyrSer: 2.814 ± 0.616
1.809TyrThr: 1.809 ± 0.561
2.211TyrVal: 2.211 ± 0.541
0.201TyrTrp: 0.201 ± 0.126
1.407TyrTyr: 1.407 ± 0.378
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4977 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski