Amino acid dipepetide frequency for Almpiwar virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.878AlaAla: 2.878 ± 1.864
0.785AlaCys: 0.785 ± 0.528
2.355AlaAsp: 2.355 ± 0.464
2.355AlaGlu: 2.355 ± 0.897
2.355AlaPhe: 2.355 ± 0.651
3.401AlaGly: 3.401 ± 0.487
0.785AlaHis: 0.785 ± 0.371
2.878AlaIle: 2.878 ± 0.683
1.57AlaLys: 1.57 ± 1.087
4.186AlaLeu: 4.186 ± 0.962
1.57AlaMet: 1.57 ± 1.094
2.093AlaAsn: 2.093 ± 1.191
0.785AlaPro: 0.785 ± 0.619
1.57AlaGln: 1.57 ± 0.475
1.308AlaArg: 1.308 ± 0.35
2.616AlaSer: 2.616 ± 1.108
2.616AlaThr: 2.616 ± 0.684
1.308AlaVal: 1.308 ± 0.918
1.308AlaTrp: 1.308 ± 0.544
2.355AlaTyr: 2.355 ± 0.577
0.0AlaXaa: 0.0 ± 0.0
Cys
1.308CysAla: 1.308 ± 0.613
0.0CysCys: 0.0 ± 0.0
0.262CysAsp: 0.262 ± 0.152
0.785CysGlu: 0.785 ± 0.396
1.047CysPhe: 1.047 ± 0.422
0.785CysGly: 0.785 ± 0.78
0.785CysHis: 0.785 ± 0.61
1.57CysIle: 1.57 ± 0.7
1.308CysLys: 1.308 ± 0.539
2.616CysLeu: 2.616 ± 1.458
0.785CysMet: 0.785 ± 0.396
1.57CysAsn: 1.57 ± 0.563
0.523CysPro: 0.523 ± 0.667
1.047CysGln: 1.047 ± 0.589
0.785CysArg: 0.785 ± 0.445
2.093CysSer: 2.093 ± 0.579
1.047CysThr: 1.047 ± 0.327
1.047CysVal: 1.047 ± 0.427
0.785CysTrp: 0.785 ± 0.396
0.523CysTyr: 0.523 ± 0.304
0.0CysXaa: 0.0 ± 0.0
Asp
1.308AspAla: 1.308 ± 0.656
1.047AspCys: 1.047 ± 0.471
2.616AspAsp: 2.616 ± 1.3
3.663AspGlu: 3.663 ± 1.837
2.878AspPhe: 2.878 ± 0.859
2.878AspGly: 2.878 ± 1.267
1.047AspHis: 1.047 ± 0.468
4.448AspIle: 4.448 ± 1.493
4.71AspLys: 4.71 ± 1.164
5.233AspLeu: 5.233 ± 1.231
1.832AspMet: 1.832 ± 0.583
1.832AspAsn: 1.832 ± 0.39
2.355AspPro: 2.355 ± 0.717
1.832AspGln: 1.832 ± 0.646
0.785AspArg: 0.785 ± 0.619
4.448AspSer: 4.448 ± 1.016
3.401AspThr: 3.401 ± 0.449
2.878AspVal: 2.878 ± 1.152
1.57AspTrp: 1.57 ± 0.727
2.616AspTyr: 2.616 ± 0.683
0.0AspXaa: 0.0 ± 0.0
Glu
2.355GluAla: 2.355 ± 1.04
0.785GluCys: 0.785 ± 0.329
4.448GluAsp: 4.448 ± 1.563
4.448GluGlu: 4.448 ± 2.306
3.925GluPhe: 3.925 ± 1.442
3.925GluGly: 3.925 ± 1.106
1.57GluHis: 1.57 ± 1.095
4.448GluIle: 4.448 ± 1.362
4.448GluLys: 4.448 ± 1.121
6.541GluLeu: 6.541 ± 1.856
1.047GluMet: 1.047 ± 0.611
2.878GluAsn: 2.878 ± 0.587
0.523GluPro: 0.523 ± 0.404
1.308GluGln: 1.308 ± 0.634
2.093GluArg: 2.093 ± 0.791
5.233GluSer: 5.233 ± 0.889
2.616GluThr: 2.616 ± 0.539
3.401GluVal: 3.401 ± 1.471
1.308GluTrp: 1.308 ± 0.539
2.616GluTyr: 2.616 ± 0.944
0.0GluXaa: 0.0 ± 0.0
Phe
1.57PheAla: 1.57 ± 0.429
1.57PheCys: 1.57 ± 1.072
2.878PheAsp: 2.878 ± 0.813
2.355PheGlu: 2.355 ± 0.673
2.093PhePhe: 2.093 ± 1.113
2.616PheGly: 2.616 ± 0.691
0.262PheHis: 0.262 ± 0.152
2.878PheIle: 2.878 ± 0.749
3.925PheLys: 3.925 ± 0.754
6.018PheLeu: 6.018 ± 1.353
1.047PheMet: 1.047 ± 0.56
2.616PheAsn: 2.616 ± 0.623
3.401PhePro: 3.401 ± 0.734
2.093PheGln: 2.093 ± 0.953
1.308PheArg: 1.308 ± 0.496
3.925PheSer: 3.925 ± 1.714
2.616PheThr: 2.616 ± 0.677
2.616PheVal: 2.616 ± 0.964
1.308PheTrp: 1.308 ± 0.733
1.047PheTyr: 1.047 ± 0.665
0.0PheXaa: 0.0 ± 0.0
Gly
2.093GlyAla: 2.093 ± 0.608
1.308GlyCys: 1.308 ± 0.472
2.616GlyAsp: 2.616 ± 0.769
3.401GlyGlu: 3.401 ± 1.129
2.355GlyPhe: 2.355 ± 0.523
3.663GlyGly: 3.663 ± 1.23
2.355GlyHis: 2.355 ± 0.629
4.186GlyIle: 4.186 ± 1.389
3.663GlyLys: 3.663 ± 1.006
5.756GlyLeu: 5.756 ± 1.568
2.093GlyMet: 2.093 ± 0.579
1.308GlyAsn: 1.308 ± 0.868
2.616GlyPro: 2.616 ± 1.065
2.616GlyGln: 2.616 ± 0.862
3.401GlyArg: 3.401 ± 0.998
5.495GlySer: 5.495 ± 0.879
2.093GlyThr: 2.093 ± 0.528
3.14GlyVal: 3.14 ± 1.123
2.355GlyTrp: 2.355 ± 0.823
2.878GlyTyr: 2.878 ± 0.516
0.0GlyXaa: 0.0 ± 0.0
His
0.523HisAla: 0.523 ± 0.294
0.523HisCys: 0.523 ± 0.267
1.308HisAsp: 1.308 ± 0.768
1.57HisGlu: 1.57 ± 0.658
1.832HisPhe: 1.832 ± 0.54
1.047HisGly: 1.047 ± 0.899
1.047HisHis: 1.047 ± 0.468
1.308HisIle: 1.308 ± 0.51
2.093HisLys: 2.093 ± 0.536
2.616HisLeu: 2.616 ± 1.218
0.523HisMet: 0.523 ± 0.304
1.047HisAsn: 1.047 ± 0.534
1.832HisPro: 1.832 ± 0.866
0.523HisGln: 0.523 ± 0.612
0.785HisArg: 0.785 ± 0.297
1.832HisSer: 1.832 ± 0.633
1.047HisThr: 1.047 ± 0.814
0.785HisVal: 0.785 ± 0.61
1.047HisTrp: 1.047 ± 0.427
1.047HisTyr: 1.047 ± 0.665
0.0HisXaa: 0.0 ± 0.0
Ile
2.093IleAla: 2.093 ± 0.936
1.57IleCys: 1.57 ± 0.911
4.71IleAsp: 4.71 ± 1.083
3.14IleGlu: 3.14 ± 1.519
3.401IlePhe: 3.401 ± 1.018
5.233IleGly: 5.233 ± 1.527
2.355IleHis: 2.355 ± 0.823
5.495IleIle: 5.495 ± 1.939
3.925IleLys: 3.925 ± 1.4
6.541IleLeu: 6.541 ± 2.805
1.832IleMet: 1.832 ± 0.754
5.233IleAsn: 5.233 ± 1.486
4.186IlePro: 4.186 ± 1.577
2.616IleGln: 2.616 ± 1.337
5.495IleArg: 5.495 ± 1.575
6.279IleSer: 6.279 ± 1.304
5.756IleThr: 5.756 ± 1.33
2.878IleVal: 2.878 ± 1.381
0.523IleTrp: 0.523 ± 0.304
4.186IleTyr: 4.186 ± 1.887
0.0IleXaa: 0.0 ± 0.0
Lys
1.57LysAla: 1.57 ± 0.959
1.57LysCys: 1.57 ± 0.658
3.663LysAsp: 3.663 ± 0.704
4.971LysGlu: 4.971 ± 0.668
3.401LysPhe: 3.401 ± 0.941
5.233LysGly: 5.233 ± 1.452
1.047LysHis: 1.047 ± 0.743
8.373LysIle: 8.373 ± 1.008
6.018LysLys: 6.018 ± 0.859
5.233LysLeu: 5.233 ± 1.096
2.355LysMet: 2.355 ± 0.677
2.878LysAsn: 2.878 ± 1.051
3.14LysPro: 3.14 ± 0.55
1.832LysGln: 1.832 ± 1.376
3.663LysArg: 3.663 ± 0.867
4.186LysSer: 4.186 ± 0.869
4.71LysThr: 4.71 ± 0.697
2.878LysVal: 2.878 ± 0.652
2.616LysTrp: 2.616 ± 0.912
2.355LysTyr: 2.355 ± 0.795
0.0LysXaa: 0.0 ± 0.0
Leu
3.663LeuAla: 3.663 ± 0.735
1.57LeuCys: 1.57 ± 0.658
4.186LeuAsp: 4.186 ± 0.687
4.971LeuGlu: 4.971 ± 1.258
3.925LeuPhe: 3.925 ± 0.918
4.971LeuGly: 4.971 ± 1.122
1.047LeuHis: 1.047 ± 0.641
7.588LeuIle: 7.588 ± 1.698
12.036LeuLys: 12.036 ± 3.071
9.419LeuLeu: 9.419 ± 2.502
2.878LeuMet: 2.878 ± 1.213
6.279LeuAsn: 6.279 ± 1.273
2.355LeuPro: 2.355 ± 0.987
1.308LeuGln: 1.308 ± 0.426
6.279LeuArg: 6.279 ± 2.264
5.756LeuSer: 5.756 ± 1.248
9.681LeuThr: 9.681 ± 1.796
4.186LeuVal: 4.186 ± 0.996
1.047LeuTrp: 1.047 ± 0.835
2.355LeuTyr: 2.355 ± 1.061
0.0LeuXaa: 0.0 ± 0.0
Met
1.57MetAla: 1.57 ± 0.841
0.785MetCys: 0.785 ± 0.329
0.785MetAsp: 0.785 ± 0.371
1.57MetGlu: 1.57 ± 0.524
1.047MetPhe: 1.047 ± 0.478
2.093MetGly: 2.093 ± 0.674
0.262MetHis: 0.262 ± 0.152
2.878MetIle: 2.878 ± 1.138
2.878MetLys: 2.878 ± 1.207
2.093MetLeu: 2.093 ± 0.772
0.785MetMet: 0.785 ± 0.297
1.308MetAsn: 1.308 ± 0.603
0.262MetPro: 0.262 ± 0.333
1.047MetGln: 1.047 ± 0.607
0.523MetArg: 0.523 ± 0.446
2.355MetSer: 2.355 ± 0.587
2.355MetThr: 2.355 ± 1.479
1.308MetVal: 1.308 ± 0.462
0.262MetTrp: 0.262 ± 0.333
0.523MetTyr: 0.523 ± 0.267
0.0MetXaa: 0.0 ± 0.0
Asn
3.401AsnAla: 3.401 ± 2.373
2.093AsnCys: 2.093 ± 0.647
3.925AsnAsp: 3.925 ± 0.588
2.878AsnGlu: 2.878 ± 1.208
1.308AsnPhe: 1.308 ± 0.51
2.616AsnGly: 2.616 ± 1.357
0.785AsnHis: 0.785 ± 0.297
3.14AsnIle: 3.14 ± 1.207
3.663AsnLys: 3.663 ± 0.728
6.018AsnLeu: 6.018 ± 0.922
0.523AsnMet: 0.523 ± 0.406
4.186AsnAsn: 4.186 ± 1.375
3.925AsnPro: 3.925 ± 0.821
3.14AsnGln: 3.14 ± 1.356
1.57AsnArg: 1.57 ± 0.7
2.878AsnSer: 2.878 ± 0.816
3.14AsnThr: 3.14 ± 1.009
2.355AsnVal: 2.355 ± 0.825
0.262AsnTrp: 0.262 ± 0.152
3.14AsnTyr: 3.14 ± 1.139
0.0AsnXaa: 0.0 ± 0.0
Pro
2.616ProAla: 2.616 ± 0.895
0.262ProCys: 0.262 ± 0.718
2.616ProAsp: 2.616 ± 0.905
1.57ProGlu: 1.57 ± 0.429
0.785ProPhe: 0.785 ± 0.371
2.093ProGly: 2.093 ± 0.538
0.523ProHis: 0.523 ± 0.304
3.663ProIle: 3.663 ± 1.137
3.14ProLys: 3.14 ± 0.721
3.401ProLeu: 3.401 ± 1.038
0.262ProMet: 0.262 ± 0.152
1.832ProAsn: 1.832 ± 0.575
1.832ProPro: 1.832 ± 0.753
0.523ProGln: 0.523 ± 0.304
1.57ProArg: 1.57 ± 0.808
4.971ProSer: 4.971 ± 0.849
2.616ProThr: 2.616 ± 1.304
3.663ProVal: 3.663 ± 0.534
1.047ProTrp: 1.047 ± 0.607
1.308ProTyr: 1.308 ± 0.742
0.0ProXaa: 0.0 ± 0.0
Gln
2.355GlnAla: 2.355 ± 0.651
0.0GlnCys: 0.0 ± 0.0
2.616GlnAsp: 2.616 ± 0.563
2.616GlnGlu: 2.616 ± 1.164
0.785GlnPhe: 0.785 ± 0.297
1.308GlnGly: 1.308 ± 0.464
1.832GlnHis: 1.832 ± 0.81
2.878GlnIle: 2.878 ± 0.731
1.57GlnLys: 1.57 ± 0.511
3.663GlnLeu: 3.663 ± 1.468
1.832GlnMet: 1.832 ± 0.971
2.355GlnAsn: 2.355 ± 0.798
0.262GlnPro: 0.262 ± 0.152
0.523GlnGln: 0.523 ± 0.404
1.832GlnArg: 1.832 ± 0.676
2.355GlnSer: 2.355 ± 1.483
0.785GlnThr: 0.785 ± 0.61
1.308GlnVal: 1.308 ± 0.579
0.262GlnTrp: 0.262 ± 0.333
0.262GlnTyr: 0.262 ± 0.152
0.0GlnXaa: 0.0 ± 0.0
Arg
2.878ArgAla: 2.878 ± 0.46
1.308ArgCys: 1.308 ± 0.656
1.832ArgAsp: 1.832 ± 0.908
2.616ArgGlu: 2.616 ± 0.649
2.355ArgPhe: 2.355 ± 0.868
1.832ArgGly: 1.832 ± 0.739
1.832ArgHis: 1.832 ± 0.677
3.14ArgIle: 3.14 ± 1.045
2.616ArgLys: 2.616 ± 2.064
4.186ArgLeu: 4.186 ± 2.15
0.785ArgMet: 0.785 ± 0.504
2.355ArgAsn: 2.355 ± 0.894
2.093ArgPro: 2.093 ± 0.769
0.523ArgGln: 0.523 ± 0.304
2.093ArgArg: 2.093 ± 1.71
3.925ArgSer: 3.925 ± 1.506
2.616ArgThr: 2.616 ± 0.806
4.71ArgVal: 4.71 ± 0.731
1.047ArgTrp: 1.047 ± 0.42
1.57ArgTyr: 1.57 ± 0.547
0.0ArgXaa: 0.0 ± 0.0
Ser
3.663SerAla: 3.663 ± 1.34
1.832SerCys: 1.832 ± 1.282
4.448SerAsp: 4.448 ± 1.201
7.588SerGlu: 7.588 ± 2.049
5.233SerPhe: 5.233 ± 0.963
3.663SerGly: 3.663 ± 2.124
3.663SerHis: 3.663 ± 1.34
4.71SerIle: 4.71 ± 1.356
3.663SerLys: 3.663 ± 1.108
6.018SerLeu: 6.018 ± 1.148
3.14SerMet: 3.14 ± 0.537
4.186SerAsn: 4.186 ± 1.362
2.878SerPro: 2.878 ± 1.122
3.14SerGln: 3.14 ± 0.837
3.925SerArg: 3.925 ± 1.518
6.279SerSer: 6.279 ± 2.049
4.971SerThr: 4.971 ± 1.205
2.878SerVal: 2.878 ± 1.131
0.785SerTrp: 0.785 ± 0.456
3.925SerTyr: 3.925 ± 1.553
0.0SerXaa: 0.0 ± 0.0
Thr
1.308ThrAla: 1.308 ± 0.679
0.523ThrCys: 0.523 ± 0.349
2.355ThrAsp: 2.355 ± 0.841
5.233ThrGlu: 5.233 ± 1.955
3.14ThrPhe: 3.14 ± 0.972
4.971ThrGly: 4.971 ± 0.82
1.047ThrHis: 1.047 ± 0.42
4.186ThrIle: 4.186 ± 1.375
4.186ThrLys: 4.186 ± 1.034
5.495ThrLeu: 5.495 ± 1.41
1.57ThrMet: 1.57 ± 0.671
4.971ThrAsn: 4.971 ± 1.613
2.616ThrPro: 2.616 ± 1.006
2.616ThrGln: 2.616 ± 1.68
3.663ThrArg: 3.663 ± 1.347
4.186ThrSer: 4.186 ± 1.33
3.14ThrThr: 3.14 ± 1.105
2.616ThrVal: 2.616 ± 0.944
1.047ThrTrp: 1.047 ± 0.577
2.355ThrTyr: 2.355 ± 0.789
0.0ThrXaa: 0.0 ± 0.0
Val
1.57ValAla: 1.57 ± 0.616
1.832ValCys: 1.832 ± 0.42
3.401ValAsp: 3.401 ± 1.106
1.832ValGlu: 1.832 ± 0.717
2.355ValPhe: 2.355 ± 1.067
2.616ValGly: 2.616 ± 0.936
0.785ValHis: 0.785 ± 0.41
5.756ValIle: 5.756 ± 1.041
2.093ValLys: 2.093 ± 0.856
4.71ValLeu: 4.71 ± 1.754
0.785ValMet: 0.785 ± 0.562
3.925ValAsn: 3.925 ± 0.628
1.832ValPro: 1.832 ± 0.911
1.57ValGln: 1.57 ± 0.594
2.616ValArg: 2.616 ± 1.072
4.971ValSer: 4.971 ± 0.953
2.355ValThr: 2.355 ± 0.954
1.308ValVal: 1.308 ± 0.826
0.785ValTrp: 0.785 ± 0.566
1.308ValTyr: 1.308 ± 0.949
0.0ValXaa: 0.0 ± 0.0
Trp
1.047TrpAla: 1.047 ± 0.589
0.0TrpCys: 0.0 ± 0.0
0.262TrpAsp: 0.262 ± 0.152
1.047TrpGlu: 1.047 ± 0.607
0.785TrpPhe: 0.785 ± 0.566
1.047TrpGly: 1.047 ± 0.427
0.523TrpHis: 0.523 ± 0.294
1.308TrpIle: 1.308 ± 0.631
1.308TrpLys: 1.308 ± 0.51
1.308TrpLeu: 1.308 ± 0.668
0.523TrpMet: 0.523 ± 0.294
0.785TrpAsn: 0.785 ± 0.329
0.523TrpPro: 0.523 ± 0.294
0.0TrpGln: 0.0 ± 0.0
0.785TrpArg: 0.785 ± 0.631
2.616TrpSer: 2.616 ± 0.417
2.616TrpThr: 2.616 ± 0.602
1.308TrpVal: 1.308 ± 0.968
0.0TrpTrp: 0.0 ± 0.0
1.832TrpTyr: 1.832 ± 0.6
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.308TyrAla: 1.308 ± 0.742
1.308TyrCys: 1.308 ± 0.613
2.093TyrAsp: 2.093 ± 0.612
1.57TyrGlu: 1.57 ± 0.671
3.14TyrPhe: 3.14 ± 0.669
3.401TyrGly: 3.401 ± 0.833
0.785TyrHis: 0.785 ± 0.442
2.616TyrIle: 2.616 ± 1.224
2.878TyrLys: 2.878 ± 0.765
4.448TyrLeu: 4.448 ± 1.492
0.262TyrMet: 0.262 ± 0.152
1.308TyrAsn: 1.308 ± 0.656
2.355TyrPro: 2.355 ± 0.543
1.57TyrGln: 1.57 ± 0.658
1.832TyrArg: 1.832 ± 0.739
4.186TyrSer: 4.186 ± 1.281
1.047TyrThr: 1.047 ± 0.468
1.832TyrVal: 1.832 ± 0.74
0.262TyrTrp: 0.262 ± 0.333
0.785TyrTyr: 0.785 ± 0.329
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (3823 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski