Amino acid dipeptide frequency for Inhangapi virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red, and under-represented ones in blue, in the table.
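As an illustration of the counting described above, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the use of one-letter codes (with `X` standing for Xaa), and the choice to normalize by the total number of dipeptides are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies for all 441 dipeptides (20 residues + X).

    Assumption: frequencies are normalized by the total number of
    overlapping dipeptides; the database may normalize differently.
    """
    alphabet = STANDARD + "X"
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not the Inhangapi virus proteome).
freqs = dipeptide_frequencies(["MKLLA", "AAB"])
print(len(freqs))  # 441 possible dipeptides
```

In the toy input, `B` is not a standard residue, so `AAB` is treated as `AAX` and contributes the dipeptides `AA` and `AX`.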

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
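The unit conversion, and the comparison against the uniform expectation of 1/400, can be sketched in a few lines of Python. The helper names are hypothetical, and using the uniform 2.5‰ value as the threshold is an assumption; the page does not state exactly how the red and blue marking is decided.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def representation(value_permille, expected=UNIFORM_EXPECTED_PERMILLE):
    """Classify a dipeptide relative to the (assumed) uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: LeuIle and CysCys.
for name, value in [("LeuIle", 12.329), ("CysCys", 0.0)]:
    print(name, permille_to_fraction(value), representation(value))
```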

For more information, see the sequence space article on Wikipedia.

Residue order (rows: first residue, columns: second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.836 ± 0.654
AlaCys: 0.525 ± 0.754
AlaAsp: 3.148 ± 1.572
AlaGlu: 2.361 ± 1.541
AlaPhe: 0.787 ± 0.343
AlaGly: 1.836 ± 1.357
AlaHis: 1.312 ± 0.521
AlaIle: 4.197 ± 1.075
AlaLys: 1.836 ± 0.53
AlaLeu: 5.771 ± 1.579
AlaMet: 0.262 ± 0.164
AlaAsn: 1.836 ± 0.412
AlaPro: 1.049 ± 0.312
AlaGln: 0.787 ± 0.333
AlaArg: 0.525 ± 0.562
AlaSer: 1.049 ± 0.432
AlaThr: 2.623 ± 0.526
AlaVal: 1.049 ± 0.771
AlaTrp: 1.049 ± 0.391
AlaTyr: 0.525 ± 0.32
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.262 ± 0.164
CysCys: 0.0 ± 0.0
CysAsp: 0.525 ± 0.328
CysGlu: 1.836 ± 0.368
CysPhe: 0.525 ± 0.555
CysGly: 1.574 ± 0.496
CysHis: 0.787 ± 0.343
CysIle: 0.787 ± 0.492
CysLys: 1.574 ± 0.987
CysLeu: 0.787 ± 0.294
CysMet: 0.0 ± 0.0
CysAsn: 0.787 ± 0.701
CysPro: 0.787 ± 0.701
CysGln: 1.049 ± 0.751
CysArg: 0.525 ± 0.328
CysSer: 2.623 ± 0.888
CysThr: 0.262 ± 0.375
CysVal: 0.787 ± 0.422
CysTrp: 0.262 ± 0.164
CysTyr: 1.049 ± 0.501
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.099 ± 0.615
AspCys: 1.049 ± 0.432
AspAsp: 3.935 ± 2.155
AspGlu: 4.197 ± 1.242
AspPhe: 2.361 ± 0.741
AspGly: 4.722 ± 1.517
AspHis: 1.574 ± 0.384
AspIle: 3.935 ± 0.509
AspLys: 3.41 ± 1.133
AspLeu: 6.296 ± 0.756
AspMet: 1.574 ± 0.598
AspAsn: 3.148 ± 1.298
AspPro: 4.722 ± 1.16
AspGln: 1.312 ± 0.285
AspArg: 1.312 ± 0.521
AspSer: 3.673 ± 0.666
AspThr: 1.836 ± 1.135
AspVal: 3.41 ± 1.655
AspTrp: 1.836 ± 0.897
AspTyr: 3.148 ± 1.111
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.525 ± 0.366
GluCys: 0.525 ± 0.32
GluAsp: 2.361 ± 0.81
GluGlu: 6.034 ± 1.165
GluPhe: 1.836 ± 0.51
GluGly: 3.148 ± 0.934
GluHis: 1.574 ± 1.066
GluIle: 6.558 ± 1.0
GluLys: 6.034 ± 1.614
GluLeu: 6.821 ± 1.803
GluMet: 1.836 ± 0.517
GluAsn: 3.935 ± 0.696
GluPro: 1.836 ± 0.541
GluGln: 2.099 ± 0.377
GluArg: 1.312 ± 0.561
GluSer: 6.034 ± 1.391
GluThr: 3.41 ± 0.964
GluVal: 3.148 ± 1.099
GluTrp: 1.049 ± 0.837
GluTyr: 2.099 ± 0.658
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.525 ± 0.32
PheCys: 1.574 ± 0.878
PheAsp: 1.312 ± 0.655
PheGlu: 1.836 ± 0.644
PhePhe: 2.099 ± 0.856
PheGly: 1.574 ± 0.553
PheHis: 0.525 ± 0.328
PheIle: 3.673 ± 1.318
PheLys: 2.099 ± 0.578
PheLeu: 2.623 ± 0.639
PheMet: 0.787 ± 0.362
PheAsn: 1.574 ± 0.576
PhePro: 1.574 ± 0.903
PheGln: 2.623 ± 0.496
PheArg: 1.836 ± 0.413
PheSer: 4.46 ± 0.802
PheThr: 2.361 ± 0.628
PheVal: 3.41 ± 0.737
PheTrp: 1.049 ± 0.51
PheTyr: 1.049 ± 0.432
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.574 ± 0.89
GlyCys: 0.262 ± 0.305
GlyAsp: 4.722 ± 0.956
GlyGlu: 2.623 ± 1.617
GlyPhe: 2.099 ± 0.997
GlyGly: 2.361 ± 0.451
GlyHis: 1.049 ± 0.678
GlyIle: 4.722 ± 1.517
GlyLys: 4.722 ± 1.519
GlyLeu: 6.296 ± 1.75
GlyMet: 2.361 ± 0.591
GlyAsn: 2.099 ± 0.807
GlyPro: 1.574 ± 0.733
GlyGln: 2.361 ± 0.807
GlyArg: 2.361 ± 0.666
GlySer: 3.673 ± 0.608
GlyThr: 5.771 ± 1.657
GlyVal: 3.148 ± 0.534
GlyTrp: 0.525 ± 0.25
GlyTyr: 3.148 ± 0.857
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.574 ± 0.402
HisCys: 0.787 ± 0.294
HisAsp: 1.049 ± 0.312
HisGlu: 0.525 ± 0.32
HisPhe: 0.525 ± 0.328
HisGly: 0.787 ± 0.375
HisHis: 0.525 ± 0.25
HisIle: 2.099 ± 0.67
HisLys: 1.836 ± 0.518
HisLeu: 2.361 ± 0.788
HisMet: 0.787 ± 0.343
HisAsn: 0.787 ± 0.294
HisPro: 3.41 ± 1.043
HisGln: 0.262 ± 0.164
HisArg: 0.525 ± 0.32
HisSer: 1.312 ± 0.82
HisThr: 0.787 ± 0.375
HisVal: 1.574 ± 0.751
HisTrp: 0.525 ± 0.25
HisTyr: 0.262 ± 0.164
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.623 ± 1.685
IleCys: 2.099 ± 0.807
IleAsp: 4.46 ± 1.404
IleGlu: 6.296 ± 1.23
IlePhe: 2.623 ± 0.778
IleGly: 6.034 ± 1.488
IleHis: 2.361 ± 0.529
IleIle: 5.247 ± 1.85
IleLys: 8.395 ± 1.466
IleLeu: 8.132 ± 1.611
IleMet: 3.148 ± 0.852
IleAsn: 4.984 ± 2.26
IlePro: 3.673 ± 0.826
IleGln: 3.148 ± 0.665
IleArg: 6.034 ± 1.522
IleSer: 6.296 ± 0.81
IleThr: 3.41 ± 0.564
IleVal: 3.41 ± 0.896
IleTrp: 1.312 ± 0.396
IleTyr: 4.197 ± 0.902
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.148 ± 2.091
LysCys: 1.574 ± 0.496
LysAsp: 4.984 ± 1.709
LysGlu: 3.935 ± 0.869
LysPhe: 2.361 ± 0.625
LysGly: 4.722 ± 0.856
LysHis: 1.574 ± 0.433
LysIle: 7.608 ± 1.298
LysLys: 7.608 ± 1.639
LysLeu: 5.771 ± 1.233
LysMet: 2.886 ± 0.854
LysAsn: 4.984 ± 0.815
LysPro: 3.41 ± 2.486
LysGln: 2.361 ± 0.666
LysArg: 3.41 ± 0.594
LysSer: 5.771 ± 0.836
LysThr: 3.673 ± 0.487
LysVal: 3.673 ± 0.397
LysTrp: 2.099 ± 0.447
LysTyr: 2.361 ± 0.697
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.148 ± 0.547
LeuCys: 1.574 ± 0.413
LeuAsp: 6.296 ± 1.52
LeuGlu: 5.771 ± 1.412
LeuPhe: 2.361 ± 0.697
LeuGly: 3.41 ± 1.187
LeuHis: 1.574 ± 0.413
LeuIle: 12.329 ± 2.517
LeuLys: 7.87 ± 3.42
LeuLeu: 5.509 ± 1.052
LeuMet: 3.41 ± 0.88
LeuAsn: 5.509 ± 1.726
LeuPro: 4.984 ± 1.353
LeuGln: 1.574 ± 0.402
LeuArg: 4.46 ± 2.102
LeuSer: 7.87 ± 2.355
LeuThr: 7.345 ± 2.458
LeuVal: 3.673 ± 0.535
LeuTrp: 1.312 ± 0.762
LeuTyr: 3.148 ± 0.832
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.312 ± 0.991
MetCys: 0.262 ± 0.164
MetAsp: 1.574 ± 0.576
MetGlu: 2.623 ± 0.989
MetPhe: 2.361 ± 1.222
MetGly: 2.623 ± 0.448
MetHis: 0.525 ± 0.25
MetIle: 3.41 ± 1.046
MetLys: 2.886 ± 1.065
MetLeu: 2.099 ± 0.807
MetMet: 0.787 ± 0.474
MetAsn: 1.312 ± 0.559
MetPro: 0.525 ± 0.455
MetGln: 0.525 ± 0.386
MetArg: 1.574 ± 0.685
MetSer: 1.836 ± 0.704
MetThr: 2.099 ± 0.696
MetVal: 1.312 ± 0.452
MetTrp: 0.0 ± 0.0
MetTyr: 0.525 ± 0.328
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.836 ± 0.578
AsnCys: 0.262 ± 0.5
AsnAsp: 2.623 ± 0.833
AsnGlu: 1.574 ± 0.733
AsnPhe: 2.623 ± 0.374
AsnGly: 1.836 ± 1.293
AsnHis: 1.836 ± 0.518
AsnIle: 3.673 ± 0.854
AsnLys: 5.247 ± 0.779
AsnLeu: 6.034 ± 1.616
AsnMet: 2.361 ± 0.625
AsnAsn: 3.41 ± 0.772
AsnPro: 3.41 ± 1.004
AsnGln: 3.41 ± 0.725
AsnArg: 1.574 ± 0.413
AsnSer: 2.886 ± 0.425
AsnThr: 2.886 ± 1.118
AsnVal: 3.41 ± 0.636
AsnTrp: 1.574 ± 0.609
AsnTyr: 3.148 ± 1.072
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.049 ± 0.732
ProCys: 0.787 ± 0.592
ProAsp: 4.46 ± 0.889
ProGlu: 2.099 ± 1.072
ProPhe: 1.312 ± 0.474
ProGly: 3.673 ± 1.393
ProHis: 1.312 ± 0.285
ProIle: 4.197 ± 0.472
ProLys: 2.623 ± 0.777
ProLeu: 4.197 ± 1.77
ProMet: 1.574 ± 0.667
ProAsn: 1.574 ± 0.6
ProPro: 4.197 ± 3.297
ProGln: 1.312 ± 0.78
ProArg: 1.312 ± 0.664
ProSer: 5.509 ± 1.941
ProThr: 1.836 ± 0.51
ProVal: 2.623 ± 1.538
ProTrp: 0.525 ± 0.328
ProTyr: 2.623 ± 0.526
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.574 ± 0.379
GlnCys: 1.574 ± 1.291
GlnAsp: 1.049 ± 0.751
GlnGlu: 2.623 ± 1.315
GlnPhe: 0.787 ± 0.294
GlnGly: 2.623 ± 0.974
GlnHis: 0.787 ± 0.343
GlnIle: 3.935 ± 0.884
GlnLys: 1.836 ± 0.474
GlnLeu: 2.361 ± 0.546
GlnMet: 1.574 ± 0.728
GlnAsn: 1.836 ± 0.413
GlnPro: 0.262 ± 0.426
GlnGln: 0.787 ± 0.45
GlnArg: 1.312 ± 0.762
GlnSer: 2.623 ± 0.992
GlnThr: 1.312 ± 0.828
GlnVal: 1.312 ± 0.521
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.574 ± 0.667
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.623 ± 0.783
ArgCys: 0.525 ± 0.25
ArgAsp: 2.361 ± 0.595
ArgGlu: 3.148 ± 1.174
ArgPhe: 3.41 ± 0.31
ArgGly: 3.148 ± 0.493
ArgHis: 0.787 ± 0.492
ArgIle: 2.886 ± 1.047
ArgLys: 3.41 ± 0.914
ArgLeu: 1.836 ± 0.578
ArgMet: 1.049 ± 0.639
ArgAsn: 3.148 ± 0.667
ArgPro: 1.049 ± 0.501
ArgGln: 1.049 ± 0.405
ArgArg: 3.148 ± 0.949
ArgSer: 3.673 ± 0.712
ArgThr: 3.673 ± 0.731
ArgVal: 2.099 ± 0.593
ArgTrp: 0.525 ± 0.25
ArgTyr: 1.312 ± 0.455
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.148 ± 1.191
SerCys: 0.787 ± 0.492
SerAsp: 4.46 ± 1.116
SerGlu: 4.984 ± 1.08
SerPhe: 3.41 ± 0.654
SerGly: 4.722 ± 0.64
SerHis: 0.787 ± 0.677
SerIle: 6.034 ± 1.892
SerLys: 5.247 ± 0.903
SerLeu: 8.395 ± 1.545
SerMet: 1.836 ± 1.119
SerAsn: 4.984 ± 0.968
SerPro: 2.361 ± 0.685
SerGln: 3.148 ± 0.665
SerArg: 5.247 ± 0.706
SerSer: 6.558 ± 1.466
SerThr: 4.984 ± 1.445
SerVal: 2.099 ± 0.804
SerTrp: 2.099 ± 0.663
SerTyr: 2.361 ± 1.309
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.361 ± 1.746
ThrCys: 1.049 ± 0.553
ThrAsp: 3.673 ± 1.216
ThrGlu: 3.673 ± 1.354
ThrPhe: 2.099 ± 1.477
ThrGly: 2.623 ± 0.597
ThrHis: 0.787 ± 0.375
ThrIle: 4.984 ± 0.439
ThrLys: 5.247 ± 2.032
ThrLeu: 6.034 ± 1.795
ThrMet: 1.312 ± 0.285
ThrAsn: 1.574 ± 0.697
ThrPro: 3.41 ± 1.084
ThrGln: 1.312 ± 0.578
ThrArg: 2.623 ± 1.387
ThrSer: 4.722 ± 0.996
ThrThr: 2.099 ± 0.579
ThrVal: 2.886 ± 1.147
ThrTrp: 1.312 ± 0.543
ThrTyr: 2.361 ± 0.807
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.312 ± 0.789
ValCys: 0.787 ± 0.294
ValAsp: 3.148 ± 1.481
ValGlu: 2.099 ± 0.527
ValPhe: 2.623 ± 0.547
ValGly: 2.099 ± 0.663
ValHis: 1.312 ± 0.534
ValIle: 2.886 ± 1.29
ValLys: 2.886 ± 0.723
ValLeu: 4.197 ± 0.866
ValMet: 1.312 ± 0.534
ValAsn: 2.886 ± 1.079
ValPro: 3.673 ± 0.469
ValGln: 0.787 ± 0.333
ValArg: 2.361 ± 0.605
ValSer: 4.197 ± 0.682
ValThr: 3.41 ± 1.07
ValVal: 2.099 ± 0.627
ValTrp: 0.262 ± 0.305
ValTyr: 2.886 ± 0.722
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.525 ± 0.25
TrpCys: 0.262 ± 0.164
TrpAsp: 0.525 ± 0.328
TrpGlu: 2.099 ± 0.578
TrpPhe: 1.049 ± 0.51
TrpGly: 1.574 ± 0.695
TrpHis: 0.525 ± 0.328
TrpIle: 2.623 ± 0.708
TrpLys: 1.049 ± 0.464
TrpLeu: 1.049 ± 0.465
TrpMet: 0.525 ± 0.32
TrpAsn: 1.049 ± 0.501
TrpPro: 1.312 ± 0.593
TrpGln: 0.262 ± 0.5
TrpArg: 0.262 ± 0.305
TrpSer: 1.049 ± 0.312
TrpThr: 1.049 ± 0.501
TrpVal: 0.525 ± 0.455
TrpTrp: 0.262 ± 0.305
TrpTyr: 0.787 ± 0.294
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.312 ± 0.556
TyrCys: 0.525 ± 0.328
TyrAsp: 2.886 ± 0.829
TyrGlu: 1.836 ± 0.773
TyrPhe: 1.312 ± 0.543
TyrGly: 2.361 ± 1.031
TyrHis: 0.787 ± 0.533
TyrIle: 2.099 ± 0.593
TyrLys: 2.361 ± 0.883
TyrLeu: 6.558 ± 1.159
TyrMet: 0.525 ± 0.328
TyrAsn: 4.197 ± 1.197
TyrPro: 1.574 ± 0.697
TyrGln: 1.574 ± 0.462
TyrArg: 3.148 ± 0.767
TyrSer: 1.836 ± 0.867
TyrThr: 1.312 ± 0.642
TyrVal: 1.574 ± 0.693
TyrTrp: 0.787 ± 0.532
TyrTyr: 0.787 ± 0.294
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3813 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
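A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the page says no more than "bootstrapping (×100) at the protein level", so resampling whole proteins with replacement and reporting the sample standard deviation of the resampled estimates are assumptions, as are the function names and the toy sequences.

```python
import random
from statistics import stdev


def dipeptide_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    count = total = 0
    for seq in proteins:
        # Overlapping pairs within a single protein.
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        count += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the estimate when whole proteins are resampled.

    Assumption: the reported error is the standard deviation over
    n_resamples resamples of the protein list, drawn with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return stdev(estimates)


# Hypothetical toy proteome (not the Inhangapi virus sequences).
proteins = ["MKLLA", "LLKAA", "AKLMK"]
err = bootstrap_error(proteins, "LL")
print(dipeptide_permille(proteins, "LL"), "±", err)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only 6 proteins in this proteome the error estimate mostly reflects differences between proteins.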

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski