Amino acid dipepetide frequency for Niakha virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.974AlaAla: 1.974 ± 0.9
1.41AlaCys: 1.41 ± 0.965
2.256AlaAsp: 2.256 ± 0.767
2.538AlaGlu: 2.538 ± 0.604
1.41AlaPhe: 1.41 ± 0.795
2.538AlaGly: 2.538 ± 1.094
1.41AlaHis: 1.41 ± 0.874
2.538AlaIle: 2.538 ± 1.877
1.974AlaLys: 1.974 ± 1.343
3.948AlaLeu: 3.948 ± 0.696
1.692AlaMet: 1.692 ± 0.602
1.692AlaAsn: 1.692 ± 0.381
1.41AlaPro: 1.41 ± 0.952
2.538AlaGln: 2.538 ± 0.724
1.974AlaArg: 1.974 ± 0.764
2.538AlaSer: 2.538 ± 0.397
1.974AlaThr: 1.974 ± 1.01
1.692AlaVal: 1.692 ± 0.691
0.564AlaTrp: 0.564 ± 0.299
0.846AlaTyr: 0.846 ± 0.56
0.0AlaXaa: 0.0 ± 0.0
Cys
1.974CysAla: 1.974 ± 0.518
0.564CysCys: 0.564 ± 0.276
1.128CysAsp: 1.128 ± 0.598
1.128CysGlu: 1.128 ± 0.485
2.82CysPhe: 2.82 ± 0.832
1.128CysGly: 1.128 ± 0.478
0.564CysHis: 0.564 ± 0.65
0.846CysIle: 0.846 ± 0.284
1.974CysLys: 1.974 ± 1.3
1.41CysLeu: 1.41 ± 0.669
0.282CysMet: 0.282 ± 0.149
1.128CysAsn: 1.128 ± 0.598
1.974CysPro: 1.974 ± 0.632
1.128CysGln: 1.128 ± 0.529
2.256CysArg: 2.256 ± 1.046
1.974CysSer: 1.974 ± 1.488
0.282CysThr: 0.282 ± 0.342
1.41CysVal: 1.41 ± 0.595
0.564CysTrp: 0.564 ± 0.299
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.256AspAla: 2.256 ± 1.222
0.564AspCys: 0.564 ± 0.34
2.538AspAsp: 2.538 ± 0.589
2.82AspGlu: 2.82 ± 0.527
2.82AspPhe: 2.82 ± 0.857
1.128AspGly: 1.128 ± 0.471
1.128AspHis: 1.128 ± 0.598
3.948AspIle: 3.948 ± 2.101
3.384AspLys: 3.384 ± 0.4
9.306AspLeu: 9.306 ± 1.679
1.974AspMet: 1.974 ± 0.723
1.692AspAsn: 1.692 ± 0.568
3.666AspPro: 3.666 ± 1.169
2.82AspGln: 2.82 ± 1.114
2.256AspArg: 2.256 ± 0.929
2.538AspSer: 2.538 ± 1.066
3.102AspThr: 3.102 ± 1.316
2.538AspVal: 2.538 ± 1.368
1.974AspTrp: 1.974 ± 0.512
2.256AspTyr: 2.256 ± 0.957
0.0AspXaa: 0.0 ± 0.0
Glu
2.256GluAla: 2.256 ± 1.222
2.538GluCys: 2.538 ± 1.082
2.82GluAsp: 2.82 ± 0.868
6.486GluGlu: 6.486 ± 2.199
3.948GluPhe: 3.948 ± 0.552
5.076GluGly: 5.076 ± 2.165
0.564GluHis: 0.564 ± 0.276
5.922GluIle: 5.922 ± 0.855
4.512GluLys: 4.512 ± 1.092
4.512GluLeu: 4.512 ± 0.934
2.538GluMet: 2.538 ± 0.794
4.23GluAsn: 4.23 ± 0.758
1.974GluPro: 1.974 ± 0.875
0.564GluGln: 0.564 ± 0.41
2.538GluArg: 2.538 ± 0.67
5.076GluSer: 5.076 ± 1.368
3.384GluThr: 3.384 ± 1.208
5.358GluVal: 5.358 ± 1.156
0.846GluTrp: 0.846 ± 0.453
3.666GluTyr: 3.666 ± 0.833
0.0GluXaa: 0.0 ± 0.0
Phe
0.846PheAla: 0.846 ± 0.346
1.128PheCys: 1.128 ± 0.41
3.666PheAsp: 3.666 ± 1.11
2.538PheGlu: 2.538 ± 1.342
2.538PhePhe: 2.538 ± 1.038
2.82PheGly: 2.82 ± 1.005
0.564PheHis: 0.564 ± 0.683
2.82PheIle: 2.82 ± 0.884
5.922PheLys: 5.922 ± 0.669
3.666PheLeu: 3.666 ± 0.326
0.564PheMet: 0.564 ± 0.276
2.82PheAsn: 2.82 ± 1.167
3.948PhePro: 3.948 ± 0.645
1.974PheGln: 1.974 ± 0.632
1.974PheArg: 1.974 ± 0.585
3.384PheSer: 3.384 ± 0.561
1.692PheThr: 1.692 ± 0.405
4.23PheVal: 4.23 ± 0.936
0.846PheTrp: 0.846 ± 0.603
0.846PheTyr: 0.846 ± 0.346
0.0PheXaa: 0.0 ± 0.0
Gly
0.846GlyAla: 0.846 ± 0.56
1.128GlyCys: 1.128 ± 0.679
5.922GlyAsp: 5.922 ± 1.779
3.102GlyGlu: 3.102 ± 1.591
2.82GlyPhe: 2.82 ± 0.747
3.948GlyGly: 3.948 ± 0.979
0.564GlyHis: 0.564 ± 0.299
4.23GlyIle: 4.23 ± 1.355
5.076GlyLys: 5.076 ± 2.835
7.332GlyLeu: 7.332 ± 1.722
1.41GlyMet: 1.41 ± 0.434
1.692GlyAsn: 1.692 ± 1.154
2.256GlyPro: 2.256 ± 1.341
2.256GlyGln: 2.256 ± 0.879
1.41GlyArg: 1.41 ± 0.681
5.64GlySer: 5.64 ± 1.187
2.256GlyThr: 2.256 ± 1.139
3.948GlyVal: 3.948 ± 1.618
1.128GlyTrp: 1.128 ± 0.471
1.974GlyTyr: 1.974 ± 0.841
0.0GlyXaa: 0.0 ± 0.0
His
0.564HisAla: 0.564 ± 0.276
0.564HisCys: 0.564 ± 0.65
0.846HisAsp: 0.846 ± 0.448
1.41HisGlu: 1.41 ± 0.54
1.128HisPhe: 1.128 ± 0.471
0.0HisGly: 0.0 ± 0.0
0.846HisHis: 0.846 ± 0.284
2.538HisIle: 2.538 ± 0.601
1.41HisLys: 1.41 ± 0.473
1.692HisLeu: 1.692 ± 0.405
0.846HisMet: 0.846 ± 0.284
0.564HisAsn: 0.564 ± 0.464
1.128HisPro: 1.128 ± 0.471
0.846HisGln: 0.846 ± 0.448
1.692HisArg: 1.692 ± 0.461
1.41HisSer: 1.41 ± 0.398
0.282HisThr: 0.282 ± 0.149
0.0HisVal: 0.0 ± 0.0
0.282HisTrp: 0.282 ± 0.342
2.538HisTyr: 2.538 ± 0.827
0.0HisXaa: 0.0 ± 0.0
Ile
0.846IleAla: 0.846 ± 0.415
2.256IleCys: 2.256 ± 0.957
3.102IleAsp: 3.102 ± 0.988
3.666IleGlu: 3.666 ± 1.118
2.256IlePhe: 2.256 ± 0.525
3.384IleGly: 3.384 ± 0.961
1.41IleHis: 1.41 ± 0.691
4.512IleIle: 4.512 ± 2.05
7.05IleLys: 7.05 ± 1.41
7.896IleLeu: 7.896 ± 1.682
2.256IleMet: 2.256 ± 1.388
3.948IleAsn: 3.948 ± 1.421
3.666IlePro: 3.666 ± 0.589
4.23IleGln: 4.23 ± 1.628
3.384IleArg: 3.384 ± 0.794
5.076IleSer: 5.076 ± 0.853
4.794IleThr: 4.794 ± 0.748
5.922IleVal: 5.922 ± 1.508
0.846IleTrp: 0.846 ± 0.603
3.384IleTyr: 3.384 ± 0.514
0.0IleXaa: 0.0 ± 0.0
Lys
3.666LysAla: 3.666 ± 0.787
1.41LysCys: 1.41 ± 0.339
5.076LysAsp: 5.076 ± 0.767
5.922LysGlu: 5.922 ± 1.237
3.948LysPhe: 3.948 ± 0.997
4.512LysGly: 4.512 ± 1.261
1.974LysHis: 1.974 ± 0.632
7.05LysIle: 7.05 ± 2.343
8.46LysLys: 8.46 ± 2.677
5.076LysLeu: 5.076 ± 1.09
3.102LysMet: 3.102 ± 0.594
4.512LysAsn: 4.512 ± 0.962
2.538LysPro: 2.538 ± 0.77
2.538LysGln: 2.538 ± 0.641
3.102LysArg: 3.102 ± 0.363
4.794LysSer: 4.794 ± 1.223
3.384LysThr: 3.384 ± 1.61
3.384LysVal: 3.384 ± 1.31
0.846LysTrp: 0.846 ± 0.346
2.256LysTyr: 2.256 ± 0.721
0.0LysXaa: 0.0 ± 0.0
Leu
4.794LeuAla: 4.794 ± 0.915
1.692LeuCys: 1.692 ± 0.649
2.538LeuAsp: 2.538 ± 0.75
6.486LeuGlu: 6.486 ± 1.561
4.512LeuPhe: 4.512 ± 0.88
5.64LeuGly: 5.64 ± 1.252
1.41LeuHis: 1.41 ± 0.677
8.178LeuIle: 8.178 ± 2.728
7.332LeuLys: 7.332 ± 1.834
7.05LeuLeu: 7.05 ± 1.298
2.538LeuMet: 2.538 ± 1.012
4.512LeuAsn: 4.512 ± 1.029
2.82LeuPro: 2.82 ± 0.911
1.692LeuGln: 1.692 ± 0.447
6.204LeuArg: 6.204 ± 0.841
9.588LeuSer: 9.588 ± 2.508
6.768LeuThr: 6.768 ± 1.064
3.666LeuVal: 3.666 ± 1.507
1.41LeuTrp: 1.41 ± 0.763
2.82LeuTyr: 2.82 ± 0.742
0.0LeuXaa: 0.0 ± 0.0
Met
1.692MetAla: 1.692 ± 0.8
0.282MetCys: 0.282 ± 0.395
1.974MetAsp: 1.974 ± 0.917
2.256MetGlu: 2.256 ± 1.125
1.128MetPhe: 1.128 ± 0.409
2.256MetGly: 2.256 ± 0.626
0.0MetHis: 0.0 ± 0.0
2.256MetIle: 2.256 ± 1.196
1.41MetLys: 1.41 ± 0.691
2.82MetLeu: 2.82 ± 0.839
1.128MetMet: 1.128 ± 0.41
1.128MetAsn: 1.128 ± 0.478
1.41MetPro: 1.41 ± 0.698
0.846MetGln: 0.846 ± 0.415
1.128MetArg: 1.128 ± 0.598
1.692MetSer: 1.692 ± 0.606
2.256MetThr: 2.256 ± 0.89
0.846MetVal: 0.846 ± 1.371
0.282MetTrp: 0.282 ± 0.149
0.846MetTyr: 0.846 ± 0.346
0.0MetXaa: 0.0 ± 0.0
Asn
1.974AsnAla: 1.974 ± 0.744
1.128AsnCys: 1.128 ± 0.598
2.256AsnAsp: 2.256 ± 0.6
3.102AsnGlu: 3.102 ± 0.481
1.692AsnPhe: 1.692 ± 0.697
2.82AsnGly: 2.82 ± 0.965
1.974AsnHis: 1.974 ± 0.512
2.82AsnIle: 2.82 ± 1.167
3.948AsnLys: 3.948 ± 0.435
3.102AsnLeu: 3.102 ± 1.072
0.846AsnMet: 0.846 ± 0.448
3.102AsnAsn: 3.102 ± 1.313
2.538AsnPro: 2.538 ± 0.444
1.692AsnGln: 1.692 ± 0.381
2.256AsnArg: 2.256 ± 0.821
3.384AsnSer: 3.384 ± 0.853
3.384AsnThr: 3.384 ± 0.734
2.538AsnVal: 2.538 ± 0.601
0.846AsnTrp: 0.846 ± 0.448
2.538AsnTyr: 2.538 ± 0.847
0.0AsnXaa: 0.0 ± 0.0
Pro
1.692ProAla: 1.692 ± 1.149
0.282ProCys: 0.282 ± 0.395
3.102ProAsp: 3.102 ± 0.834
2.538ProGlu: 2.538 ± 1.246
1.128ProPhe: 1.128 ± 0.471
2.256ProGly: 2.256 ± 0.976
1.41ProHis: 1.41 ± 0.794
2.82ProIle: 2.82 ± 1.556
3.666ProLys: 3.666 ± 1.469
3.102ProLeu: 3.102 ± 0.741
0.846ProMet: 0.846 ± 0.603
1.41ProAsn: 1.41 ± 0.512
1.128ProPro: 1.128 ± 0.821
1.41ProGln: 1.41 ± 0.364
1.692ProArg: 1.692 ± 0.405
5.076ProSer: 5.076 ± 1.319
4.23ProThr: 4.23 ± 1.278
3.102ProVal: 3.102 ± 0.839
0.564ProTrp: 0.564 ± 0.299
1.128ProTyr: 1.128 ± 0.358
0.0ProXaa: 0.0 ± 0.0
Gln
0.846GlnAla: 0.846 ± 0.432
0.846GlnCys: 0.846 ± 0.448
2.538GlnAsp: 2.538 ± 0.315
1.974GlnGlu: 1.974 ± 0.616
1.692GlnPhe: 1.692 ± 0.831
2.256GlnGly: 2.256 ± 0.837
0.564GlnHis: 0.564 ± 0.299
2.538GlnIle: 2.538 ± 1.296
2.538GlnLys: 2.538 ± 1.025
3.102GlnLeu: 3.102 ± 1.191
0.564GlnMet: 0.564 ± 0.276
2.256GlnAsn: 2.256 ± 0.584
0.564GlnPro: 0.564 ± 0.276
0.564GlnGln: 0.564 ± 0.299
2.538GlnArg: 2.538 ± 0.79
3.102GlnSer: 3.102 ± 0.606
1.41GlnThr: 1.41 ± 0.364
0.846GlnVal: 0.846 ± 0.448
0.564GlnTrp: 0.564 ± 0.276
1.974GlnTyr: 1.974 ± 0.738
0.0GlnXaa: 0.0 ± 0.0
Arg
1.41ArgAla: 1.41 ± 0.561
1.692ArgCys: 1.692 ± 0.829
4.512ArgAsp: 4.512 ± 0.563
2.82ArgGlu: 2.82 ± 1.167
1.692ArgPhe: 1.692 ± 1.019
2.256ArgGly: 2.256 ± 0.525
0.564ArgHis: 0.564 ± 0.299
3.666ArgIle: 3.666 ± 0.783
3.102ArgLys: 3.102 ± 1.317
4.512ArgLeu: 4.512 ± 1.105
1.41ArgMet: 1.41 ± 0.473
2.256ArgAsn: 2.256 ± 0.616
1.974ArgPro: 1.974 ± 0.891
1.128ArgGln: 1.128 ± 0.598
0.846ArgArg: 0.846 ± 0.448
4.23ArgSer: 4.23 ± 1.293
3.384ArgThr: 3.384 ± 1.2
3.384ArgVal: 3.384 ± 0.4
0.0ArgTrp: 0.0 ± 0.0
1.41ArgTyr: 1.41 ± 0.512
0.0ArgXaa: 0.0 ± 0.0
Ser
3.384SerAla: 3.384 ± 0.826
1.692SerCys: 1.692 ± 0.568
3.948SerAsp: 3.948 ± 0.602
7.896SerGlu: 7.896 ± 1.021
4.23SerPhe: 4.23 ± 1.536
4.794SerGly: 4.794 ± 2.025
2.256SerHis: 2.256 ± 0.89
5.922SerIle: 5.922 ± 1.419
4.794SerLys: 4.794 ± 0.972
8.178SerLeu: 8.178 ± 0.837
1.692SerMet: 1.692 ± 0.381
4.23SerAsn: 4.23 ± 0.894
2.82SerPro: 2.82 ± 0.757
1.974SerGln: 1.974 ± 0.506
3.948SerArg: 3.948 ± 1.21
7.05SerSer: 7.05 ± 0.653
3.948SerThr: 3.948 ± 0.979
4.23SerVal: 4.23 ± 0.755
1.41SerTrp: 1.41 ± 0.339
1.692SerTyr: 1.692 ± 0.461
0.0SerXaa: 0.0 ± 0.0
Thr
3.666ThrAla: 3.666 ± 0.871
1.974ThrCys: 1.974 ± 0.223
2.82ThrAsp: 2.82 ± 1.11
5.358ThrGlu: 5.358 ± 1.132
2.538ThrPhe: 2.538 ± 0.315
4.512ThrGly: 4.512 ± 1.413
1.41ThrHis: 1.41 ± 0.747
3.102ThrIle: 3.102 ± 0.763
3.102ThrLys: 3.102 ± 0.88
3.666ThrLeu: 3.666 ± 0.426
1.692ThrMet: 1.692 ± 0.423
2.256ThrAsn: 2.256 ± 0.721
1.974ThrPro: 1.974 ± 0.917
1.692ThrGln: 1.692 ± 0.697
2.256ThrArg: 2.256 ± 0.616
4.794ThrSer: 4.794 ± 1.244
2.538ThrThr: 2.538 ± 0.724
3.948ThrVal: 3.948 ± 0.641
2.256ThrTrp: 2.256 ± 0.831
1.41ThrTyr: 1.41 ± 0.795
0.0ThrXaa: 0.0 ± 0.0
Val
2.82ValAla: 2.82 ± 1.181
1.692ValCys: 1.692 ± 0.606
2.538ValAsp: 2.538 ± 0.583
4.23ValGlu: 4.23 ± 1.899
2.256ValPhe: 2.256 ± 0.455
3.666ValGly: 3.666 ± 1.904
1.128ValHis: 1.128 ± 0.478
4.512ValIle: 4.512 ± 0.311
3.384ValLys: 3.384 ± 0.823
5.076ValLeu: 5.076 ± 0.859
1.128ValMet: 1.128 ± 0.358
3.102ValAsn: 3.102 ± 0.558
1.692ValPro: 1.692 ± 0.602
1.41ValGln: 1.41 ± 0.747
2.256ValArg: 2.256 ± 1.139
4.512ValSer: 4.512 ± 1.585
5.922ValThr: 5.922 ± 1.007
2.82ValVal: 2.82 ± 0.846
1.128ValTrp: 1.128 ± 0.358
1.128ValTyr: 1.128 ± 0.598
0.0ValXaa: 0.0 ± 0.0
Trp
0.564TrpAla: 0.564 ± 0.41
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.41TrpGlu: 1.41 ± 0.561
1.128TrpPhe: 1.128 ± 0.358
1.128TrpGly: 1.128 ± 0.471
0.282TrpHis: 0.282 ± 0.149
1.128TrpIle: 1.128 ± 0.358
1.128TrpLys: 1.128 ± 0.553
2.538TrpLeu: 2.538 ± 0.87
0.564TrpMet: 0.564 ± 0.34
0.282TrpAsn: 0.282 ± 0.149
0.564TrpPro: 0.564 ± 0.299
0.564TrpGln: 0.564 ± 0.299
0.564TrpArg: 0.564 ± 0.276
1.692TrpSer: 1.692 ± 0.568
1.128TrpThr: 1.128 ± 0.529
0.846TrpVal: 0.846 ± 0.432
0.0TrpTrp: 0.0 ± 0.0
1.128TrpTyr: 1.128 ± 0.553
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.846TyrAla: 0.846 ± 0.284
1.692TyrCys: 1.692 ± 0.894
1.128TyrAsp: 1.128 ± 0.41
1.128TyrGlu: 1.128 ± 0.36
3.102TyrPhe: 3.102 ± 0.985
2.82TyrGly: 2.82 ± 0.911
0.564TyrHis: 0.564 ± 0.299
2.538TyrIle: 2.538 ± 0.708
3.384TyrLys: 3.384 ± 0.755
3.948TyrLeu: 3.948 ± 0.562
0.282TyrMet: 0.282 ± 0.395
1.128TyrAsn: 1.128 ± 0.553
2.538TyrPro: 2.538 ± 0.674
1.41TyrGln: 1.41 ± 0.449
1.974TyrArg: 1.974 ± 0.965
2.538TyrSer: 2.538 ± 0.658
0.846TyrThr: 0.846 ± 0.415
1.692TyrVal: 1.692 ± 0.602
0.282TyrTrp: 0.282 ± 0.342
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3547 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski