Amino acid dipeptide frequency for Isfahan virus (ISFV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
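As a rough illustration of what a dipeptide frequency is, the following Python sketch counts adjacent residue pairs in a set of protein sequences and reports them in per mille. The function name, the one-letter residue codes, and the choice to normalise by the total number of counted pairs are assumptions made for this example; the database's exact procedure is not described here and may differ.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Hypothetical helper: sequences are plain strings of one-letter residue codes.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein;
        # pairs never span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of pairs counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences for illustration only, not taken from the ISFV proteome.
freqs = dipeptide_frequencies(["MAAL", "AAK"])
```

Here the two toy sequences contain five pairs in total (MA, AA, AL, AA, AK), so `freqs["AA"]` is 2 out of 5, or 400 per mille.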

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
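The uniform expectation and the per mille convention described above can be illustrated with a short Python sketch. The function names are hypothetical, and comparing against 2.5‰ (400 dipeptides) rather than the 441-dipeptide baseline is an assumption; the page does not state which threshold drives the red and blue marking.

```python
def expected_per_mille(n_residue_types=20):
    """Expected frequency of one dipeptide, in per mille, if all pairs were equally likely."""
    return 1000.0 / n_residue_types ** 2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(freq_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # would be marked red
    if freq_per_mille < expected:
        return "under"   # would be marked blue
    return "as expected"


expected = expected_per_mille()           # 2.5 per mille for 400 dipeptides
label_serser = classify(11.54, expected)  # SerSer value from the table
label_cyscys = classify(0.281, expected)  # CysCys value from the table
```

With this baseline, SerSer (11.54‰) comes out as overrepresented and CysCys (0.281‰) as underrepresented. Passing `n_residue_types=21` gives the slightly lower expectation for the 441-dipeptide case.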

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.503 ± 2.093
AlaCys: 1.97 ± 1.071
AlaAsp: 2.533 ± 0.43
AlaGlu: 2.533 ± 0.993
AlaPhe: 1.689 ± 1.345
AlaGly: 2.815 ± 0.511
AlaHis: 0.844 ± 0.307
AlaIle: 2.252 ± 0.562
AlaLys: 2.252 ± 0.541
AlaLeu: 5.629 ± 0.539
AlaMet: 0.563 ± 0.331
AlaAsn: 1.689 ± 0.327
AlaPro: 2.252 ± 1.458
AlaGln: 2.252 ± 0.737
AlaArg: 1.97 ± 0.869
AlaSer: 3.94 ± 0.967
AlaThr: 3.096 ± 1.316
AlaVal: 3.94 ± 0.478
AlaTrp: 0.563 ± 0.76
AlaTyr: 2.252 ± 0.474
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.844 ± 0.348
CysCys: 0.281 ± 0.153
CysAsp: 0.563 ± 0.501
CysGlu: 0.563 ± 0.308
CysPhe: 0.844 ± 0.348
CysGly: 1.126 ± 0.374
CysHis: 0.563 ± 0.501
CysIle: 1.126 ± 0.492
CysLys: 1.689 ± 0.502
CysLeu: 1.407 ± 0.482
CysMet: 0.0 ± 0.0
CysAsn: 0.844 ± 0.672
CysPro: 0.844 ± 0.307
CysGln: 1.126 ± 0.374
CysArg: 1.126 ± 0.292
CysSer: 1.97 ± 0.704
CysThr: 0.563 ± 0.306
CysVal: 0.844 ± 0.459
CysTrp: 0.563 ± 0.306
CysTyr: 0.281 ± 0.447
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.97 ± 0.795
AspCys: 1.126 ± 0.683
AspAsp: 5.066 ± 2.154
AspGlu: 4.503 ± 1.447
AspPhe: 2.252 ± 0.827
AspGly: 3.659 ± 0.579
AspHis: 0.844 ± 0.479
AspIle: 1.97 ± 0.992
AspLys: 2.815 ± 0.855
AspLeu: 6.473 ± 1.372
AspMet: 1.97 ± 0.64
AspAsn: 2.252 ± 0.579
AspPro: 3.096 ± 0.731
AspGln: 1.689 ± 0.61
AspArg: 2.252 ± 0.896
AspSer: 3.377 ± 0.984
AspThr: 2.252 ± 1.086
AspVal: 4.503 ± 0.672
AspTrp: 1.97 ± 0.491
AspTyr: 5.348 ± 0.453
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.689 ± 1.064
GluCys: 0.281 ± 0.378
GluAsp: 3.096 ± 2.838
GluGlu: 3.096 ± 0.619
GluPhe: 4.503 ± 0.847
GluGly: 2.815 ± 0.365
GluHis: 1.126 ± 0.407
GluIle: 2.815 ± 0.881
GluLys: 3.377 ± 1.122
GluLeu: 5.91 ± 0.859
GluMet: 1.126 ± 0.778
GluAsn: 1.407 ± 1.059
GluPro: 1.97 ± 0.555
GluGln: 0.844 ± 0.381
GluArg: 1.407 ± 0.534
GluSer: 4.503 ± 1.133
GluThr: 5.348 ± 1.391
GluVal: 4.222 ± 1.025
GluTrp: 1.126 ± 0.4
GluTyr: 2.533 ± 0.574
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.97 ± 0.568
PheCys: 1.126 ± 0.616
PheAsp: 1.689 ± 0.502
PheGlu: 0.844 ± 0.459
PhePhe: 1.126 ± 0.292
PheGly: 3.377 ± 0.727
PheHis: 2.533 ± 0.541
PheIle: 0.563 ± 0.306
PheLys: 3.377 ± 0.864
PheLeu: 4.503 ± 1.877
PheMet: 1.126 ± 0.4
PheAsn: 1.407 ± 0.366
PhePro: 3.659 ± 1.368
PheGln: 1.97 ± 0.747
PheArg: 1.689 ± 0.61
PheSer: 3.377 ± 0.778
PheThr: 2.533 ± 1.038
PheVal: 1.689 ± 0.403
PheTrp: 0.563 ± 0.331
PheTyr: 0.844 ± 0.726
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.97 ± 0.667
GlyCys: 0.281 ± 0.447
GlyAsp: 3.94 ± 0.534
GlyGlu: 1.689 ± 0.762
GlyPhe: 2.533 ± 0.849
GlyGly: 2.815 ± 1.23
GlyHis: 0.563 ± 0.308
GlyIle: 3.94 ± 0.93
GlyLys: 3.377 ± 0.427
GlyLeu: 9.569 ± 1.444
GlyMet: 3.096 ± 1.039
GlyAsn: 2.252 ± 0.579
GlyPro: 1.97 ± 0.591
GlyGln: 2.533 ± 0.801
GlyArg: 3.94 ± 1.056
GlySer: 4.503 ± 0.711
GlyThr: 3.377 ± 0.855
GlyVal: 4.785 ± 0.837
GlyTrp: 0.844 ± 0.307
GlyTyr: 1.97 ± 1.134
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.126 ± 0.683
HisCys: 0.281 ± 0.378
HisAsp: 1.126 ± 0.374
HisGlu: 1.126 ± 0.612
HisPhe: 1.689 ± 0.661
HisGly: 0.563 ± 0.306
HisHis: 0.281 ± 0.447
HisIle: 2.252 ± 0.633
HisLys: 1.407 ± 0.534
HisLeu: 1.126 ± 0.612
HisMet: 0.281 ± 0.378
HisAsn: 0.281 ± 0.378
HisPro: 1.407 ± 0.595
HisGln: 1.126 ± 0.407
HisArg: 2.252 ± 0.562
HisSer: 2.533 ± 0.387
HisThr: 0.844 ± 0.342
HisVal: 1.689 ± 0.75
HisTrp: 1.126 ± 0.374
HisTyr: 0.281 ± 0.153
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.252 ± 0.321
IleCys: 0.563 ± 0.306
IleAsp: 5.348 ± 0.692
IleGlu: 3.377 ± 0.521
IlePhe: 1.126 ± 0.612
IleGly: 3.377 ± 0.367
IleHis: 1.126 ± 0.612
IleIle: 2.815 ± 0.749
IleLys: 3.94 ± 0.735
IleLeu: 4.785 ± 0.899
IleMet: 0.844 ± 0.725
IleAsn: 2.815 ± 1.001
IlePro: 4.785 ± 1.135
IleGln: 3.94 ± 1.174
IleArg: 6.473 ± 1.165
IleSer: 6.473 ± 1.008
IleThr: 4.503 ± 1.243
IleVal: 2.815 ± 0.879
IleTrp: 0.563 ± 0.389
IleTyr: 2.252 ± 0.593
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.689 ± 0.68
LysCys: 1.126 ± 0.492
LysAsp: 3.377 ± 0.367
LysGlu: 4.222 ± 0.808
LysPhe: 2.252 ± 0.78
LysGly: 4.222 ± 1.446
LysHis: 0.563 ± 0.306
LysIle: 5.91 ± 1.432
LysLys: 5.91 ± 2.168
LysLeu: 5.629 ± 1.096
LysMet: 1.407 ± 0.482
LysAsn: 3.94 ± 1.105
LysPro: 1.689 ± 0.599
LysGln: 2.252 ± 0.644
LysArg: 3.659 ± 1.178
LysSer: 7.318 ± 0.664
LysThr: 3.096 ± 0.711
LysVal: 4.222 ± 0.588
LysTrp: 1.407 ± 0.27
LysTyr: 1.407 ± 0.482
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.785 ± 1.138
LeuCys: 2.815 ± 0.729
LeuAsp: 3.659 ± 0.534
LeuGlu: 6.473 ± 1.722
LeuPhe: 4.785 ± 0.753
LeuGly: 7.036 ± 0.996
LeuHis: 2.533 ± 1.054
LeuIle: 9.006 ± 1.55
LeuLys: 7.318 ± 1.034
LeuLeu: 9.006 ± 1.178
LeuMet: 4.503 ± 0.587
LeuAsn: 5.348 ± 1.047
LeuPro: 3.377 ± 0.415
LeuGln: 2.815 ± 0.52
LeuArg: 7.036 ± 0.816
LeuSer: 7.599 ± 1.238
LeuThr: 6.192 ± 0.973
LeuVal: 4.503 ± 0.843
LeuTrp: 1.407 ± 0.675
LeuTyr: 2.533 ± 0.835
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.97 ± 0.591
MetCys: 0.281 ± 0.153
MetAsp: 1.407 ± 1.25
MetGlu: 2.815 ± 0.25
MetPhe: 1.689 ± 0.542
MetGly: 2.533 ± 0.352
MetHis: 0.563 ± 0.389
MetIle: 2.533 ± 1.069
MetLys: 3.096 ± 0.407
MetLeu: 1.689 ± 0.691
MetMet: 1.407 ± 0.874
MetAsn: 0.844 ± 0.342
MetPro: 0.281 ± 0.38
MetGln: 0.563 ± 0.306
MetArg: 0.281 ± 0.378
MetSer: 3.096 ± 0.676
MetThr: 1.97 ± 0.491
MetVal: 1.126 ± 0.374
MetTrp: 0.563 ± 0.501
MetTyr: 0.563 ± 0.756
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.096 ± 0.731
AsnCys: 0.281 ± 0.153
AsnAsp: 2.252 ± 1.148
AsnGlu: 1.407 ± 0.661
AsnPhe: 1.407 ± 0.27
AsnGly: 2.815 ± 1.5
AsnHis: 1.407 ± 0.534
AsnIle: 1.689 ± 0.635
AsnLys: 1.97 ± 1.333
AsnLeu: 4.785 ± 1.26
AsnMet: 1.126 ± 0.407
AsnAsn: 1.407 ± 0.765
AsnPro: 2.815 ± 0.932
AsnGln: 1.407 ± 0.663
AsnArg: 1.689 ± 0.542
AsnSer: 3.94 ± 1.045
AsnThr: 1.97 ± 0.471
AsnVal: 1.689 ± 0.651
AsnTrp: 1.407 ± 0.765
AsnTyr: 1.689 ± 0.918
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.096 ± 0.64
ProCys: 0.281 ± 0.38
ProAsp: 3.096 ± 0.552
ProGlu: 2.533 ± 2.056
ProPhe: 1.407 ± 0.791
ProGly: 1.407 ± 0.877
ProHis: 1.407 ± 0.595
ProIle: 3.096 ± 1.199
ProLys: 1.407 ± 0.595
ProLeu: 5.066 ± 1.026
ProMet: 1.407 ± 0.874
ProAsn: 0.563 ± 0.308
ProPro: 2.815 ± 1.788
ProGln: 1.407 ± 0.812
ProArg: 1.407 ± 0.595
ProSer: 5.629 ± 1.316
ProThr: 4.222 ± 0.393
ProVal: 2.533 ± 1.111
ProTrp: 0.844 ± 0.342
ProTyr: 1.689 ± 1.054
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.815 ± 0.62
GlnCys: 0.844 ± 0.348
GlnAsp: 3.377 ± 0.855
GlnGlu: 2.533 ± 0.828
GlnPhe: 1.126 ± 0.612
GlnGly: 1.407 ± 0.535
GlnHis: 0.563 ± 0.306
GlnIle: 2.252 ± 0.724
GlnLys: 3.377 ± 0.572
GlnLeu: 3.377 ± 1.527
GlnMet: 1.126 ± 0.492
GlnAsn: 1.126 ± 0.292
GlnPro: 0.563 ± 0.306
GlnGln: 0.563 ± 0.306
GlnArg: 1.126 ± 0.399
GlnSer: 3.377 ± 0.603
GlnThr: 2.252 ± 0.907
GlnVal: 1.689 ± 0.512
GlnTrp: 0.563 ± 0.588
GlnTyr: 2.252 ± 1.176
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.533 ± 0.958
ArgCys: 1.126 ± 0.441
ArgAsp: 2.815 ± 0.785
ArgGlu: 1.97 ± 0.585
ArgPhe: 1.97 ± 0.76
ArgGly: 4.503 ± 0.985
ArgHis: 0.844 ± 0.459
ArgIle: 2.252 ± 0.973
ArgLys: 3.377 ± 0.654
ArgLeu: 5.91 ± 1.046
ArgMet: 1.689 ± 0.332
ArgAsn: 2.815 ± 1.202
ArgPro: 2.252 ± 0.423
ArgGln: 2.252 ± 0.48
ArgArg: 1.97 ± 0.463
ArgSer: 3.94 ± 0.802
ArgThr: 3.659 ± 1.097
ArgVal: 2.533 ± 1.142
ArgTrp: 1.407 ± 0.27
ArgTyr: 1.407 ± 0.675
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.348 ± 1.176
SerCys: 0.281 ± 0.378
SerAsp: 7.318 ± 0.882
SerGlu: 4.785 ± 1.435
SerPhe: 3.94 ± 0.896
SerGly: 5.91 ± 1.432
SerHis: 2.815 ± 1.067
SerIle: 6.192 ± 1.103
SerLys: 5.066 ± 0.74
SerLeu: 8.725 ± 1.804
SerMet: 2.533 ± 0.788
SerAsn: 4.503 ± 0.321
SerPro: 3.377 ± 1.028
SerGln: 3.377 ± 0.802
SerArg: 5.629 ± 1.476
SerSer: 11.54 ± 2.863
SerThr: 2.252 ± 0.474
SerVal: 7.318 ± 1.667
SerTrp: 1.407 ± 0.765
SerTyr: 3.377 ± 1.023
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.94 ± 0.539
ThrCys: 1.407 ± 0.482
ThrAsp: 3.096 ± 0.766
ThrGlu: 1.689 ± 0.432
ThrPhe: 1.689 ± 0.743
ThrGly: 3.096 ± 0.8
ThrHis: 1.407 ± 0.975
ThrIle: 4.785 ± 1.083
ThrLys: 3.94 ± 0.875
ThrLeu: 4.785 ± 1.34
ThrMet: 1.97 ± 0.667
ThrAsn: 1.407 ± 0.535
ThrPro: 3.377 ± 1.05
ThrGln: 2.815 ± 0.785
ThrArg: 1.97 ± 0.648
ThrSer: 6.473 ± 1.853
ThrThr: 4.503 ± 0.811
ThrVal: 1.97 ± 0.897
ThrTrp: 1.407 ± 0.466
ThrTyr: 0.844 ± 0.459
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.689 ± 0.543
ValCys: 2.252 ± 0.714
ValAsp: 4.503 ± 1.382
ValGlu: 2.815 ± 1.185
ValPhe: 1.689 ± 0.762
ValGly: 1.689 ± 1.345
ValHis: 1.689 ± 1.024
ValIle: 5.066 ± 1.327
ValLys: 2.252 ± 1.267
ValLeu: 7.036 ± 1.389
ValMet: 1.97 ± 0.872
ValAsn: 1.97 ± 0.747
ValPro: 3.096 ± 0.561
ValGln: 1.97 ± 0.52
ValArg: 3.096 ± 1.348
ValSer: 6.473 ± 1.447
ValThr: 3.096 ± 0.731
ValVal: 1.97 ± 0.491
ValTrp: 1.407 ± 0.579
ValTyr: 1.126 ± 0.424
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.281 ± 0.153
TrpAsp: 0.563 ± 0.389
TrpGlu: 2.533 ± 0.82
TrpPhe: 0.563 ± 0.308
TrpGly: 2.815 ± 1.067
TrpHis: 0.281 ± 0.153
TrpIle: 1.689 ± 0.933
TrpLys: 1.689 ± 0.613
TrpLeu: 2.815 ± 1.166
TrpMet: 0.0 ± 0.0
TrpAsn: 0.563 ± 0.389
TrpPro: 0.281 ± 0.153
TrpGln: 0.0 ± 0.0
TrpArg: 0.281 ± 0.153
TrpSer: 2.533 ± 0.706
TrpThr: 0.844 ± 0.342
TrpVal: 1.126 ± 0.951
TrpTrp: 0.281 ± 0.38
TrpTyr: 0.563 ± 0.308
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.252 ± 0.998
TyrCys: 0.281 ± 0.378
TyrAsp: 0.563 ± 0.339
TyrGlu: 1.689 ± 0.65
TyrPhe: 1.689 ± 0.661
TyrGly: 1.97 ± 0.424
TyrHis: 0.844 ± 0.459
TyrIle: 1.97 ± 0.897
TyrLys: 3.659 ± 0.966
TyrLeu: 4.503 ± 0.546
TyrMet: 0.844 ± 0.524
TyrAsn: 2.815 ± 0.48
TyrPro: 1.407 ± 0.935
TyrGln: 1.407 ± 0.661
TyrArg: 1.97 ± 1.077
TyrSer: 3.096 ± 0.657
TyrThr: 0.0 ± 0.0
TyrVal: 1.97 ± 0.433
TyrTrp: 0.281 ± 0.153
TyrTyr: 1.407 ± 0.387
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3554 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
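A protein-level bootstrap of the kind mentioned in the note can be sketched as follows in Python. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the error is taken as the sample standard deviation of those estimates. The function names, the fixed seed, and the use of the standard deviation are choices made for this example and are not confirmed by the source.

```python
import random
import statistics


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille of all counted pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_per_mille(sample, pair))
    # Assumption: report the sample standard deviation as the error.
    return statistics.stdev(estimates)


# Toy sequences for illustration only, not taken from the ISFV proteome.
toy_proteins = ["MAAL", "AAK", "MKLS", "SSAA", "LLGS"]
toy_error = bootstrap_error(toy_proteins, "AA")
```

Because resampling happens per protein rather than per residue, a dipeptide concentrated in a single protein gets a large error, which may explain why some entries in the table have errors comparable to their values.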

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski