Amino acid dipepetide frequency for Rift valley fever virus (RVFV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.372AlaAla: 7.372 ± 3.311
2.106AlaCys: 2.106 ± 0.57
2.896AlaAsp: 2.896 ± 0.365
4.476AlaGlu: 4.476 ± 1.302
3.686AlaPhe: 3.686 ± 0.25
2.896AlaGly: 2.896 ± 0.917
3.949AlaHis: 3.949 ± 1.602
4.213AlaIle: 4.213 ± 0.758
2.37AlaLys: 2.37 ± 0.382
6.319AlaLeu: 6.319 ± 1.658
2.106AlaMet: 2.106 ± 0.493
1.58AlaAsn: 1.58 ± 0.343
2.633AlaPro: 2.633 ± 0.674
1.58AlaGln: 1.58 ± 0.721
2.896AlaArg: 2.896 ± 0.503
5.003AlaSer: 5.003 ± 0.814
3.423AlaThr: 3.423 ± 0.793
3.686AlaVal: 3.686 ± 1.725
0.263AlaTrp: 0.263 ± 0.239
2.106AlaTyr: 2.106 ± 1.132
0.0AlaXaa: 0.0 ± 0.0
Cys
0.79CysAla: 0.79 ± 0.364
0.79CysCys: 0.79 ± 0.445
1.58CysAsp: 1.58 ± 0.727
2.106CysGlu: 2.106 ± 0.57
1.316CysPhe: 1.316 ± 0.837
0.79CysGly: 0.79 ± 0.364
1.053CysHis: 1.053 ± 0.958
1.316CysIle: 1.316 ± 0.498
1.58CysLys: 1.58 ± 0.416
3.686CysLeu: 3.686 ± 1.967
0.79CysMet: 0.79 ± 0.172
1.58CysAsn: 1.58 ± 0.427
1.58CysPro: 1.58 ± 0.727
1.316CysGln: 1.316 ± 0.383
1.053CysArg: 1.053 ± 0.599
3.423CysSer: 3.423 ± 0.819
1.58CysThr: 1.58 ± 0.727
1.316CysVal: 1.316 ± 0.826
0.527CysTrp: 0.527 ± 0.304
0.527CysTyr: 0.527 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
4.476AspAla: 4.476 ± 0.856
0.79AspCys: 0.79 ± 0.172
3.949AspAsp: 3.949 ± 1.015
2.896AspGlu: 2.896 ± 0.697
2.633AspPhe: 2.633 ± 1.044
4.739AspGly: 4.739 ± 1.014
1.58AspHis: 1.58 ± 0.912
3.16AspIle: 3.16 ± 0.831
3.423AspLys: 3.423 ± 1.011
7.109AspLeu: 7.109 ± 2.078
2.106AspMet: 2.106 ± 1.146
2.37AspAsn: 2.37 ± 0.289
2.106AspPro: 2.106 ± 0.979
1.316AspGln: 1.316 ± 0.988
1.58AspArg: 1.58 ± 0.393
3.686AspSer: 3.686 ± 0.141
1.316AspThr: 1.316 ± 0.432
2.896AspVal: 2.896 ± 1.076
1.053AspTrp: 1.053 ± 0.502
1.843AspTyr: 1.843 ± 0.506
0.0AspXaa: 0.0 ± 0.0
Glu
3.949GluAla: 3.949 ± 0.535
1.053GluCys: 1.053 ± 0.599
6.056GluAsp: 6.056 ± 0.553
6.582GluGlu: 6.582 ± 0.49
3.423GluPhe: 3.423 ± 0.793
3.423GluGly: 3.423 ± 0.696
0.79GluHis: 0.79 ± 0.172
5.003GluIle: 5.003 ± 0.517
2.896GluLys: 2.896 ± 0.707
7.899GluLeu: 7.899 ± 1.384
2.37GluMet: 2.37 ± 0.454
2.106GluAsn: 2.106 ± 0.438
2.106GluPro: 2.106 ± 0.29
1.053GluGln: 1.053 ± 0.325
3.16GluArg: 3.16 ± 0.687
5.003GluSer: 5.003 ± 0.854
2.896GluThr: 2.896 ± 0.742
3.686GluVal: 3.686 ± 0.654
0.527GluTrp: 0.527 ± 0.405
1.316GluTyr: 1.316 ± 0.624
0.0GluXaa: 0.0 ± 0.0
Phe
3.686PheAla: 3.686 ± 1.052
1.053PheCys: 1.053 ± 0.599
3.686PheAsp: 3.686 ± 0.931
1.843PheGlu: 1.843 ± 0.635
2.106PhePhe: 2.106 ± 0.34
2.106PheGly: 2.106 ± 0.33
0.527PheHis: 0.527 ± 0.304
1.58PheIle: 1.58 ± 0.296
2.633PheLys: 2.633 ± 0.864
4.476PheLeu: 4.476 ± 0.569
1.316PheMet: 1.316 ± 0.432
2.896PheAsn: 2.896 ± 1.009
2.633PhePro: 2.633 ± 1.175
1.053PheGln: 1.053 ± 0.32
1.843PheArg: 1.843 ± 1.064
5.003PheSer: 5.003 ± 0.363
2.37PheThr: 2.37 ± 0.366
5.529PheVal: 5.529 ± 0.594
0.263PheTrp: 0.263 ± 0.152
0.527PheTyr: 0.527 ± 0.415
0.0PheXaa: 0.0 ± 0.0
Gly
4.213GlyAla: 4.213 ± 0.779
1.843GlyCys: 1.843 ± 0.635
3.16GlyAsp: 3.16 ± 0.54
2.633GlyGlu: 2.633 ± 1.161
4.213GlyPhe: 4.213 ± 1.094
3.686GlyGly: 3.686 ± 0.649
1.843GlyHis: 1.843 ± 0.267
3.949GlyIle: 3.949 ± 0.551
4.476GlyLys: 4.476 ± 0.572
3.949GlyLeu: 3.949 ± 1.161
2.106GlyMet: 2.106 ± 0.6
1.843GlyAsn: 1.843 ± 0.865
2.633GlyPro: 2.633 ± 0.676
1.843GlyGln: 1.843 ± 0.961
2.633GlyArg: 2.633 ± 0.737
6.846GlySer: 6.846 ± 2.112
2.37GlyThr: 2.37 ± 1.207
5.003GlyVal: 5.003 ± 1.083
0.79GlyTrp: 0.79 ± 0.635
1.316GlyTyr: 1.316 ± 0.276
0.0GlyXaa: 0.0 ± 0.0
His
0.79HisAla: 0.79 ± 0.364
0.527HisCys: 0.527 ± 0.479
1.58HisAsp: 1.58 ± 0.343
1.316HisGlu: 1.316 ± 0.671
0.79HisPhe: 0.79 ± 0.172
2.896HisGly: 2.896 ± 0.915
0.79HisHis: 0.79 ± 0.369
1.58HisIle: 1.58 ± 0.578
1.58HisLys: 1.58 ± 0.427
2.106HisLeu: 2.106 ± 0.583
0.79HisMet: 0.79 ± 0.718
0.79HisAsn: 0.79 ± 0.364
0.79HisPro: 0.79 ± 0.424
0.79HisGln: 0.79 ± 0.456
1.58HisArg: 1.58 ± 1.184
1.053HisSer: 1.053 ± 0.491
1.316HisThr: 1.316 ± 0.276
1.58HisVal: 1.58 ± 0.416
0.0HisTrp: 0.0 ± 0.0
1.58HisTyr: 1.58 ± 0.578
0.0HisXaa: 0.0 ± 0.0
Ile
4.213IleAla: 4.213 ± 0.838
1.58IleCys: 1.58 ± 0.343
3.686IleAsp: 3.686 ± 0.764
3.686IleGlu: 3.686 ± 0.996
1.843IlePhe: 1.843 ± 0.727
3.423IleGly: 3.423 ± 0.43
0.527IleHis: 0.527 ± 0.304
3.16IleIle: 3.16 ± 0.874
3.16IleLys: 3.16 ± 0.869
4.476IleLeu: 4.476 ± 0.939
0.263IleMet: 0.263 ± 0.152
2.37IleAsn: 2.37 ± 0.634
3.423IlePro: 3.423 ± 0.954
2.896IleGln: 2.896 ± 0.411
4.739IleArg: 4.739 ± 0.812
4.739IleSer: 4.739 ± 0.892
3.686IleThr: 3.686 ± 0.903
3.949IleVal: 3.949 ± 0.852
0.263IleTrp: 0.263 ± 0.152
1.053IleTyr: 1.053 ± 0.291
0.0IleXaa: 0.0 ± 0.0
Lys
2.896LysAla: 2.896 ± 0.576
2.37LysCys: 2.37 ± 1.792
2.633LysAsp: 2.633 ± 0.561
3.423LysGlu: 3.423 ± 1.063
2.896LysPhe: 2.896 ± 1.009
5.266LysGly: 5.266 ± 1.957
1.053LysHis: 1.053 ± 0.608
3.423LysIle: 3.423 ± 0.682
5.266LysLys: 5.266 ± 0.828
4.476LysLeu: 4.476 ± 0.927
2.896LysMet: 2.896 ± 0.668
1.58LysAsn: 1.58 ± 0.343
3.423LysPro: 3.423 ± 0.618
1.843LysGln: 1.843 ± 0.498
2.633LysArg: 2.633 ± 0.321
2.896LysSer: 2.896 ± 1.009
4.476LysThr: 4.476 ± 0.724
4.476LysVal: 4.476 ± 1.904
1.58LysTrp: 1.58 ± 0.912
1.58LysTyr: 1.58 ± 0.296
0.0LysXaa: 0.0 ± 0.0
Leu
6.319LeuAla: 6.319 ± 0.997
1.843LeuCys: 1.843 ± 0.527
3.423LeuAsp: 3.423 ± 0.663
5.529LeuGlu: 5.529 ± 1.534
4.476LeuPhe: 4.476 ± 1.302
3.686LeuGly: 3.686 ± 0.533
1.843LeuHis: 1.843 ± 0.267
6.582LeuIle: 6.582 ± 1.03
6.582LeuLys: 6.582 ± 1.814
7.372LeuLeu: 7.372 ± 0.155
4.476LeuMet: 4.476 ± 1.24
3.16LeuAsn: 3.16 ± 0.687
3.423LeuPro: 3.423 ± 0.989
4.213LeuGln: 4.213 ± 0.762
6.319LeuArg: 6.319 ± 1.288
12.112LeuSer: 12.112 ± 0.577
3.16LeuThr: 3.16 ± 0.8
4.476LeuVal: 4.476 ± 0.827
0.527LeuTrp: 0.527 ± 0.304
2.896LeuTyr: 2.896 ± 0.411
0.0LeuXaa: 0.0 ± 0.0
Met
1.843MetAla: 1.843 ± 0.428
0.263MetCys: 0.263 ± 0.152
2.896MetAsp: 2.896 ± 0.926
2.633MetGlu: 2.633 ± 0.931
1.316MetPhe: 1.316 ± 0.432
2.896MetGly: 2.896 ± 1.439
1.58MetHis: 1.58 ± 0.695
2.106MetIle: 2.106 ± 0.379
1.843MetLys: 1.843 ± 0.428
2.106MetLeu: 2.106 ± 0.34
2.37MetMet: 2.37 ± 1.358
0.79MetAsn: 0.79 ± 0.424
0.263MetPro: 0.263 ± 0.239
1.053MetGln: 1.053 ± 0.608
1.316MetArg: 1.316 ± 0.338
1.58MetSer: 1.58 ± 0.393
1.843MetThr: 1.843 ± 0.382
2.106MetVal: 2.106 ± 0.554
0.527MetTrp: 0.527 ± 0.479
1.316MetTyr: 1.316 ± 0.347
0.0MetXaa: 0.0 ± 0.0
Asn
1.843AsnAla: 1.843 ± 0.447
1.316AsnCys: 1.316 ± 0.432
3.16AsnAsp: 3.16 ± 0.592
1.58AsnGlu: 1.58 ± 0.393
2.896AsnPhe: 2.896 ± 0.411
1.316AsnGly: 1.316 ± 0.383
0.527AsnHis: 0.527 ± 0.304
1.58AsnIle: 1.58 ± 0.343
2.106AsnLys: 2.106 ± 0.65
5.003AsnLeu: 5.003 ± 0.87
0.263AsnMet: 0.263 ± 0.152
1.843AsnAsn: 1.843 ± 0.267
3.686AsnPro: 3.686 ± 1.171
1.316AsnGln: 1.316 ± 0.432
2.106AsnArg: 2.106 ± 0.57
2.106AsnSer: 2.106 ± 0.512
0.527AsnThr: 0.527 ± 0.142
1.843AsnVal: 1.843 ± 0.737
1.316AsnTrp: 1.316 ± 0.338
1.58AsnTyr: 1.58 ± 0.296
0.0AsnXaa: 0.0 ± 0.0
Pro
1.58ProAla: 1.58 ± 0.549
1.316ProCys: 1.316 ± 0.639
1.58ProAsp: 1.58 ± 0.416
5.266ProGlu: 5.266 ± 1.359
2.37ProPhe: 2.37 ± 0.46
2.106ProGly: 2.106 ± 0.29
1.053ProHis: 1.053 ± 0.599
1.843ProIle: 1.843 ± 0.382
2.106ProLys: 2.106 ± 0.57
3.686ProLeu: 3.686 ± 0.601
1.053ProMet: 1.053 ± 0.325
2.106ProAsn: 2.106 ± 0.834
3.686ProPro: 3.686 ± 1.347
0.79ProGln: 0.79 ± 0.364
3.16ProArg: 3.16 ± 0.721
4.739ProSer: 4.739 ± 1.9
0.79ProThr: 0.79 ± 0.456
2.106ProVal: 2.106 ± 0.834
0.79ProTrp: 0.79 ± 0.456
1.58ProTyr: 1.58 ± 0.891
0.0ProXaa: 0.0 ± 0.0
Gln
2.896GlnAla: 2.896 ± 1.349
1.843GlnCys: 1.843 ± 0.708
1.053GlnAsp: 1.053 ± 0.285
1.58GlnGlu: 1.58 ± 0.296
0.527GlnPhe: 0.527 ± 0.884
2.896GlnGly: 2.896 ± 1.042
0.79GlnHis: 0.79 ± 0.456
2.37GlnIle: 2.37 ± 0.261
1.58GlnLys: 1.58 ± 0.912
2.106GlnLeu: 2.106 ± 0.438
0.79GlnMet: 0.79 ± 0.172
0.527GlnAsn: 0.527 ± 0.304
1.053GlnPro: 1.053 ± 0.491
1.053GlnGln: 1.053 ± 0.608
1.58GlnArg: 1.58 ± 0.416
3.686GlnSer: 3.686 ± 1.603
1.316GlnThr: 1.316 ± 0.498
1.316GlnVal: 1.316 ± 0.276
0.527GlnTrp: 0.527 ± 0.405
1.053GlnTyr: 1.053 ± 0.502
0.0GlnXaa: 0.0 ± 0.0
Arg
2.37ArgAla: 2.37 ± 0.289
2.106ArgCys: 2.106 ± 0.57
2.896ArgAsp: 2.896 ± 0.503
5.529ArgGlu: 5.529 ± 1.359
1.58ArgPhe: 1.58 ± 0.727
3.423ArgGly: 3.423 ± 0.793
0.79ArgHis: 0.79 ± 0.388
3.423ArgIle: 3.423 ± 1.37
2.106ArgLys: 2.106 ± 0.583
3.423ArgLeu: 3.423 ± 1.361
2.37ArgMet: 2.37 ± 0.977
2.37ArgAsn: 2.37 ± 0.772
2.37ArgPro: 2.37 ± 0.454
1.843ArgGln: 1.843 ± 0.453
2.37ArgArg: 2.37 ± 1.934
3.686ArgSer: 3.686 ± 1.394
3.16ArgThr: 3.16 ± 0.84
5.266ArgVal: 5.266 ± 1.826
0.527ArgTrp: 0.527 ± 0.142
1.316ArgTyr: 1.316 ± 0.432
0.0ArgXaa: 0.0 ± 0.0
Ser
5.793SerAla: 5.793 ± 1.058
3.16SerCys: 3.16 ± 1.494
4.213SerAsp: 4.213 ± 1.988
5.266SerGlu: 5.266 ± 0.839
5.003SerPhe: 5.003 ± 0.363
5.793SerGly: 5.793 ± 1.128
1.58SerHis: 1.58 ± 0.502
2.896SerIle: 2.896 ± 1.009
7.109SerLys: 7.109 ± 1.497
8.425SerLeu: 8.425 ± 0.765
2.106SerMet: 2.106 ± 0.833
2.633SerAsn: 2.633 ± 1.133
3.686SerPro: 3.686 ± 0.802
1.843SerGln: 1.843 ± 0.635
3.686SerArg: 3.686 ± 0.955
9.742SerSer: 9.742 ± 1.331
5.003SerThr: 5.003 ± 2.072
6.582SerVal: 6.582 ± 0.896
1.843SerTrp: 1.843 ± 0.382
2.896SerTyr: 2.896 ± 0.679
0.0SerXaa: 0.0 ± 0.0
Thr
2.37ThrAla: 2.37 ± 1.207
2.106ThrCys: 2.106 ± 0.979
2.106ThrAsp: 2.106 ± 0.877
2.896ThrGlu: 2.896 ± 0.326
1.58ThrPhe: 1.58 ± 0.393
4.213ThrGly: 4.213 ± 1.415
0.527ThrHis: 0.527 ± 0.142
3.686ThrIle: 3.686 ± 0.907
3.16ThrLys: 3.16 ± 1.183
6.056ThrLeu: 6.056 ± 0.553
1.316ThrMet: 1.316 ± 0.684
2.106ThrAsn: 2.106 ± 0.29
0.527ThrPro: 0.527 ± 0.304
1.316ThrGln: 1.316 ± 0.327
3.423ThrArg: 3.423 ± 0.814
3.686ThrSer: 3.686 ± 1.152
3.16ThrThr: 3.16 ± 0.372
2.106ThrVal: 2.106 ± 0.627
0.0ThrTrp: 0.0 ± 0.0
1.053ThrTyr: 1.053 ± 0.599
0.0ThrXaa: 0.0 ± 0.0
Val
4.213ValAla: 4.213 ± 0.474
2.106ValCys: 2.106 ± 1.198
3.423ValAsp: 3.423 ± 1.367
3.686ValGlu: 3.686 ± 1.152
2.37ValPhe: 2.37 ± 1.158
3.949ValGly: 3.949 ± 1.718
2.633ValHis: 2.633 ± 0.674
2.106ValIle: 2.106 ± 0.847
3.423ValLys: 3.423 ± 0.711
5.529ValLeu: 5.529 ± 1.023
1.843ValMet: 1.843 ± 0.498
2.633ValAsn: 2.633 ± 0.725
2.106ValPro: 2.106 ± 0.639
2.106ValGln: 2.106 ± 0.578
3.686ValArg: 3.686 ± 0.625
7.636ValSer: 7.636 ± 1.215
3.16ValThr: 3.16 ± 0.618
5.529ValVal: 5.529 ± 0.973
0.79ValTrp: 0.79 ± 0.172
2.896ValTyr: 2.896 ± 1.009
0.0ValXaa: 0.0 ± 0.0
Trp
1.58TrpAla: 1.58 ± 0.912
0.263TrpCys: 0.263 ± 0.152
0.527TrpAsp: 0.527 ± 0.142
0.527TrpGlu: 0.527 ± 0.405
0.79TrpPhe: 0.79 ± 0.364
1.053TrpGly: 1.053 ± 0.285
0.0TrpHis: 0.0 ± 0.0
1.053TrpIle: 1.053 ± 0.32
0.263TrpLys: 0.263 ± 0.239
1.053TrpLeu: 1.053 ± 0.325
0.527TrpMet: 0.527 ± 0.142
1.053TrpAsn: 1.053 ± 0.291
0.527TrpPro: 0.527 ± 0.415
0.0TrpGln: 0.0 ± 0.0
0.79TrpArg: 0.79 ± 0.172
0.527TrpSer: 0.527 ± 0.304
0.79TrpThr: 0.79 ± 0.388
1.053TrpVal: 1.053 ± 0.502
0.263TrpTrp: 0.263 ± 0.152
0.263TrpTyr: 0.263 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.37TyrAla: 2.37 ± 0.775
0.263TyrCys: 0.263 ± 0.239
0.79TyrAsp: 0.79 ± 0.456
1.843TyrGlu: 1.843 ± 0.382
1.316TyrPhe: 1.316 ± 0.624
0.79TyrGly: 0.79 ± 0.424
0.79TyrHis: 0.79 ± 0.456
2.106TyrIle: 2.106 ± 0.654
3.423TyrLys: 3.423 ± 0.634
3.16TyrLeu: 3.16 ± 0.8
0.527TyrMet: 0.527 ± 0.304
1.58TyrAsn: 1.58 ± 0.744
1.053TyrPro: 1.053 ± 0.502
1.316TyrGln: 1.316 ± 0.873
2.37TyrArg: 2.37 ± 0.454
2.106TyrSer: 2.106 ± 0.29
1.053TyrThr: 1.053 ± 0.608
1.316TyrVal: 1.316 ± 0.276
0.527TyrTrp: 0.527 ± 0.142
0.263TyrTyr: 0.263 ± 0.152
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3799 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski