Amino acid dipepetide frequency for Bunyamwera virus (BUNV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.997AlaAla: 2.997 ± 2.784
0.5AlaCys: 0.5 ± 0.471
2.498AlaAsp: 2.498 ± 0.787
3.497AlaGlu: 3.497 ± 0.499
1.748AlaPhe: 1.748 ± 0.987
2.997AlaGly: 2.997 ± 0.698
1.249AlaHis: 1.249 ± 0.49
3.746AlaIle: 3.746 ± 0.402
3.746AlaLys: 3.746 ± 0.834
3.996AlaLeu: 3.996 ± 0.776
1.499AlaMet: 1.499 ± 0.639
3.996AlaAsn: 3.996 ± 0.953
1.249AlaPro: 1.249 ± 0.367
1.748AlaGln: 1.748 ± 0.703
2.248AlaArg: 2.248 ± 2.066
3.497AlaSer: 3.497 ± 0.66
1.748AlaThr: 1.748 ± 1.326
3.247AlaVal: 3.247 ± 1.683
0.0AlaTrp: 0.0 ± 0.0
2.248AlaTyr: 2.248 ± 0.465
0.0AlaXaa: 0.0 ± 0.0
Cys
1.998CysAla: 1.998 ± 0.585
0.5CysCys: 0.5 ± 0.471
0.999CysAsp: 0.999 ± 0.942
0.5CysGlu: 0.5 ± 0.471
1.249CysPhe: 1.249 ± 0.539
1.748CysGly: 1.748 ± 1.649
0.5CysHis: 0.5 ± 0.17
1.998CysIle: 1.998 ± 0.917
2.498CysLys: 2.498 ± 1.078
1.499CysLeu: 1.499 ± 0.509
0.999CysMet: 0.999 ± 0.9
2.498CysAsn: 2.498 ± 0.848
0.999CysPro: 0.999 ± 0.339
1.499CysGln: 1.499 ± 0.76
0.749CysArg: 0.749 ± 0.38
0.749CysSer: 0.749 ± 0.225
2.747CysThr: 2.747 ± 1.916
1.998CysVal: 1.998 ± 1.545
0.5CysTrp: 0.5 ± 0.471
0.749CysTyr: 0.749 ± 0.38
0.0CysXaa: 0.0 ± 0.0
Asp
1.748AspAla: 1.748 ± 0.79
1.998AspCys: 1.998 ± 1.218
3.247AspAsp: 3.247 ± 0.322
3.746AspGlu: 3.746 ± 1.258
3.996AspPhe: 3.996 ± 0.776
1.998AspGly: 1.998 ± 1.269
0.999AspHis: 0.999 ± 0.897
3.996AspIle: 3.996 ± 1.392
4.745AspLys: 4.745 ± 1.391
3.497AspLeu: 3.497 ± 1.13
0.5AspMet: 0.5 ± 0.312
3.497AspAsn: 3.497 ± 1.58
1.998AspPro: 1.998 ± 0.4
2.498AspGln: 2.498 ± 0.734
2.747AspArg: 2.747 ± 1.037
2.498AspSer: 2.498 ± 0.712
3.497AspThr: 3.497 ± 1.675
3.247AspVal: 3.247 ± 0.569
0.25AspTrp: 0.25 ± 0.236
2.997AspTyr: 2.997 ± 1.044
0.0AspXaa: 0.0 ± 0.0
Glu
3.996GluAla: 3.996 ± 0.513
1.499GluCys: 1.499 ± 0.45
2.747GluAsp: 2.747 ± 1.296
4.246GluGlu: 4.246 ± 0.484
3.746GluPhe: 3.746 ± 0.634
1.748GluGly: 1.748 ± 0.987
1.748GluHis: 1.748 ± 0.703
6.743GluIle: 6.743 ± 1.581
5.744GluLys: 5.744 ± 2.15
5.744GluLeu: 5.744 ± 1.248
2.248GluMet: 2.248 ± 1.575
1.249GluAsn: 1.249 ± 0.539
1.998GluPro: 1.998 ± 1.269
1.998GluGln: 1.998 ± 1.247
3.247GluArg: 3.247 ± 1.183
2.498GluSer: 2.498 ± 0.712
3.497GluThr: 3.497 ± 1.051
4.246GluVal: 4.246 ± 3.141
0.5GluTrp: 0.5 ± 0.781
1.249GluTyr: 1.249 ± 0.49
0.0GluXaa: 0.0 ± 0.0
Phe
1.249PheAla: 1.249 ± 0.539
1.748PheCys: 1.748 ± 0.987
2.747PheAsp: 2.747 ± 0.289
2.498PheGlu: 2.498 ± 1.078
2.498PhePhe: 2.498 ± 0.629
3.497PheGly: 3.497 ± 1.12
1.998PheHis: 1.998 ± 0.596
3.497PheIle: 3.497 ± 1.325
3.247PheLys: 3.247 ± 0.569
4.745PheLeu: 4.745 ± 3.086
1.998PheMet: 1.998 ± 0.655
1.998PheAsn: 1.998 ± 0.943
2.248PhePro: 2.248 ± 1.194
0.749PheGln: 0.749 ± 0.38
3.247PheArg: 3.247 ± 1.356
3.746PheSer: 3.746 ± 1.47
4.496PheThr: 4.496 ± 1.671
2.997PheVal: 2.997 ± 0.89
0.5PheTrp: 0.5 ± 0.312
1.998PheTyr: 1.998 ± 1.365
0.0PheXaa: 0.0 ± 0.0
Gly
1.499GlyAla: 1.499 ± 0.887
1.998GlyCys: 1.998 ± 0.917
2.997GlyAsp: 2.997 ± 0.9
2.498GlyGlu: 2.498 ± 0.629
1.499GlyPhe: 1.499 ± 0.882
1.499GlyGly: 1.499 ± 0.882
0.999GlyHis: 0.999 ± 0.609
3.497GlyIle: 3.497 ± 1.768
2.997GlyLys: 2.997 ± 0.698
4.496GlyLeu: 4.496 ± 2.188
0.0GlyMet: 0.0 ± 0.0
2.747GlyAsn: 2.747 ± 0.744
1.748GlyPro: 1.748 ± 0.703
2.248GlyGln: 2.248 ± 1.139
1.748GlyArg: 1.748 ± 1.534
2.997GlySer: 2.997 ± 1.375
3.497GlyThr: 3.497 ± 2.335
2.248GlyVal: 2.248 ± 0.689
1.249GlyTrp: 1.249 ± 1.477
1.249GlyTyr: 1.249 ± 0.539
0.0GlyXaa: 0.0 ± 0.0
His
0.749HisAla: 0.749 ± 0.38
0.999HisCys: 0.999 ± 0.609
1.499HisAsp: 1.499 ± 0.882
0.999HisGlu: 0.999 ± 0.339
1.249HisPhe: 1.249 ± 0.367
1.499HisGly: 1.499 ± 0.45
0.749HisHis: 0.749 ± 0.468
1.998HisIle: 1.998 ± 0.596
2.248HisLys: 2.248 ± 0.675
2.248HisLeu: 2.248 ± 0.71
0.999HisMet: 0.999 ± 0.339
1.249HisAsn: 1.249 ± 0.367
0.5HisPro: 0.5 ± 0.17
0.0HisGln: 0.0 ± 0.0
1.249HisArg: 1.249 ± 0.755
1.499HisSer: 1.499 ± 0.45
1.499HisThr: 1.499 ± 1.467
0.749HisVal: 0.749 ± 0.468
0.5HisTrp: 0.5 ± 0.17
0.5HisTyr: 0.5 ± 0.17
0.0HisXaa: 0.0 ± 0.0
Ile
4.496IleAla: 4.496 ± 1.351
1.499IleCys: 1.499 ± 1.075
2.997IleAsp: 2.997 ± 1.277
5.495IleGlu: 5.495 ± 1.061
4.995IlePhe: 4.995 ± 1.498
3.996IleGly: 3.996 ± 0.513
0.749IleHis: 0.749 ± 0.468
7.992IleIle: 7.992 ± 1.003
7.742IleLys: 7.742 ± 1.723
10.24IleLeu: 10.24 ± 3.14
1.998IleMet: 1.998 ± 0.585
5.744IleAsn: 5.744 ± 0.639
2.747IlePro: 2.747 ± 0.806
2.747IleGln: 2.747 ± 1.128
2.498IleArg: 2.498 ± 1.209
6.244IleSer: 6.244 ± 1.836
5.495IleThr: 5.495 ± 0.982
3.746IleVal: 3.746 ± 1.101
0.999IleTrp: 0.999 ± 0.623
2.248IleTyr: 2.248 ± 0.348
0.0IleXaa: 0.0 ± 0.0
Lys
4.745LysAla: 4.745 ± 1.528
1.249LysCys: 1.249 ± 1.178
5.245LysAsp: 5.245 ± 0.996
6.494LysGlu: 6.494 ± 0.94
3.247LysPhe: 3.247 ± 0.67
2.997LysGly: 2.997 ± 0.871
1.748LysHis: 1.748 ± 0.526
5.245LysIle: 5.245 ± 0.61
5.744LysLys: 5.744 ± 1.265
8.492LysLeu: 8.492 ± 1.351
1.998LysMet: 1.998 ± 0.536
4.496LysAsn: 4.496 ± 0.782
1.998LysPro: 1.998 ± 0.679
1.998LysGln: 1.998 ± 0.696
2.498LysArg: 2.498 ± 1.624
5.744LysSer: 5.744 ± 1.034
6.494LysThr: 6.494 ± 1.794
3.746LysVal: 3.746 ± 0.834
0.999LysTrp: 0.999 ± 0.339
2.498LysTyr: 2.498 ± 0.949
0.0LysXaa: 0.0 ± 0.0
Leu
6.244LeuAla: 6.244 ± 3.46
2.498LeuCys: 2.498 ± 0.734
6.494LeuAsp: 6.494 ± 2.365
7.493LeuGlu: 7.493 ± 1.856
4.995LeuPhe: 4.995 ± 1.96
3.996LeuGly: 3.996 ± 3.317
2.498LeuHis: 2.498 ± 1.388
9.491LeuIle: 9.491 ± 1.944
6.494LeuLys: 6.494 ± 1.729
8.492LeuLeu: 8.492 ± 2.065
1.499LeuMet: 1.499 ± 0.935
4.995LeuAsn: 4.995 ± 1.701
3.996LeuPro: 3.996 ± 1.753
2.747LeuGln: 2.747 ± 1.296
2.248LeuArg: 2.248 ± 0.827
5.994LeuSer: 5.994 ± 1.897
7.992LeuThr: 7.992 ± 5.449
3.996LeuVal: 3.996 ± 1.753
0.5LeuTrp: 0.5 ± 0.312
3.497LeuTyr: 3.497 ± 0.66
0.0LeuXaa: 0.0 ± 0.0
Met
0.5MetAla: 0.5 ± 0.17
0.999MetCys: 0.999 ± 0.348
1.249MetAsp: 1.249 ± 0.779
1.249MetGlu: 1.249 ± 0.607
1.249MetPhe: 1.249 ± 0.582
0.5MetGly: 0.5 ± 0.781
0.999MetHis: 0.999 ± 0.339
2.997MetIle: 2.997 ± 0.301
2.248MetLys: 2.248 ± 0.836
2.498MetLeu: 2.498 ± 0.961
1.998MetMet: 1.998 ± 1.173
0.999MetAsn: 0.999 ± 0.348
1.249MetPro: 1.249 ± 0.367
1.249MetGln: 1.249 ± 0.367
1.748MetArg: 1.748 ± 0.464
2.498MetSer: 2.498 ± 1.152
1.249MetThr: 1.249 ± 1.052
1.499MetVal: 1.499 ± 0.9
0.0MetTrp: 0.0 ± 0.0
0.5MetTyr: 0.5 ± 0.17
0.0MetXaa: 0.0 ± 0.0
Asn
2.248AsnAla: 2.248 ± 0.836
0.25AsnCys: 0.25 ± 0.236
3.746AsnAsp: 3.746 ± 0.427
2.747AsnGlu: 2.747 ± 0.911
3.497AsnPhe: 3.497 ± 1.09
1.499AsnGly: 1.499 ± 1.533
1.499AsnHis: 1.499 ± 0.76
3.996AsnIle: 3.996 ± 1.169
3.996AsnLys: 3.996 ± 1.617
4.496AsnLeu: 4.496 ± 1.397
1.748AsnMet: 1.748 ± 0.667
3.746AsnAsn: 3.746 ± 0.634
2.498AsnPro: 2.498 ± 1.213
1.249AsnGln: 1.249 ± 0.779
2.498AsnArg: 2.498 ± 0.36
4.745AsnSer: 4.745 ± 1.415
4.246AsnThr: 4.246 ± 1.002
2.747AsnVal: 2.747 ± 0.812
1.249AsnTrp: 1.249 ± 0.367
3.497AsnTyr: 3.497 ± 1.13
0.0AsnXaa: 0.0 ± 0.0
Pro
1.499ProAla: 1.499 ± 0.817
0.25ProCys: 0.25 ± 0.236
1.499ProAsp: 1.499 ± 0.503
2.498ProGlu: 2.498 ± 1.213
1.499ProPhe: 1.499 ± 0.45
3.247ProGly: 3.247 ± 1.72
0.25ProHis: 0.25 ± 0.236
2.498ProIle: 2.498 ± 0.734
1.748ProLys: 1.748 ± 0.737
3.996ProLeu: 3.996 ± 1.475
0.749ProMet: 0.749 ± 0.38
1.748ProAsn: 1.748 ± 0.565
0.749ProPro: 0.749 ± 0.468
0.749ProGln: 0.749 ± 0.468
1.249ProArg: 1.249 ± 0.367
2.248ProSer: 2.248 ± 0.827
1.249ProThr: 1.249 ± 0.539
2.747ProVal: 2.747 ± 0.289
0.5ProTrp: 0.5 ± 0.312
1.249ProTyr: 1.249 ± 0.367
0.0ProXaa: 0.0 ± 0.0
Gln
2.248GlnAla: 2.248 ± 0.689
0.999GlnCys: 0.999 ± 0.339
0.999GlnAsp: 0.999 ± 0.348
1.499GlnGlu: 1.499 ± 0.45
1.499GlnPhe: 1.499 ± 0.817
1.748GlnGly: 1.748 ± 0.526
1.499GlnHis: 1.499 ± 0.509
3.247GlnIle: 3.247 ± 0.95
4.246GlnLys: 4.246 ± 0.728
1.998GlnLeu: 1.998 ± 0.679
0.749GlnMet: 0.749 ± 0.225
1.249GlnAsn: 1.249 ± 0.539
0.5GlnPro: 0.5 ± 0.471
1.499GlnGln: 1.499 ± 0.509
2.248GlnArg: 2.248 ± 2.066
2.248GlnSer: 2.248 ± 0.846
1.748GlnThr: 1.748 ± 0.703
0.999GlnVal: 0.999 ± 0.339
0.0GlnTrp: 0.0 ± 0.0
1.748GlnTyr: 1.748 ± 0.565
0.0GlnXaa: 0.0 ± 0.0
Arg
1.499ArgAla: 1.499 ± 0.935
1.499ArgCys: 1.499 ± 0.887
1.249ArgAsp: 1.249 ± 0.607
2.498ArgGlu: 2.498 ± 0.629
2.747ArgPhe: 2.747 ± 0.744
0.999ArgGly: 0.999 ± 0.348
1.249ArgHis: 1.249 ± 0.49
2.747ArgIle: 2.747 ± 1.075
2.747ArgLys: 2.747 ± 0.491
3.497ArgLeu: 3.497 ± 2.182
1.249ArgMet: 1.249 ± 0.841
2.498ArgAsn: 2.498 ± 0.961
0.999ArgPro: 0.999 ± 0.339
0.999ArgGln: 0.999 ± 0.687
1.249ArgArg: 1.249 ± 0.871
3.497ArgSer: 3.497 ± 0.895
1.748ArgThr: 1.748 ± 0.565
2.997ArgVal: 2.997 ± 0.301
0.5ArgTrp: 0.5 ± 0.17
2.498ArgTyr: 2.498 ± 1.387
0.0ArgXaa: 0.0 ± 0.0
Ser
2.997SerAla: 2.997 ± 1.067
3.497SerCys: 3.497 ± 2.291
3.746SerAsp: 3.746 ± 1.47
3.497SerGlu: 3.497 ± 1.835
2.248SerPhe: 2.248 ± 0.348
2.997SerGly: 2.997 ± 0.684
1.249SerHis: 1.249 ± 0.896
7.493SerIle: 7.493 ± 2.265
6.244SerLys: 6.244 ± 0.907
9.241SerLeu: 9.241 ± 2.145
3.247SerMet: 3.247 ± 1.01
0.999SerAsn: 0.999 ± 0.339
1.748SerPro: 1.748 ± 0.526
2.498SerGln: 2.498 ± 0.856
3.247SerArg: 3.247 ± 1.183
5.245SerSer: 5.245 ± 1.595
4.995SerThr: 4.995 ± 2.812
4.246SerVal: 4.246 ± 1.613
0.5SerTrp: 0.5 ± 0.17
1.998SerTyr: 1.998 ± 0.596
0.0SerXaa: 0.0 ± 0.0
Thr
3.247ThrAla: 3.247 ± 0.661
1.748ThrCys: 1.748 ± 0.987
3.746ThrAsp: 3.746 ± 1.191
3.996ThrGlu: 3.996 ± 0.776
4.496ThrPhe: 4.496 ± 0.828
3.247ThrGly: 3.247 ± 1.891
0.999ThrHis: 0.999 ± 0.609
5.744ThrIle: 5.744 ± 1.739
3.996ThrLys: 3.996 ± 1.357
5.994ThrLeu: 5.994 ± 4.211
1.249ThrMet: 1.249 ± 1.473
3.247ThrAsn: 3.247 ± 1.114
1.998ThrPro: 1.998 ± 1.836
2.498ThrGln: 2.498 ± 1.742
1.249ThrArg: 1.249 ± 0.539
5.994ThrSer: 5.994 ± 0.601
2.498ThrThr: 2.498 ± 2.35
4.246ThrVal: 4.246 ± 1.793
1.249ThrTrp: 1.249 ± 1.477
4.745ThrTyr: 4.745 ± 1.607
0.0ThrXaa: 0.0 ± 0.0
Val
2.997ValAla: 2.997 ± 1.241
1.998ValCys: 1.998 ± 0.679
4.246ValAsp: 4.246 ± 1.234
2.498ValGlu: 2.498 ± 0.787
3.497ValPhe: 3.497 ± 1.051
2.997ValGly: 2.997 ± 1.018
1.249ValHis: 1.249 ± 0.582
2.747ValIle: 2.747 ± 1.916
2.498ValLys: 2.498 ± 0.787
4.995ValLeu: 4.995 ± 1.567
0.999ValMet: 0.999 ± 1.023
3.497ValAsn: 3.497 ± 1.003
1.499ValPro: 1.499 ± 0.618
1.249ValGln: 1.249 ± 1.462
1.748ValArg: 1.748 ± 0.987
6.244ValSer: 6.244 ± 3.47
3.497ValThr: 3.497 ± 0.781
4.745ValVal: 4.745 ± 1.925
0.5ValTrp: 0.5 ± 0.17
3.247ValTyr: 3.247 ± 1.114
0.0ValXaa: 0.0 ± 0.0
Trp
0.25TrpAla: 0.25 ± 0.236
0.5TrpCys: 0.5 ± 0.17
0.25TrpAsp: 0.25 ± 0.156
1.249TrpGlu: 1.249 ± 0.582
0.749TrpPhe: 0.749 ± 0.225
0.25TrpGly: 0.25 ± 0.236
0.0TrpHis: 0.0 ± 0.0
0.25TrpIle: 0.25 ± 0.156
0.999TrpLys: 0.999 ± 1.572
1.499TrpLeu: 1.499 ± 0.509
0.0TrpMet: 0.0 ± 0.0
0.999TrpAsn: 0.999 ± 0.634
0.0TrpPro: 0.0 ± 0.0
0.5TrpGln: 0.5 ± 0.312
0.25TrpArg: 0.25 ± 0.236
0.999TrpSer: 0.999 ± 0.623
0.999TrpThr: 0.999 ± 0.634
0.5TrpVal: 0.5 ± 0.17
0.0TrpTrp: 0.0 ± 0.0
0.25TrpTyr: 0.25 ± 0.156
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.249TyrAla: 1.249 ± 0.607
0.749TyrCys: 0.749 ± 0.38
1.499TyrAsp: 1.499 ± 0.618
1.499TyrGlu: 1.499 ± 0.817
1.249TyrPhe: 1.249 ± 0.49
0.25TyrGly: 0.25 ± 0.236
0.749TyrHis: 0.749 ± 0.707
4.745TyrIle: 4.745 ± 0.552
3.746TyrLys: 3.746 ± 0.427
4.496TyrLeu: 4.496 ± 2.275
1.499TyrMet: 1.499 ± 0.879
4.745TyrAsn: 4.745 ± 0.694
1.499TyrPro: 1.499 ± 0.62
2.248TyrGln: 2.248 ± 0.675
0.999TyrArg: 0.999 ± 0.348
2.747TyrSer: 2.747 ± 1.595
2.997TyrThr: 2.997 ± 0.89
1.998TyrVal: 1.998 ± 0.679
0.0TyrTrp: 0.0 ± 0.0
0.999TyrTyr: 0.999 ± 0.609
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4005 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski