Amino acid dipepetide frequency for Bunyavirus La Crosse

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.703AlaAla: 1.703 ± 3.214
1.022AlaCys: 1.022 ± 0.538
2.384AlaAsp: 2.384 ± 0.761
3.747AlaGlu: 3.747 ± 1.085
1.362AlaPhe: 1.362 ± 0.717
1.022AlaGly: 1.022 ± 1.332
1.362AlaHis: 1.362 ± 0.692
5.45AlaIle: 5.45 ± 0.818
4.087AlaLys: 4.087 ± 2.691
3.747AlaLeu: 3.747 ± 2.552
1.703AlaMet: 1.703 ± 0.896
2.725AlaAsn: 2.725 ± 1.181
1.022AlaPro: 1.022 ± 0.538
1.022AlaGln: 1.022 ± 0.538
3.065AlaArg: 3.065 ± 0.669
3.065AlaSer: 3.065 ± 3.346
1.703AlaThr: 1.703 ± 0.896
1.362AlaVal: 1.362 ± 1.203
0.341AlaTrp: 0.341 ± 0.179
2.044AlaTyr: 2.044 ± 1.076
0.0AlaXaa: 0.0 ± 0.0
Cys
1.022CysAla: 1.022 ± 0.538
0.681CysCys: 0.681 ± 0.787
0.0CysAsp: 0.0 ± 0.0
0.681CysGlu: 0.681 ± 0.359
1.362CysPhe: 1.362 ± 0.692
0.341CysGly: 0.341 ± 0.179
0.0CysHis: 0.0 ± 0.0
1.703CysIle: 1.703 ± 0.711
1.362CysLys: 1.362 ± 0.717
4.087CysLeu: 4.087 ± 2.877
1.362CysMet: 1.362 ± 0.944
0.681CysAsn: 0.681 ± 0.359
0.681CysPro: 0.681 ± 0.359
1.022CysGln: 1.022 ± 0.719
0.0CysArg: 0.0 ± 0.0
1.362CysSer: 1.362 ± 1.575
0.341CysThr: 0.341 ± 0.179
1.022CysVal: 1.022 ± 0.633
0.0CysTrp: 0.0 ± 0.0
0.341CysTyr: 0.341 ± 0.179
0.0CysXaa: 0.0 ± 0.0
Asp
3.065AspAla: 3.065 ± 1.027
0.341AspCys: 0.341 ± 0.179
2.384AspAsp: 2.384 ± 0.707
2.725AspGlu: 2.725 ± 0.886
3.406AspPhe: 3.406 ± 1.178
1.362AspGly: 1.362 ± 0.717
0.681AspHis: 0.681 ± 0.359
7.493AspIle: 7.493 ± 3.284
5.109AspLys: 5.109 ± 1.355
6.812AspLeu: 6.812 ± 1.174
2.384AspMet: 2.384 ± 0.844
4.087AspAsn: 4.087 ± 1.114
3.747AspPro: 3.747 ± 1.085
2.044AspGln: 2.044 ± 0.662
2.044AspArg: 2.044 ± 1.076
2.384AspSer: 2.384 ± 3.204
1.362AspThr: 1.362 ± 1.951
2.384AspVal: 2.384 ± 1.21
0.341AspTrp: 0.341 ± 0.179
3.747AspTyr: 3.747 ± 1.379
0.0AspXaa: 0.0 ± 0.0
Glu
3.406GluAla: 3.406 ± 1.793
0.681GluCys: 0.681 ± 0.359
2.044GluAsp: 2.044 ± 0.993
2.725GluGlu: 2.725 ± 1.434
5.109GluPhe: 5.109 ± 2.689
2.384GluGly: 2.384 ± 0.866
1.022GluHis: 1.022 ± 0.538
4.087GluIle: 4.087 ± 1.5
4.428GluLys: 4.428 ± 1.74
5.45GluLeu: 5.45 ± 2.178
2.044GluMet: 2.044 ± 0.662
4.087GluAsn: 4.087 ± 1.5
1.703GluPro: 1.703 ± 0.896
3.406GluGln: 3.406 ± 1.178
3.747GluArg: 3.747 ± 1.336
4.428GluSer: 4.428 ± 2.054
2.725GluThr: 2.725 ± 1.032
2.725GluVal: 2.725 ± 0.886
1.022GluTrp: 1.022 ± 0.633
2.725GluTyr: 2.725 ± 1.156
0.0GluXaa: 0.0 ± 0.0
Phe
1.703PheAla: 1.703 ± 0.711
1.703PheCys: 1.703 ± 1.089
2.725PheAsp: 2.725 ± 0.664
2.725PheGlu: 2.725 ± 1.434
1.703PhePhe: 1.703 ± 1.73
2.044PheGly: 2.044 ± 1.265
0.341PheHis: 0.341 ± 0.887
4.428PheIle: 4.428 ± 1.989
4.087PheLys: 4.087 ± 1.5
7.153PheLeu: 7.153 ± 5.837
1.022PheMet: 1.022 ± 0.538
4.087PheAsn: 4.087 ± 1.308
2.384PhePro: 2.384 ± 1.924
0.341PheGln: 0.341 ± 1.111
3.065PheArg: 3.065 ± 1.119
5.79PheSer: 5.79 ± 1.538
3.406PheThr: 3.406 ± 1.265
1.703PheVal: 1.703 ± 0.896
0.681PheTrp: 0.681 ± 0.359
2.384PheTyr: 2.384 ± 1.21
0.0PheXaa: 0.0 ± 0.0
Gly
1.703GlyAla: 1.703 ± 1.089
1.703GlyCys: 1.703 ± 0.896
2.725GlyAsp: 2.725 ± 1.434
3.065GlyGlu: 3.065 ± 0.883
1.703GlyPhe: 1.703 ± 1.814
1.362GlyGly: 1.362 ± 0.59
0.681GlyHis: 0.681 ± 0.359
3.065GlyIle: 3.065 ± 3.182
2.384GlyLys: 2.384 ± 0.924
3.406GlyLeu: 3.406 ± 1.178
1.362GlyMet: 1.362 ± 0.59
3.065GlyAsn: 3.065 ± 1.235
1.022GlyPro: 1.022 ± 0.633
1.022GlyGln: 1.022 ± 0.538
2.725GlyArg: 2.725 ± 1.883
3.406GlySer: 3.406 ± 1.218
1.703GlyThr: 1.703 ± 2.086
1.362GlyVal: 1.362 ± 2.345
0.681GlyTrp: 0.681 ± 0.718
2.044GlyTyr: 2.044 ± 1.265
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.681HisCys: 0.681 ± 0.787
1.703HisAsp: 1.703 ± 0.711
1.362HisGlu: 1.362 ± 0.59
1.362HisPhe: 1.362 ± 0.59
1.362HisGly: 1.362 ± 0.717
0.681HisHis: 0.681 ± 0.359
2.044HisIle: 2.044 ± 0.771
2.725HisLys: 2.725 ± 0.984
1.022HisLeu: 1.022 ± 0.538
1.022HisMet: 1.022 ± 0.538
1.362HisAsn: 1.362 ± 0.717
0.341HisPro: 0.341 ± 0.179
0.681HisGln: 0.681 ± 1.084
0.681HisArg: 0.681 ± 1.669
2.725HisSer: 2.725 ± 1.04
0.0HisThr: 0.0 ± 0.0
0.341HisVal: 0.341 ± 0.179
0.341HisTrp: 0.341 ± 0.179
0.341HisTyr: 0.341 ± 0.179
0.0HisXaa: 0.0 ± 0.0
Ile
5.45IleAla: 5.45 ± 2.033
1.362IleCys: 1.362 ± 1.575
4.768IleAsp: 4.768 ± 1.983
4.087IleGlu: 4.087 ± 2.151
5.109IlePhe: 5.109 ± 3.071
2.725IleGly: 2.725 ± 1.454
3.065IleHis: 3.065 ± 1.235
5.79IleIle: 5.79 ± 3.05
7.153IleLys: 7.153 ± 2.438
10.899IleLeu: 10.899 ± 3.867
2.725IleMet: 2.725 ± 0.941
5.45IleAsn: 5.45 ± 1.197
1.703IlePro: 1.703 ± 0.711
1.362IleGln: 1.362 ± 0.717
4.087IleArg: 4.087 ± 1.577
7.493IleSer: 7.493 ± 3.54
5.45IleThr: 5.45 ± 2.869
2.725IleVal: 2.725 ± 1.434
1.703IleTrp: 1.703 ± 1.472
3.406IleTyr: 3.406 ± 1.178
0.0IleXaa: 0.0 ± 0.0
Lys
3.406LysAla: 3.406 ± 2.684
1.362LysCys: 1.362 ± 0.717
4.768LysAsp: 4.768 ± 1.202
7.834LysGlu: 7.834 ± 3.487
3.747LysPhe: 3.747 ± 2.186
2.725LysGly: 2.725 ± 0.886
1.703LysHis: 1.703 ± 0.896
5.109LysIle: 5.109 ± 2.074
4.768LysLys: 4.768 ± 1.311
7.153LysLeu: 7.153 ± 2.438
3.747LysMet: 3.747 ± 1.938
4.428LysAsn: 4.428 ± 1.666
2.384LysPro: 2.384 ± 0.761
3.065LysGln: 3.065 ± 1.027
1.703LysArg: 1.703 ± 0.896
5.79LysSer: 5.79 ± 1.462
4.087LysThr: 4.087 ± 1.222
4.087LysVal: 4.087 ± 1.5
1.022LysTrp: 1.022 ± 1.547
2.725LysTyr: 2.725 ± 1.889
0.0LysXaa: 0.0 ± 0.0
Leu
5.45LeuAla: 5.45 ± 2.922
2.384LeuCys: 2.384 ± 3.239
5.79LeuAsp: 5.79 ± 1.462
6.471LeuGlu: 6.471 ± 1.969
6.131LeuPhe: 6.131 ± 3.733
3.747LeuGly: 3.747 ± 3.566
2.044LeuHis: 2.044 ± 1.076
7.834LeuIle: 7.834 ± 1.095
8.174LeuLys: 8.174 ± 1.991
11.58LeuLeu: 11.58 ± 6.6
3.065LeuMet: 3.065 ± 1.233
6.131LeuAsn: 6.131 ± 3.956
3.406LeuPro: 3.406 ± 2.566
2.044LeuGln: 2.044 ± 1.18
2.725LeuArg: 2.725 ± 0.886
10.899LeuSer: 10.899 ± 5.43
8.515LeuThr: 8.515 ± 1.955
4.768LeuVal: 4.768 ± 0.928
0.341LeuTrp: 0.341 ± 0.179
4.087LeuTyr: 4.087 ± 1.577
0.0LeuXaa: 0.0 ± 0.0
Met
1.022MetAla: 1.022 ± 0.538
1.022MetCys: 1.022 ± 0.538
3.406MetAsp: 3.406 ± 1.925
1.362MetGlu: 1.362 ± 0.717
1.022MetPhe: 1.022 ± 0.633
1.362MetGly: 1.362 ± 0.717
0.341MetHis: 0.341 ± 0.179
1.362MetIle: 1.362 ± 0.717
2.384MetLys: 2.384 ± 1.255
1.022MetLeu: 1.022 ± 0.538
2.384MetMet: 2.384 ± 1.094
1.703MetAsn: 1.703 ± 0.896
1.703MetPro: 1.703 ± 1.472
2.044MetGln: 2.044 ± 2.332
1.362MetArg: 1.362 ± 0.717
4.087MetSer: 4.087 ± 1.929
2.384MetThr: 2.384 ± 1.062
2.384MetVal: 2.384 ± 0.866
0.0MetTrp: 0.0 ± 0.0
1.022MetTyr: 1.022 ± 0.538
0.0MetXaa: 0.0 ± 0.0
Asn
4.087AsnAla: 4.087 ± 1.049
1.362AsnCys: 1.362 ± 1.575
4.087AsnAsp: 4.087 ± 1.323
3.065AsnGlu: 3.065 ± 1.027
4.768AsnPhe: 4.768 ± 3.366
3.065AsnGly: 3.065 ± 1.354
1.022AsnHis: 1.022 ± 1.538
5.109AsnIle: 5.109 ± 1.904
2.044AsnLys: 2.044 ± 1.076
5.79AsnLeu: 5.79 ± 1.666
1.703AsnMet: 1.703 ± 0.896
4.768AsnAsn: 4.768 ± 1.757
4.428AsnPro: 4.428 ± 1.894
1.362AsnGln: 1.362 ± 0.717
3.065AsnArg: 3.065 ± 1.027
4.087AsnSer: 4.087 ± 1.308
2.725AsnThr: 2.725 ± 0.907
2.044AsnVal: 2.044 ± 0.771
1.022AsnTrp: 1.022 ± 0.538
2.384AsnTyr: 2.384 ± 1.255
0.0AsnXaa: 0.0 ± 0.0
Pro
1.703ProAla: 1.703 ± 0.952
0.0ProCys: 0.0 ± 0.0
2.725ProAsp: 2.725 ± 0.886
3.065ProGlu: 3.065 ± 1.178
1.362ProPhe: 1.362 ± 0.944
4.087ProGly: 4.087 ± 1.172
0.341ProHis: 0.341 ± 0.179
4.768ProIle: 4.768 ± 1.49
1.703ProLys: 1.703 ± 0.899
2.044ProLeu: 2.044 ± 1.322
1.362ProMet: 1.362 ± 0.561
1.022ProAsn: 1.022 ± 0.719
1.362ProPro: 1.362 ± 0.944
1.703ProGln: 1.703 ± 1.762
0.681ProArg: 0.681 ± 1.084
1.703ProSer: 1.703 ± 0.916
1.703ProThr: 1.703 ± 0.952
1.703ProVal: 1.703 ± 0.896
0.681ProTrp: 0.681 ± 0.359
0.681ProTyr: 0.681 ± 0.359
0.0ProXaa: 0.0 ± 0.0
Gln
0.681GlnAla: 0.681 ± 0.359
0.341GlnCys: 0.341 ± 0.179
1.703GlnAsp: 1.703 ± 0.896
1.022GlnGlu: 1.022 ± 0.538
2.044GlnPhe: 2.044 ± 0.993
1.703GlnGly: 1.703 ± 1.286
0.681GlnHis: 0.681 ± 0.718
2.725GlnIle: 2.725 ± 1.434
3.747GlnLys: 3.747 ± 1.085
2.725GlnLeu: 2.725 ± 2.188
0.341GlnMet: 0.341 ± 1.189
1.703GlnAsn: 1.703 ± 1.242
0.341GlnPro: 0.341 ± 0.179
1.022GlnGln: 1.022 ± 0.999
4.087GlnArg: 4.087 ± 1.885
1.703GlnSer: 1.703 ± 0.896
1.703GlnThr: 1.703 ± 0.896
1.703GlnVal: 1.703 ± 1.286
0.0GlnTrp: 0.0 ± 0.0
1.362GlnTyr: 1.362 ± 1.876
0.0GlnXaa: 0.0 ± 0.0
Arg
1.703ArgAla: 1.703 ± 0.711
0.681ArgCys: 0.681 ± 0.359
3.406ArgAsp: 3.406 ± 1.265
3.406ArgGlu: 3.406 ± 1.793
1.703ArgPhe: 1.703 ± 0.896
1.022ArgGly: 1.022 ± 0.999
2.384ArgHis: 2.384 ± 1.255
4.428ArgIle: 4.428 ± 1.666
3.406ArgLys: 3.406 ± 1.202
5.109ArgLeu: 5.109 ± 3.482
0.341ArgMet: 0.341 ± 0.179
3.406ArgAsn: 3.406 ± 1.178
0.681ArgPro: 0.681 ± 1.084
1.362ArgGln: 1.362 ± 0.59
1.362ArgArg: 1.362 ± 0.717
3.747ArgSer: 3.747 ± 1.332
1.703ArgThr: 1.703 ± 1.089
2.384ArgVal: 2.384 ± 1.457
1.022ArgTrp: 1.022 ± 1.538
2.044ArgTyr: 2.044 ± 0.788
0.0ArgXaa: 0.0 ± 0.0
Ser
3.065SerAla: 3.065 ± 2.975
1.022SerCys: 1.022 ± 0.538
4.428SerAsp: 4.428 ± 1.043
3.065SerGlu: 3.065 ± 1.614
2.725SerPhe: 2.725 ± 1.008
3.065SerGly: 3.065 ± 1.932
2.044SerHis: 2.044 ± 1.139
9.877SerIle: 9.877 ± 4.388
7.834SerLys: 7.834 ± 3.459
12.262SerLeu: 12.262 ± 5.11
1.022SerMet: 1.022 ± 0.999
5.109SerAsn: 5.109 ± 2.476
2.725SerPro: 2.725 ± 0.907
1.703SerGln: 1.703 ± 2.086
4.087SerArg: 4.087 ± 1.222
5.79SerSer: 5.79 ± 4.56
4.768SerThr: 4.768 ± 2.032
5.109SerVal: 5.109 ± 1.061
0.681SerTrp: 0.681 ± 1.027
2.044SerTyr: 2.044 ± 1.438
0.0SerXaa: 0.0 ± 0.0
Thr
2.044ThrAla: 2.044 ± 1.265
0.341ThrCys: 0.341 ± 0.179
3.406ThrAsp: 3.406 ± 1.218
3.065ThrGlu: 3.065 ± 1.027
3.747ThrPhe: 3.747 ± 1.209
2.384ThrGly: 2.384 ± 1.643
0.681ThrHis: 0.681 ± 0.359
5.45ThrIle: 5.45 ± 2.376
3.747ThrLys: 3.747 ± 1.531
3.747ThrLeu: 3.747 ± 2.154
1.022ThrMet: 1.022 ± 0.538
2.384ThrAsn: 2.384 ± 1.255
2.384ThrPro: 2.384 ± 1.062
1.362ThrGln: 1.362 ± 0.941
3.406ThrArg: 3.406 ± 1.265
5.109ThrSer: 5.109 ± 1.061
3.406ThrThr: 3.406 ± 2.438
2.384ThrVal: 2.384 ± 1.062
0.341ThrTrp: 0.341 ± 0.179
3.406ThrTyr: 3.406 ± 1.793
0.0ThrXaa: 0.0 ± 0.0
Val
2.044ValAla: 2.044 ± 1.265
1.022ValCys: 1.022 ± 0.538
2.044ValAsp: 2.044 ± 1.076
2.725ValGlu: 2.725 ± 1.34
2.384ValPhe: 2.384 ± 0.761
2.384ValGly: 2.384 ± 0.761
1.362ValHis: 1.362 ± 0.717
2.384ValIle: 2.384 ± 0.761
3.406ValLys: 3.406 ± 1.271
4.768ValLeu: 4.768 ± 0.928
1.703ValMet: 1.703 ± 0.718
2.044ValAsn: 2.044 ± 0.993
1.022ValPro: 1.022 ± 0.538
1.703ValGln: 1.703 ± 0.916
1.703ValArg: 1.703 ± 1.486
5.45ValSer: 5.45 ± 1.291
2.384ValThr: 2.384 ± 0.967
2.044ValVal: 2.044 ± 0.993
0.0ValTrp: 0.0 ± 0.0
1.703ValTyr: 1.703 ± 0.952
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.341TrpCys: 0.341 ± 0.179
0.681TrpAsp: 0.681 ± 1.324
0.341TrpGlu: 0.341 ± 0.179
0.341TrpPhe: 0.341 ± 0.179
0.681TrpGly: 0.681 ± 0.718
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.022TrpLys: 1.022 ± 0.538
1.022TrpLeu: 1.022 ± 0.97
1.022TrpMet: 1.022 ± 0.633
1.022TrpAsn: 1.022 ± 0.538
0.0TrpPro: 0.0 ± 0.0
1.022TrpGln: 1.022 ± 0.633
0.341TrpArg: 0.341 ± 1.189
1.022TrpSer: 1.022 ± 0.538
0.681TrpThr: 0.681 ± 1.084
0.681TrpVal: 0.681 ± 0.718
0.0TrpTrp: 0.0 ± 0.0
0.681TrpTyr: 0.681 ± 1.027
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.681TyrAla: 0.681 ± 0.359
0.0TyrCys: 0.0 ± 0.0
2.725TyrAsp: 2.725 ± 0.886
3.406TyrGlu: 3.406 ± 1.793
2.384TyrPhe: 2.384 ± 1.255
1.362TyrGly: 1.362 ± 0.59
0.341TyrHis: 0.341 ± 0.179
3.406TyrIle: 3.406 ± 1.265
2.384TyrLys: 2.384 ± 1.062
5.79TyrLeu: 5.79 ± 3.79
1.703TyrMet: 1.703 ± 0.601
2.725TyrAsn: 2.725 ± 0.886
1.362TyrPro: 1.362 ± 0.59
2.044TyrGln: 2.044 ± 1.076
1.362TyrArg: 1.362 ± 0.717
2.384TyrSer: 2.384 ± 0.866
3.065TyrThr: 3.065 ± 1.614
1.362TyrVal: 1.362 ± 0.692
0.681TyrTrp: 0.681 ± 0.787
1.022TyrTyr: 1.022 ± 0.538
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2937 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski