Amino acid dipeptide frequency for Shamonda orthobunyavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
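The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_permille`, the use of overlapping pairs, the pooling of counts across all proteins, and the mapping of every non-standard letter to `X` (Xaa) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
UNIFORM_PERMILLE = 1000.0 / 400     # 2.5 per mille: expectation if all 400 pairs were equally likely


def dipeptide_permille(sequences):
    """Return the frequency (in per mille) of each of the 441 dipeptides.

    Assumptions of this sketch: pairs overlap, counts are pooled over all
    sequences, and any non-standard residue is treated as 'X' (Xaa).
    """
    alphabet = STANDARD + "X"
    # Start every one of the 21 x 21 = 441 dipeptides at zero.
    counts = Counter({a + b: 0 for a, b in product(alphabet, repeat=2)})
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {pair: (1000.0 * n / total if total else 0.0) for pair, n in counts.items()}


# Toy input, not real Shamonda orthobunyavirus sequences.
freqs = dipeptide_permille(["MKLLA", "MKB"])
overrepresented = [p for p, f in freqs.items() if f > UNIFORM_PERMILLE]
```

Comparing each value with `UNIFORM_PERMILLE` reproduces the red/blue distinction used in the table: values above 2.5‰ are overrepresented relative to the uniform expectation, values below it are underrepresented.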

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.764AlaAla: 2.764 ± 2.245
2.01AlaCys: 2.01 ± 0.808
2.764AlaAsp: 2.764 ± 1.485
4.523AlaGlu: 4.523 ± 2.868
2.513AlaPhe: 2.513 ± 2.309
1.508AlaGly: 1.508 ± 0.434
0.251AlaHis: 0.251 ± 0.153
5.025AlaIle: 5.025 ± 0.966
3.769AlaLys: 3.769 ± 1.421
5.025AlaLeu: 5.025 ± 1.264
1.005AlaMet: 1.005 ± 0.328
3.266AlaAsn: 3.266 ± 1.384
1.005AlaPro: 1.005 ± 0.724
1.508AlaGln: 1.508 ± 1.573
1.759AlaArg: 1.759 ± 2.481
4.271AlaSer: 4.271 ± 1.479
4.02AlaThr: 4.02 ± 0.327
1.759AlaVal: 1.759 ± 0.448
0.503AlaTrp: 0.503 ± 0.851
2.764AlaTyr: 2.764 ± 1.087
0.0AlaXaa: 0.0 ± 0.0
Cys
2.01CysAla: 2.01 ± 0.808
0.503CysCys: 0.503 ± 0.306
0.754CysAsp: 0.754 ± 0.202
1.005CysGlu: 1.005 ± 0.289
1.256CysPhe: 1.256 ± 0.767
2.01CysGly: 2.01 ± 1.421
0.503CysHis: 0.503 ± 0.438
2.513CysIle: 2.513 ± 0.723
1.759CysLys: 1.759 ± 0.823
3.015CysLeu: 3.015 ± 1.352
0.503CysMet: 0.503 ± 0.306
0.754CysAsn: 0.754 ± 0.338
0.754CysPro: 0.754 ± 0.338
1.508CysGln: 1.508 ± 1.028
2.261CysArg: 2.261 ± 0.751
2.261CysSer: 2.261 ± 0.751
2.513CysThr: 2.513 ± 1.301
0.754CysVal: 0.754 ± 0.657
0.0CysTrp: 0.0 ± 0.0
2.01CysTyr: 2.01 ± 0.51
0.0CysXaa: 0.0 ± 0.0
Asp
2.261AspAla: 2.261 ± 0.42
1.005AspCys: 1.005 ± 0.289
4.271AspAsp: 4.271 ± 1.243
4.02AspGlu: 4.02 ± 1.556
3.015AspPhe: 3.015 ± 1.493
1.508AspGly: 1.508 ± 0.665
1.759AspHis: 1.759 ± 0.448
6.03AspIle: 6.03 ± 1.888
3.266AspLys: 3.266 ± 1.036
4.774AspLeu: 4.774 ± 0.286
1.256AspMet: 1.256 ± 0.316
3.769AspAsn: 3.769 ± 1.421
1.256AspPro: 1.256 ± 0.765
1.508AspGln: 1.508 ± 0.404
2.01AspArg: 2.01 ± 0.918
2.764AspSer: 2.764 ± 0.709
3.266AspThr: 3.266 ± 0.298
3.769AspVal: 3.769 ± 1.204
0.251AspTrp: 0.251 ± 0.219
2.513AspTyr: 2.513 ± 0.94
0.0AspXaa: 0.0 ± 0.0
Glu
3.015GluAla: 3.015 ± 0.264
1.005GluCys: 1.005 ± 0.289
3.266GluAsp: 3.266 ± 0.6
3.769GluGlu: 3.769 ± 0.517
4.02GluPhe: 4.02 ± 1.311
2.01GluGly: 2.01 ± 0.808
1.256GluHis: 1.256 ± 0.472
6.533GluIle: 6.533 ± 0.815
4.271GluLys: 4.271 ± 1.704
3.518GluLeu: 3.518 ± 1.265
3.015GluMet: 3.015 ± 1.748
2.764GluAsn: 2.764 ± 0.893
2.764GluPro: 2.764 ± 1.087
2.01GluGln: 2.01 ± 0.578
2.261GluArg: 2.261 ± 1.377
3.015GluSer: 3.015 ± 1.141
1.759GluThr: 1.759 ± 0.448
3.769GluVal: 3.769 ± 1.021
0.503GluTrp: 0.503 ± 0.89
2.513GluTyr: 2.513 ± 0.723
0.0GluXaa: 0.0 ± 0.0
Phe
1.759PheAla: 1.759 ± 0.605
1.759PheCys: 1.759 ± 0.448
2.513PheAsp: 2.513 ± 0.39
3.266PheGlu: 3.266 ± 0.936
1.256PhePhe: 1.256 ± 0.678
2.513PheGly: 2.513 ± 2.427
0.754PheHis: 0.754 ± 0.202
3.015PheIle: 3.015 ± 1.331
2.764PheLys: 2.764 ± 0.727
4.523PheLeu: 4.523 ± 1.996
0.503PheMet: 0.503 ± 0.851
2.261PheAsn: 2.261 ± 1.474
0.754PhePro: 0.754 ± 1.156
1.256PheGln: 1.256 ± 0.316
2.513PheArg: 2.513 ± 0.94
3.518PheSer: 3.518 ± 0.393
5.528PheThr: 5.528 ± 0.298
2.01PheVal: 2.01 ± 0.918
0.251PheTrp: 0.251 ± 0.153
1.759PheTyr: 1.759 ± 0.605
0.0PheXaa: 0.0 ± 0.0
Gly
1.508GlyAla: 1.508 ± 0.665
2.01GlyCys: 2.01 ± 1.054
2.513GlyAsp: 2.513 ± 0.632
3.518GlyGlu: 3.518 ± 1.265
1.759GlyPhe: 1.759 ± 1.582
1.759GlyGly: 1.759 ± 0.605
0.251GlyHis: 0.251 ± 0.153
4.523GlyIle: 4.523 ± 1.205
2.513GlyLys: 2.513 ± 0.39
2.764GlyLeu: 2.764 ± 0.727
2.513GlyMet: 2.513 ± 0.886
2.01GlyAsn: 2.01 ± 0.489
2.261GlyPro: 2.261 ± 0.605
2.513GlyGln: 2.513 ± 1.325
1.508GlyArg: 1.508 ± 0.617
3.518GlySer: 3.518 ± 2.203
2.261GlyThr: 2.261 ± 1.122
1.508GlyVal: 1.508 ± 1.573
0.503GlyTrp: 0.503 ± 0.145
1.759GlyTyr: 1.759 ± 2.502
0.0GlyXaa: 0.0 ± 0.0
His
1.508HisAla: 1.508 ± 0.434
0.0HisCys: 0.0 ± 0.0
1.508HisAsp: 1.508 ± 0.404
1.256HisGlu: 1.256 ± 0.316
1.005HisPhe: 1.005 ± 1.057
1.256HisGly: 1.256 ± 0.47
0.754HisHis: 0.754 ± 0.202
1.759HisIle: 1.759 ± 0.887
2.01HisLys: 2.01 ± 1.274
2.01HisLeu: 2.01 ± 0.796
0.754HisMet: 0.754 ± 0.202
2.513HisAsn: 2.513 ± 0.79
0.503HisPro: 0.503 ± 0.145
0.503HisGln: 0.503 ± 0.145
1.005HisArg: 1.005 ± 1.78
2.764HisSer: 2.764 ± 1.087
1.759HisThr: 1.759 ± 0.522
1.256HisVal: 1.256 ± 0.47
0.0HisTrp: 0.0 ± 0.0
1.005HisTyr: 1.005 ± 0.289
0.0HisXaa: 0.0 ± 0.0
Ile
6.03IleAla: 6.03 ± 0.578
4.02IleCys: 4.02 ± 2.203
3.769IleAsp: 3.769 ± 1.685
5.276IleGlu: 5.276 ± 0.404
4.271IlePhe: 4.271 ± 0.381
4.271IleGly: 4.271 ± 0.205
3.015IleHis: 3.015 ± 0.762
5.276IleIle: 5.276 ± 1.831
7.538IleLys: 7.538 ± 1.267
7.538IleLeu: 7.538 ± 1.865
3.015IleMet: 3.015 ± 0.419
6.281IleAsn: 6.281 ± 0.847
2.513IlePro: 2.513 ± 0.723
2.513IleGln: 2.513 ± 1.221
4.02IleArg: 4.02 ± 0.852
5.779IleSer: 5.779 ± 1.304
7.538IleThr: 7.538 ± 1.354
3.015IleVal: 3.015 ± 0.373
1.256IleTrp: 1.256 ± 0.316
4.774IleTyr: 4.774 ± 1.298
0.0IleXaa: 0.0 ± 0.0
Lys
5.779LysAla: 5.779 ± 1.51
2.764LysCys: 2.764 ± 2.408
3.518LysAsp: 3.518 ± 0.723
4.523LysGlu: 4.523 ± 1.14
3.769LysPhe: 3.769 ± 0.856
3.266LysGly: 3.266 ± 0.298
1.005LysHis: 1.005 ± 0.328
4.523LysIle: 4.523 ± 0.941
7.789LysLys: 7.789 ± 0.32
8.794LysLeu: 8.794 ± 1.279
2.513LysMet: 2.513 ± 0.425
3.015LysAsn: 3.015 ± 1.081
2.513LysPro: 2.513 ± 0.94
2.261LysGln: 2.261 ± 0.605
2.764LysArg: 2.764 ± 0.848
4.523LysSer: 4.523 ± 0.318
4.523LysThr: 4.523 ± 0.61
3.769LysVal: 3.769 ± 0.256
2.261LysTrp: 2.261 ± 0.471
2.764LysTyr: 2.764 ± 0.514
0.0LysXaa: 0.0 ± 0.0
Leu
5.276LeuAla: 5.276 ± 2.606
2.01LeuCys: 2.01 ± 0.489
5.025LeuAsp: 5.025 ± 2.263
5.779LeuGlu: 5.779 ± 1.333
3.266LeuPhe: 3.266 ± 1.384
2.764LeuGly: 2.764 ± 0.514
3.769LeuHis: 3.769 ± 1.85
6.533LeuIle: 6.533 ± 2.57
6.533LeuLys: 6.533 ± 1.051
7.286LeuLeu: 7.286 ± 2.129
1.759LeuMet: 1.759 ± 1.071
6.03LeuAsn: 6.03 ± 0.712
3.518LeuPro: 3.518 ± 0.909
2.764LeuGln: 2.764 ± 0.313
3.769LeuArg: 3.769 ± 0.211
6.03LeuSer: 6.03 ± 1.615
6.281LeuThr: 6.281 ± 0.487
5.528LeuVal: 5.528 ± 0.949
0.251LeuTrp: 0.251 ± 0.153
4.271LeuTyr: 4.271 ± 1.077
0.0LeuXaa: 0.0 ± 0.0
Met
1.759MetAla: 1.759 ± 0.522
0.503MetCys: 0.503 ± 0.306
1.759MetAsp: 1.759 ± 0.687
1.005MetGlu: 1.005 ± 0.328
1.005MetPhe: 1.005 ± 0.807
1.256MetGly: 1.256 ± 0.42
0.503MetHis: 0.503 ± 0.851
3.015MetIle: 3.015 ± 0.762
2.01MetLys: 2.01 ± 0.741
2.261MetLeu: 2.261 ± 1.069
0.0MetMet: 0.0 ± 0.0
1.508MetAsn: 1.508 ± 0.918
1.508MetPro: 1.508 ± 0.96
1.508MetGln: 1.508 ± 0.491
1.005MetArg: 1.005 ± 0.328
3.266MetSer: 3.266 ± 1.423
1.759MetThr: 1.759 ± 1.614
2.01MetVal: 2.01 ± 0.498
0.251MetTrp: 0.251 ± 0.469
1.005MetTyr: 1.005 ± 1.178
0.0MetXaa: 0.0 ± 0.0
Asn
2.764AsnAla: 2.764 ± 1.404
1.256AsnCys: 1.256 ± 0.767
2.01AsnAsp: 2.01 ± 0.655
1.256AsnGlu: 1.256 ± 0.472
1.759AsnPhe: 1.759 ± 1.526
1.759AsnGly: 1.759 ± 0.748
1.759AsnHis: 1.759 ± 0.605
6.03AsnIle: 6.03 ± 0.677
2.764AsnLys: 2.764 ± 0.848
7.286AsnLeu: 7.286 ± 2.042
2.513AsnMet: 2.513 ± 0.684
4.271AsnAsn: 4.271 ± 0.882
3.266AsnPro: 3.266 ± 1.14
3.769AsnGln: 3.769 ± 1.068
1.759AsnArg: 1.759 ± 0.522
3.266AsnSer: 3.266 ± 1.112
2.513AsnThr: 2.513 ± 0.943
2.764AsnVal: 2.764 ± 0.709
1.005AsnTrp: 1.005 ± 0.328
3.266AsnTyr: 3.266 ± 0.869
0.0AsnXaa: 0.0 ± 0.0
Pro
2.261ProAla: 2.261 ± 0.82
0.0ProCys: 0.0 ± 0.0
1.759ProAsp: 1.759 ± 0.687
2.01ProGlu: 2.01 ± 0.498
1.759ProPhe: 1.759 ± 0.448
1.508ProGly: 1.508 ± 0.665
1.005ProHis: 1.005 ± 0.289
3.518ProIle: 3.518 ± 1.045
1.759ProLys: 1.759 ± 0.422
3.015ProLeu: 3.015 ± 1.622
0.754ProMet: 0.754 ± 0.202
1.759ProAsn: 1.759 ± 0.422
0.503ProPro: 0.503 ± 0.306
1.508ProGln: 1.508 ± 1.593
1.005ProArg: 1.005 ± 0.289
2.261ProSer: 2.261 ± 0.42
2.01ProThr: 2.01 ± 0.51
1.508ProVal: 1.508 ± 0.604
0.503ProTrp: 0.503 ± 0.306
1.005ProTyr: 1.005 ± 0.289
0.0ProXaa: 0.0 ± 0.0
Gln
0.754GlnAla: 0.754 ± 0.202
1.508GlnCys: 1.508 ± 0.676
4.02GlnAsp: 4.02 ± 1.109
0.754GlnGlu: 0.754 ± 0.202
1.508GlnPhe: 1.508 ± 1.573
1.508GlnGly: 1.508 ± 0.404
0.503GlnHis: 0.503 ± 0.145
4.271GlnIle: 4.271 ± 0.97
4.774GlnLys: 4.774 ± 0.796
3.015GlnLeu: 3.015 ± 1.622
0.754GlnMet: 0.754 ± 0.199
2.513GlnAsn: 2.513 ± 0.632
0.503GlnPro: 0.503 ± 0.851
2.261GlnGln: 2.261 ± 1.399
2.01GlnArg: 2.01 ± 1.613
1.759GlnSer: 1.759 ± 0.61
1.759GlnThr: 1.759 ± 0.422
1.508GlnVal: 1.508 ± 0.491
0.754GlnTrp: 0.754 ± 0.465
1.256GlnTyr: 1.256 ± 0.678
0.0GlnXaa: 0.0 ± 0.0
Arg
1.759ArgAla: 1.759 ± 0.767
1.508ArgCys: 1.508 ± 0.434
3.518ArgAsp: 3.518 ± 1.265
2.01ArgGlu: 2.01 ± 0.655
1.508ArgPhe: 1.508 ± 0.617
1.256ArgGly: 1.256 ± 0.316
1.508ArgHis: 1.508 ± 0.749
4.271ArgIle: 4.271 ± 1.704
2.764ArgLys: 2.764 ± 0.709
3.769ArgLeu: 3.769 ± 1.41
0.503ArgMet: 0.503 ± 0.851
2.01ArgAsn: 2.01 ± 0.526
0.251ArgPro: 0.251 ± 0.469
1.508ArgGln: 1.508 ± 0.728
1.256ArgArg: 1.256 ± 1.83
4.271ArgSer: 4.271 ± 0.958
1.005ArgThr: 1.005 ± 1.0
2.261ArgVal: 2.261 ± 1.488
0.503ArgTrp: 0.503 ± 0.89
2.261ArgTyr: 2.261 ± 0.82
0.0ArgXaa: 0.0 ± 0.0
Ser
2.764SerAla: 2.764 ± 0.313
2.261SerCys: 2.261 ± 0.751
3.518SerAsp: 3.518 ± 0.896
5.025SerGlu: 5.025 ± 2.158
1.508SerPhe: 1.508 ± 0.434
2.513SerGly: 2.513 ± 1.864
1.759SerHis: 1.759 ± 0.522
8.291SerIle: 8.291 ± 2.167
6.03SerLys: 6.03 ± 2.47
6.03SerLeu: 6.03 ± 1.615
2.01SerMet: 2.01 ± 0.46
3.015SerAsn: 3.015 ± 0.579
1.508SerPro: 1.508 ± 0.665
2.764SerGln: 2.764 ± 1.595
4.271SerArg: 4.271 ± 1.704
6.533SerSer: 6.533 ± 1.253
5.528SerThr: 5.528 ± 1.604
5.025SerVal: 5.025 ± 2.249
0.251SerTrp: 0.251 ± 0.219
2.261SerTyr: 2.261 ± 0.471
0.0SerXaa: 0.0 ± 0.0
Thr
4.02ThrAla: 4.02 ± 1.237
2.513ThrCys: 2.513 ± 0.971
3.266ThrAsp: 3.266 ± 0.824
2.261ThrGlu: 2.261 ± 0.42
4.774ThrPhe: 4.774 ± 1.786
4.271ThrGly: 4.271 ± 1.577
1.005ThrHis: 1.005 ± 0.289
7.538ThrIle: 7.538 ± 2.29
4.271ThrLys: 4.271 ± 1.243
2.764ThrLeu: 2.764 ± 0.418
1.508ThrMet: 1.508 ± 1.867
3.266ThrAsn: 3.266 ± 1.279
2.764ThrPro: 2.764 ± 0.709
2.261ThrGln: 2.261 ± 0.55
1.759ThrArg: 1.759 ± 0.937
6.281ThrSer: 6.281 ± 1.58
3.266ThrThr: 3.266 ± 1.036
1.256ThrVal: 1.256 ± 0.692
1.759ThrTrp: 1.759 ± 0.687
2.764ThrTyr: 2.764 ± 1.087
0.0ThrXaa: 0.0 ± 0.0
Val
1.759ValAla: 1.759 ± 1.526
1.256ValCys: 1.256 ± 0.316
2.764ValAsp: 2.764 ± 0.418
2.01ValGlu: 2.01 ± 0.655
2.01ValPhe: 2.01 ± 0.526
2.261ValGly: 2.261 ± 1.71
0.754ValHis: 0.754 ± 0.202
3.266ValIle: 3.266 ± 0.246
5.025ValLys: 5.025 ± 1.942
5.276ValLeu: 5.276 ± 2.836
2.01ValMet: 2.01 ± 1.032
2.513ValAsn: 2.513 ± 1.384
1.759ValPro: 1.759 ± 1.514
2.01ValGln: 2.01 ± 0.578
2.01ValArg: 2.01 ± 0.51
4.523ValSer: 4.523 ± 0.925
2.764ValThr: 2.764 ± 1.145
4.271ValVal: 4.271 ± 1.335
0.251ValTrp: 0.251 ± 0.219
2.513ValTyr: 2.513 ± 0.943
0.0ValXaa: 0.0 ± 0.0
Trp
0.754TrpAla: 0.754 ± 1.757
0.0TrpCys: 0.0 ± 0.0
0.503TrpAsp: 0.503 ± 0.306
0.251TrpGlu: 0.251 ± 0.153
0.754TrpPhe: 0.754 ± 0.202
0.754TrpGly: 0.754 ± 0.338
0.754TrpHis: 0.754 ± 0.634
1.256TrpIle: 1.256 ± 0.827
0.251TrpLys: 0.251 ± 0.219
1.759TrpLeu: 1.759 ± 0.748
0.503TrpMet: 0.503 ± 0.851
1.005TrpAsn: 1.005 ± 0.612
0.0TrpPro: 0.0 ± 0.0
0.251TrpGln: 0.251 ± 0.153
0.251TrpArg: 0.251 ± 0.219
1.005TrpSer: 1.005 ± 0.533
0.503TrpThr: 0.503 ± 0.145
0.503TrpVal: 0.503 ± 0.306
0.0TrpTrp: 0.0 ± 0.0
0.503TrpTyr: 0.503 ± 0.145
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.508TyrAla: 1.508 ± 0.434
0.754TyrCys: 0.754 ± 0.657
1.256TyrAsp: 1.256 ± 0.472
3.518TyrGlu: 3.518 ± 1.534
1.759TyrPhe: 1.759 ± 0.767
3.518TyrGly: 3.518 ± 1.097
2.261TyrHis: 2.261 ± 0.589
4.774TyrIle: 4.774 ± 1.196
4.774TyrLys: 4.774 ± 1.467
4.02TyrLeu: 4.02 ± 1.052
1.256TyrMet: 1.256 ± 0.765
2.513TyrAsn: 2.513 ± 0.723
1.508TyrPro: 1.508 ± 0.969
1.508TyrGln: 1.508 ± 0.617
0.503TyrArg: 0.503 ± 0.145
1.256TyrSer: 1.256 ± 0.472
3.015TyrThr: 3.015 ± 0.494
2.764TyrVal: 2.764 ± 1.404
0.503TyrTrp: 0.503 ± 0.306
1.508TyrTyr: 1.508 ± 0.985
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3981 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
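For readers unfamiliar with protein-level bootstrapping, the following is a minimal sketch of one way such an error could be obtained. The page does not describe the exact procedure, so the details here are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the sample standard deviation of the resampled estimates is reported. The function names `pair_permille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each resample draws len(sequences) whole proteins with
    replacement; the database may use a different scheme.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not the real proteome.
error = bootstrap_error(["MKLLA", "AKLKL", "LLLKA", "MAAKL"], "KL")
```

With only 4 proteins in this proteome, each resample contains few distinct proteins, which may explain why some errors in the table are comparable in size to the values themselves.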

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski