Amino acid dipepetide frequency for Sand fever Naples-like virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.764AlaAla: 3.764 ± 1.271
1.004AlaCys: 1.004 ± 0.246
2.258AlaAsp: 2.258 ± 0.852
2.258AlaGlu: 2.258 ± 0.233
2.008AlaPhe: 2.008 ± 0.648
2.76AlaGly: 2.76 ± 1.314
1.506AlaHis: 1.506 ± 0.499
5.521AlaIle: 5.521 ± 0.532
3.262AlaLys: 3.262 ± 0.127
4.266AlaLeu: 4.266 ± 0.586
3.513AlaMet: 3.513 ± 0.509
2.258AlaAsn: 2.258 ± 0.391
1.255AlaPro: 1.255 ± 0.402
1.004AlaGln: 1.004 ± 0.445
3.262AlaArg: 3.262 ± 1.001
4.768AlaSer: 4.768 ± 0.706
2.509AlaThr: 2.509 ± 0.289
2.509AlaVal: 2.509 ± 0.971
1.004AlaTrp: 1.004 ± 0.445
1.506AlaTyr: 1.506 ± 0.254
0.0AlaXaa: 0.0 ± 0.0
Cys
1.255CysAla: 1.255 ± 0.639
0.251CysCys: 0.251 ± 0.153
1.255CysAsp: 1.255 ± 0.202
0.753CysGlu: 0.753 ± 0.357
1.004CysPhe: 1.004 ± 0.6
0.753CysGly: 0.753 ± 0.13
1.004CysHis: 1.004 ± 0.6
0.502CysIle: 0.502 ± 0.123
2.008CysLys: 2.008 ± 0.492
3.011CysLeu: 3.011 ± 1.107
1.255CysMet: 1.255 ± 0.202
1.004CysAsn: 1.004 ± 0.6
1.255CysPro: 1.255 ± 0.844
0.753CysGln: 0.753 ± 0.13
0.502CysArg: 0.502 ± 0.491
2.76CysSer: 2.76 ± 1.186
1.757CysThr: 1.757 ± 0.308
1.255CysVal: 1.255 ± 0.95
0.251CysTrp: 0.251 ± 0.245
1.506CysTyr: 1.506 ± 0.713
0.0CysXaa: 0.0 ± 0.0
Asp
2.76AspAla: 2.76 ± 0.755
1.255AspCys: 1.255 ± 0.844
4.768AspAsp: 4.768 ± 1.627
3.513AspGlu: 3.513 ± 0.785
3.011AspPhe: 3.011 ± 0.937
1.506AspGly: 1.506 ± 0.26
1.255AspHis: 1.255 ± 0.66
3.513AspIle: 3.513 ± 1.058
4.266AspLys: 4.266 ± 0.622
5.772AspLeu: 5.772 ± 1.188
1.004AspMet: 1.004 ± 0.256
1.255AspAsn: 1.255 ± 0.202
2.509AspPro: 2.509 ± 0.551
1.757AspGln: 1.757 ± 0.311
1.506AspArg: 1.506 ± 0.435
6.023AspSer: 6.023 ± 0.227
2.008AspThr: 2.008 ± 0.685
2.258AspVal: 2.258 ± 0.826
0.753AspTrp: 0.753 ± 0.498
0.753AspTyr: 0.753 ± 0.13
0.0AspXaa: 0.0 ± 0.0
Glu
4.015GluAla: 4.015 ± 0.637
1.255GluCys: 1.255 ± 0.844
4.517GluAsp: 4.517 ± 0.733
7.779GluGlu: 7.779 ± 0.934
3.764GluPhe: 3.764 ± 1.077
4.517GluGly: 4.517 ± 0.478
1.004GluHis: 1.004 ± 0.256
3.764GluIle: 3.764 ± 1.482
4.266GluLys: 4.266 ± 1.014
5.521GluLeu: 5.521 ± 1.49
2.258GluMet: 2.258 ± 0.423
2.258GluAsn: 2.258 ± 0.444
3.262GluPro: 3.262 ± 0.726
1.004GluGln: 1.004 ± 0.718
5.772GluArg: 5.772 ± 0.897
4.015GluSer: 4.015 ± 1.147
3.764GluThr: 3.764 ± 0.886
5.019GluVal: 5.019 ± 1.555
0.502GluTrp: 0.502 ± 0.306
2.76GluTyr: 2.76 ± 0.277
0.0GluXaa: 0.0 ± 0.0
Phe
2.258PheAla: 2.258 ± 0.613
1.506PheCys: 1.506 ± 0.804
2.008PheAsp: 2.008 ± 0.961
3.011PheGlu: 3.011 ± 0.28
2.258PhePhe: 2.258 ± 0.718
2.258PheGly: 2.258 ± 0.496
0.502PheHis: 0.502 ± 0.123
2.509PheIle: 2.509 ± 0.318
3.764PheLys: 3.764 ± 0.651
4.768PheLeu: 4.768 ± 1.191
1.506PheMet: 1.506 ± 0.254
2.258PheAsn: 2.258 ± 0.496
1.757PhePro: 1.757 ± 0.785
0.502PheGln: 0.502 ± 0.31
1.757PheArg: 1.757 ± 0.571
4.015PheSer: 4.015 ± 0.176
2.008PheThr: 2.008 ± 0.685
3.764PheVal: 3.764 ± 1.182
0.753PheTrp: 0.753 ± 0.357
0.753PheTyr: 0.753 ± 0.498
0.0PheXaa: 0.0 ± 0.0
Gly
4.015GlyAla: 4.015 ± 0.398
1.506GlyCys: 1.506 ± 0.369
2.008GlyAsp: 2.008 ± 0.83
3.262GlyGlu: 3.262 ± 1.093
3.764GlyPhe: 3.764 ± 0.747
3.262GlyGly: 3.262 ± 1.669
1.506GlyHis: 1.506 ± 0.551
3.262GlyIle: 3.262 ± 0.589
4.266GlyLys: 4.266 ± 0.282
5.521GlyLeu: 5.521 ± 1.185
1.757GlyMet: 1.757 ± 0.235
2.509GlyAsn: 2.509 ± 0.585
1.757GlyPro: 1.757 ± 0.593
0.753GlyGln: 0.753 ± 0.403
2.509GlyArg: 2.509 ± 1.123
5.27GlySer: 5.27 ± 0.83
2.76GlyThr: 2.76 ± 1.18
4.266GlyVal: 4.266 ± 0.655
0.502GlyTrp: 0.502 ± 0.491
2.008GlyTyr: 2.008 ± 0.227
0.0GlyXaa: 0.0 ± 0.0
His
0.753HisAla: 0.753 ± 0.357
0.251HisCys: 0.251 ± 0.245
1.506HisAsp: 1.506 ± 0.551
0.251HisGlu: 0.251 ± 0.153
1.506HisPhe: 1.506 ± 0.19
2.509HisGly: 2.509 ± 1.038
0.251HisHis: 0.251 ± 0.153
1.255HisIle: 1.255 ± 0.66
1.004HisLys: 1.004 ± 0.226
2.008HisLeu: 2.008 ± 0.527
0.0HisMet: 0.0 ± 0.0
0.753HisAsn: 0.753 ± 0.357
1.004HisPro: 1.004 ± 0.53
0.753HisGln: 0.753 ± 0.13
1.506HisArg: 1.506 ± 0.433
2.509HisSer: 2.509 ± 0.404
0.251HisThr: 0.251 ± 0.153
2.008HisVal: 2.008 ± 0.854
0.0HisTrp: 0.0 ± 0.0
1.004HisTyr: 1.004 ± 0.613
0.0HisXaa: 0.0 ± 0.0
Ile
5.27IleAla: 5.27 ± 1.194
0.753IleCys: 0.753 ± 0.357
3.513IleAsp: 3.513 ± 1.186
5.019IleGlu: 5.019 ± 0.932
2.008IlePhe: 2.008 ± 0.359
4.517IleGly: 4.517 ± 1.811
1.255IleHis: 1.255 ± 0.202
5.521IleIle: 5.521 ± 0.695
4.768IleLys: 4.768 ± 1.142
4.266IleLeu: 4.266 ± 1.54
1.757IleMet: 1.757 ± 0.226
2.76IleAsn: 2.76 ± 0.798
3.513IlePro: 3.513 ± 0.545
2.258IleGln: 2.258 ± 0.496
4.266IleArg: 4.266 ± 0.694
5.019IleSer: 5.019 ± 0.869
3.262IleThr: 3.262 ± 0.478
4.015IleVal: 4.015 ± 1.197
0.502IleTrp: 0.502 ± 0.306
2.258IleTyr: 2.258 ± 0.519
0.0IleXaa: 0.0 ± 0.0
Lys
3.262LysAla: 3.262 ± 0.362
2.258LysCys: 2.258 ± 1.181
3.011LysAsp: 3.011 ± 0.436
5.019LysGlu: 5.019 ± 0.767
2.258LysPhe: 2.258 ± 0.519
6.023LysGly: 6.023 ± 0.806
1.506LysHis: 1.506 ± 0.26
4.768LysIle: 4.768 ± 0.755
6.023LysLys: 6.023 ± 1.428
4.768LysLeu: 4.768 ± 1.249
3.513LysMet: 3.513 ± 0.868
2.509LysAsn: 2.509 ± 0.63
4.266LysPro: 4.266 ± 0.427
2.509LysGln: 2.509 ± 0.332
4.015LysArg: 4.015 ± 0.897
4.517LysSer: 4.517 ± 0.688
4.768LysThr: 4.768 ± 0.695
5.019LysVal: 5.019 ± 0.447
1.255LysTrp: 1.255 ± 0.402
3.764LysTyr: 3.764 ± 0.805
0.0LysXaa: 0.0 ± 0.0
Leu
4.517LeuAla: 4.517 ± 0.858
3.011LeuCys: 3.011 ± 0.321
4.015LeuAsp: 4.015 ± 0.765
6.524LeuGlu: 6.524 ± 0.853
4.517LeuPhe: 4.517 ± 1.915
5.521LeuGly: 5.521 ± 0.88
2.509LeuHis: 2.509 ± 1.011
6.023LeuIle: 6.023 ± 1.819
7.528LeuLys: 7.528 ± 0.986
8.281LeuLeu: 8.281 ± 0.789
2.509LeuMet: 2.509 ± 0.187
4.015LeuAsn: 4.015 ± 0.8
2.258LeuPro: 2.258 ± 0.898
2.258LeuGln: 2.258 ± 0.656
5.772LeuArg: 5.772 ± 1.31
8.03LeuSer: 8.03 ± 1.098
4.266LeuThr: 4.266 ± 0.138
5.772LeuVal: 5.772 ± 1.203
0.502LeuTrp: 0.502 ± 0.32
2.008LeuTyr: 2.008 ± 0.451
0.0LeuXaa: 0.0 ± 0.0
Met
1.506MetAla: 1.506 ± 0.662
0.502MetCys: 0.502 ± 0.306
1.506MetAsp: 1.506 ± 0.43
3.262MetGlu: 3.262 ± 0.362
1.506MetPhe: 1.506 ± 0.551
2.008MetGly: 2.008 ± 0.715
0.753MetHis: 0.753 ± 0.432
2.509MetIle: 2.509 ± 0.665
1.757MetLys: 1.757 ± 0.837
3.262MetLeu: 3.262 ± 0.672
2.008MetMet: 2.008 ± 1.034
1.255MetAsn: 1.255 ± 0.202
0.251MetPro: 0.251 ± 0.153
2.509MetGln: 2.509 ± 0.318
1.506MetArg: 1.506 ± 0.369
4.015MetSer: 4.015 ± 0.91
1.255MetThr: 1.255 ± 0.474
0.502MetVal: 0.502 ± 0.306
0.251MetTrp: 0.251 ± 0.245
0.753MetTyr: 0.753 ± 0.13
0.0MetXaa: 0.0 ± 0.0
Asn
1.255AsnAla: 1.255 ± 0.309
1.506AsnCys: 1.506 ± 1.089
2.008AsnAsp: 2.008 ± 0.451
2.509AsnGlu: 2.509 ± 1.348
1.255AsnPhe: 1.255 ± 0.202
1.506AsnGly: 1.506 ± 0.713
0.502AsnHis: 0.502 ± 0.123
2.258AsnIle: 2.258 ± 0.423
4.015AsnLys: 4.015 ± 1.115
4.517AsnLeu: 4.517 ± 0.718
0.753AsnMet: 0.753 ± 0.579
1.757AsnAsn: 1.757 ± 0.311
3.262AsnPro: 3.262 ± 0.492
1.255AsnGln: 1.255 ± 0.402
2.76AsnArg: 2.76 ± 0.501
4.266AsnSer: 4.266 ± 0.43
2.509AsnThr: 2.509 ± 1.001
1.506AsnVal: 1.506 ± 0.435
0.502AsnTrp: 0.502 ± 0.306
1.506AsnTyr: 1.506 ± 0.254
0.0AsnXaa: 0.0 ± 0.0
Pro
2.76ProAla: 2.76 ± 0.99
0.251ProCys: 0.251 ± 0.245
1.757ProAsp: 1.757 ± 0.311
4.266ProGlu: 4.266 ± 1.265
1.757ProPhe: 1.757 ± 0.308
2.76ProGly: 2.76 ± 0.501
0.502ProHis: 0.502 ± 0.491
2.509ProIle: 2.509 ± 0.49
3.513ProLys: 3.513 ± 1.308
2.76ProLeu: 2.76 ± 0.443
1.255ProMet: 1.255 ± 0.474
1.757ProAsn: 1.757 ± 1.163
1.004ProPro: 1.004 ± 0.64
1.255ProGln: 1.255 ± 0.202
2.008ProArg: 2.008 ± 0.296
4.266ProSer: 4.266 ± 0.648
3.262ProThr: 3.262 ± 1.183
2.008ProVal: 2.008 ± 0.685
1.255ProTrp: 1.255 ± 0.649
1.255ProTyr: 1.255 ± 0.202
0.0ProXaa: 0.0 ± 0.0
Gln
1.255GlnAla: 1.255 ± 0.784
1.255GlnCys: 1.255 ± 0.474
0.502GlnAsp: 0.502 ± 0.491
2.008GlnGlu: 2.008 ± 0.715
1.255GlnPhe: 1.255 ± 0.402
2.008GlnGly: 2.008 ± 0.685
1.004GlnHis: 1.004 ± 0.613
2.76GlnIle: 2.76 ± 0.45
1.255GlnLys: 1.255 ± 0.474
3.513GlnLeu: 3.513 ± 1.184
0.251GlnMet: 0.251 ± 0.153
1.255GlnAsn: 1.255 ± 0.474
1.757GlnPro: 1.757 ± 0.261
0.753GlnGln: 0.753 ± 0.13
1.255GlnArg: 1.255 ± 0.66
2.76GlnSer: 2.76 ± 0.277
1.004GlnThr: 1.004 ± 0.613
2.258GlnVal: 2.258 ± 0.423
0.251GlnTrp: 0.251 ± 0.245
0.753GlnTyr: 0.753 ± 0.13
0.0GlnXaa: 0.0 ± 0.0
Arg
3.513ArgAla: 3.513 ± 1.102
1.506ArgCys: 1.506 ± 0.369
3.764ArgAsp: 3.764 ± 0.592
4.266ArgGlu: 4.266 ± 0.535
2.509ArgPhe: 2.509 ± 1.15
2.258ArgGly: 2.258 ± 0.791
0.502ArgHis: 0.502 ± 0.31
2.76ArgIle: 2.76 ± 0.743
3.262ArgLys: 3.262 ± 1.223
4.768ArgLeu: 4.768 ± 0.93
1.004ArgMet: 1.004 ± 0.525
3.011ArgAsn: 3.011 ± 0.321
2.008ArgPro: 2.008 ± 0.227
1.757ArgGln: 1.757 ± 0.593
1.757ArgArg: 1.757 ± 0.308
3.764ArgSer: 3.764 ± 0.793
2.008ArgThr: 2.008 ± 0.296
4.015ArgVal: 4.015 ± 0.385
1.004ArgTrp: 1.004 ± 0.525
1.757ArgTyr: 1.757 ± 0.956
0.0ArgXaa: 0.0 ± 0.0
Ser
4.517SerAla: 4.517 ± 0.161
2.258SerCys: 2.258 ± 1.07
5.27SerAsp: 5.27 ± 0.804
6.775SerGlu: 6.775 ± 0.436
3.513SerPhe: 3.513 ± 0.144
3.513SerGly: 3.513 ± 0.809
1.757SerHis: 1.757 ± 0.571
6.524SerIle: 6.524 ± 0.763
7.277SerLys: 7.277 ± 1.338
8.281SerLeu: 8.281 ± 0.824
2.258SerMet: 2.258 ± 0.423
3.764SerAsn: 3.764 ± 0.783
4.015SerPro: 4.015 ± 0.453
2.258SerGln: 2.258 ± 0.656
3.513SerArg: 3.513 ± 0.384
8.03SerSer: 8.03 ± 1.169
5.27SerThr: 5.27 ± 0.806
6.524SerVal: 6.524 ± 0.98
2.008SerTrp: 2.008 ± 0.492
2.76SerTyr: 2.76 ± 0.277
0.0SerXaa: 0.0 ± 0.0
Thr
1.757ThrAla: 1.757 ± 1.021
1.255ThrCys: 1.255 ± 0.844
2.76ThrAsp: 2.76 ± 0.952
3.011ThrGlu: 3.011 ± 0.521
1.506ThrPhe: 1.506 ± 0.636
3.513ThrGly: 3.513 ± 0.384
1.506ThrHis: 1.506 ± 1.089
5.521ThrIle: 5.521 ± 1.165
3.011ThrLys: 3.011 ± 0.843
5.772ThrLeu: 5.772 ± 1.1
1.004ThrMet: 1.004 ± 0.748
2.76ThrAsn: 2.76 ± 0.542
3.011ThrPro: 3.011 ± 0.113
1.506ThrGln: 1.506 ± 0.26
2.258ThrArg: 2.258 ± 0.239
4.517ThrSer: 4.517 ± 0.54
4.517ThrThr: 4.517 ± 0.357
3.262ThrVal: 3.262 ± 0.672
0.502ThrTrp: 0.502 ± 0.31
0.753ThrTyr: 0.753 ± 0.33
0.0ThrXaa: 0.0 ± 0.0
Val
2.258ValAla: 2.258 ± 0.519
1.757ValCys: 1.757 ± 0.308
2.509ValAsp: 2.509 ± 0.751
4.266ValGlu: 4.266 ± 1.576
2.008ValPhe: 2.008 ± 0.715
2.509ValGly: 2.509 ± 0.756
1.757ValHis: 1.757 ± 0.311
3.262ValIle: 3.262 ± 0.662
6.023ValLys: 6.023 ± 0.775
4.768ValLeu: 4.768 ± 1.126
3.262ValMet: 3.262 ± 0.726
2.509ValAsn: 2.509 ± 0.551
1.757ValPro: 1.757 ± 0.837
2.509ValGln: 2.509 ± 0.756
3.513ValArg: 3.513 ± 0.321
7.779ValSer: 7.779 ± 0.783
3.513ValThr: 3.513 ± 0.522
4.517ValVal: 4.517 ± 0.53
0.251ValTrp: 0.251 ± 0.153
3.011ValTyr: 3.011 ± 0.764
0.0ValXaa: 0.0 ± 0.0
Trp
0.502TrpAla: 0.502 ± 0.123
0.502TrpCys: 0.502 ± 0.123
0.251TrpAsp: 0.251 ± 0.153
1.255TrpGlu: 1.255 ± 0.844
0.753TrpPhe: 0.753 ± 0.13
0.502TrpGly: 0.502 ± 0.123
0.0TrpHis: 0.0 ± 0.0
0.753TrpIle: 0.753 ± 0.459
1.757TrpLys: 1.757 ± 0.261
1.004TrpLeu: 1.004 ± 0.362
1.004TrpMet: 1.004 ± 0.613
0.502TrpAsn: 0.502 ± 0.306
0.753TrpPro: 0.753 ± 0.568
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.251TrpSer: 0.251 ± 0.153
1.506TrpThr: 1.506 ± 0.636
0.753TrpVal: 0.753 ± 0.33
0.251TrpTrp: 0.251 ± 0.153
0.251TrpTyr: 0.251 ± 0.153
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.004TyrAla: 1.004 ± 0.246
0.251TyrCys: 0.251 ± 0.245
2.258TyrAsp: 2.258 ± 0.519
2.008TyrGlu: 2.008 ± 0.83
1.757TyrPhe: 1.757 ± 0.243
2.258TyrGly: 2.258 ± 0.23
0.502TyrHis: 0.502 ± 0.306
1.255TyrIle: 1.255 ± 0.844
2.008TyrLys: 2.008 ± 0.451
3.011TyrLeu: 3.011 ± 0.379
0.753TyrMet: 0.753 ± 0.13
1.255TyrAsn: 1.255 ± 0.402
1.506TyrPro: 1.506 ± 0.499
1.757TyrGln: 1.757 ± 0.607
1.757TyrArg: 1.757 ± 0.725
3.513TyrSer: 3.513 ± 0.861
1.506TyrThr: 1.506 ± 0.369
2.509TyrVal: 2.509 ± 0.618
0.251TyrTrp: 0.251 ± 0.153
0.502TyrTyr: 0.502 ± 0.31
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3986 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski