Amino acid dipeptide frequency for Lake Sinai virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
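The page does not spell out how the pairs are counted, so the following is only a minimal sketch under stated assumptions: overlapping pairs are counted within each protein (never across protein boundaries), any residue outside the 20 standard one-letter codes is mapped to X (Xaa), and the result is normalised by the total number of pairs. The function name and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything that is not a standard residue becomes 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs are counted inside each protein only.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the Lake Sinai virus proteome.
freqs = dipeptide_per_mille(["MKAAL", "AAB"])
```

Here "MKAAL" contributes four pairs and "AAB" (read as "AAX") contributes two, so `freqs["AA"]` is 2 out of 6 pairs expressed in per mille.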

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
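The arithmetic behind the expected value can be sketched as follows. The comparison rule used here (strictly above or below the uniform expectation) is an assumption made for illustration; the page does not state the exact threshold behind its red and blue marking, and the function names are hypothetical.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"    # would correspond to red on the page
    if observed_per_mille < expected:
        return "under"   # would correspond to blue on the page
    return "expected"

uniform_20 = expected_per_mille(20)   # 20 x 20 = 400 dipeptides
uniform_21 = expected_per_mille(21)   # 21 x 21 = 441 dipeptides with Xaa
leu_ser = classify(11.693)            # LeuSer value from the table
cys_his = classify(0.468)             # CysHis value from the table
```

With 20 amino acids the expectation is 2.5‰, so LeuSer (11.693‰) falls on the overrepresented side and CysHis (0.468‰) on the underrepresented side.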

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
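As a small worked example of the unit conversion, the helpers below (hypothetical names, not part of Proteome-pI) turn a per mille value into a plain fraction or a percentage.

```python
def per_mille_to_fraction(value):
    """Convert per mille to a fraction, e.g. 2.5 per mille -> 0.0025."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert per mille to percent, e.g. 2.5 per mille -> 0.25 percent."""
    return value / 10.0

ala_ala_fraction = per_mille_to_fraction(9.355)  # AlaAla value from the table
uniform_percent = per_mille_to_percent(2.5)      # uniform expectation for 400 dipeptides
```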

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.355 ± 2.768
AlaCys: 1.403 ± 0.637
AlaAsp: 6.548 ± 2.115
AlaGlu: 3.742 ± 2.212
AlaPhe: 4.677 ± 0.329
AlaGly: 5.145 ± 1.179
AlaHis: 0.935 ± 0.378
AlaIle: 4.677 ± 0.624
AlaLys: 3.742 ± 0.95
AlaLeu: 7.484 ± 3.386
AlaMet: 1.871 ± 1.026
AlaAsn: 1.403 ± 0.637
AlaPro: 6.548 ± 0.692
AlaGln: 1.403 ± 1.583
AlaArg: 6.548 ± 1.067
AlaSer: 7.484 ± 2.138
AlaThr: 5.613 ± 0.309
AlaVal: 4.21 ± 1.314
AlaTrp: 1.871 ± 1.34
AlaTyr: 4.21 ± 1.425
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.339 ± 0.608
CysCys: 0.935 ± 0.882
CysAsp: 1.403 ± 0.576
CysGlu: 1.403 ± 0.744
CysPhe: 0.0 ± 0.0
CysGly: 0.935 ± 0.378
CysHis: 0.468 ± 0.347
CysIle: 0.935 ± 0.963
CysLys: 0.0 ± 0.0
CysLeu: 4.21 ± 1.278
CysMet: 0.0 ± 0.0
CysAsn: 0.468 ± 0.441
CysPro: 1.871 ± 0.756
CysGln: 1.403 ± 0.242
CysArg: 2.806 ± 0.956
CysSer: 3.742 ± 1.511
CysThr: 0.468 ± 0.481
CysVal: 3.274 ± 2.199
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.21 ± 1.181
AspCys: 1.403 ± 0.744
AspAsp: 3.742 ± 1.098
AspGlu: 1.403 ± 0.918
AspPhe: 2.339 ± 1.043
AspGly: 7.016 ± 1.362
AspHis: 0.935 ± 0.378
AspIle: 2.806 ± 0.664
AspLys: 2.339 ± 0.565
AspLeu: 4.677 ± 1.944
AspMet: 0.468 ± 0.347
AspAsn: 1.403 ± 0.86
AspPro: 3.742 ± 0.72
AspGln: 1.871 ± 0.924
AspArg: 2.806 ± 0.942
AspSer: 4.677 ± 1.395
AspThr: 3.742 ± 2.27
AspVal: 2.339 ± 0.413
AspTrp: 0.935 ± 0.445
AspTyr: 2.806 ± 0.69
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.339 ± 1.242
GluCys: 0.468 ± 0.773
GluAsp: 1.403 ± 1.041
GluGlu: 0.468 ± 0.441
GluPhe: 0.468 ± 0.347
GluGly: 2.806 ± 1.004
GluHis: 1.403 ± 0.744
GluIle: 1.871 ± 1.764
GluLys: 0.935 ± 0.378
GluLeu: 1.871 ± 0.873
GluMet: 0.468 ± 0.441
GluAsn: 0.468 ± 0.481
GluPro: 2.339 ± 0.972
GluGln: 0.0 ± 0.0
GluArg: 1.871 ± 1.151
GluSer: 2.806 ± 1.531
GluThr: 1.871 ± 0.891
GluVal: 2.806 ± 1.064
GluTrp: 0.0 ± 0.0
GluTyr: 2.339 ± 1.594
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.871 ± 0.662
PheCys: 2.339 ± 1.095
PheAsp: 2.339 ± 1.446
PheGlu: 0.468 ± 0.347
PhePhe: 2.339 ± 0.564
PheGly: 3.274 ± 1.909
PheHis: 1.403 ± 0.918
PheIle: 3.742 ± 1.626
PheLys: 0.935 ± 0.378
PheLeu: 1.403 ± 1.323
PheMet: 1.871 ± 0.873
PheAsn: 1.871 ± 0.924
PhePro: 2.339 ± 1.793
PheGln: 1.403 ± 0.637
PheArg: 1.871 ± 1.388
PheSer: 2.806 ± 0.524
PheThr: 3.274 ± 1.716
PheVal: 4.677 ± 0.624
PheTrp: 0.468 ± 0.441
PheTyr: 2.806 ± 1.228
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.742 ± 1.098
GlyCys: 1.871 ± 0.554
GlyAsp: 3.742 ± 1.582
GlyGlu: 0.935 ± 0.378
GlyPhe: 4.21 ± 0.686
GlyGly: 2.806 ± 0.942
GlyHis: 1.403 ± 0.918
GlyIle: 4.677 ± 1.096
GlyLys: 1.871 ± 1.321
GlyLeu: 5.613 ± 1.553
GlyMet: 1.403 ± 0.996
GlyAsn: 2.339 ± 0.565
GlyPro: 5.145 ± 1.692
GlyGln: 0.468 ± 0.347
GlyArg: 1.403 ± 0.918
GlySer: 4.677 ± 0.735
GlyThr: 1.403 ± 0.86
GlyVal: 4.21 ± 1.971
GlyTrp: 1.403 ± 0.242
GlyTyr: 1.871 ± 0.891
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.339 ± 0.608
HisCys: 0.468 ± 0.441
HisAsp: 0.935 ± 0.694
HisGlu: 1.871 ± 0.924
HisPhe: 0.468 ± 0.347
HisGly: 1.403 ± 0.737
HisHis: 0.935 ± 0.378
HisIle: 1.403 ± 0.744
HisLys: 0.468 ± 0.347
HisLeu: 2.806 ± 0.69
HisMet: 0.468 ± 0.394
HisAsn: 0.468 ± 0.347
HisPro: 2.339 ± 0.91
HisGln: 0.468 ± 0.481
HisArg: 2.806 ± 1.531
HisSer: 0.468 ± 0.347
HisThr: 1.403 ± 1.104
HisVal: 3.742 ± 1.384
HisTrp: 0.468 ± 0.347
HisTyr: 1.871 ± 0.554
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.21 ± 1.029
IleCys: 0.468 ± 0.347
IleAsp: 3.274 ± 1.221
IleGlu: 1.403 ± 0.744
IlePhe: 1.403 ± 0.893
IleGly: 0.935 ± 0.378
IleHis: 2.339 ± 0.91
IleIle: 2.339 ± 1.095
IleLys: 1.403 ± 0.828
IleLeu: 3.742 ± 1.235
IleMet: 0.468 ± 0.347
IleAsn: 0.0 ± 0.0
IlePro: 3.742 ± 1.491
IleGln: 1.871 ± 1.321
IleArg: 0.468 ± 0.347
IleSer: 7.951 ± 1.628
IleThr: 3.274 ± 0.872
IleVal: 2.339 ± 0.91
IleTrp: 0.0 ± 0.0
IleTyr: 0.935 ± 0.963
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.871 ± 0.306
LysCys: 0.468 ± 0.441
LysAsp: 0.935 ± 0.882
LysGlu: 0.0 ± 0.0
LysPhe: 0.935 ± 0.694
LysGly: 1.403 ± 0.576
LysHis: 1.403 ± 0.637
LysIle: 1.871 ± 1.223
LysLys: 0.0 ± 0.0
LysLeu: 3.274 ± 0.733
LysMet: 1.403 ± 0.268
LysAsn: 0.468 ± 0.481
LysPro: 1.403 ± 0.86
LysGln: 0.468 ± 0.441
LysArg: 1.871 ± 0.306
LysSer: 0.935 ± 0.882
LysThr: 1.871 ± 0.891
LysVal: 2.339 ± 0.972
LysTrp: 0.0 ± 0.0
LysTyr: 1.871 ± 0.662
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.887 ± 2.719
LeuCys: 2.339 ± 1.392
LeuAsp: 6.08 ± 0.636
LeuGlu: 1.871 ± 0.662
LeuPhe: 2.339 ± 1.446
LeuGly: 6.08 ± 1.221
LeuHis: 1.871 ± 0.306
LeuIle: 3.742 ± 1.217
LeuLys: 3.274 ± 0.733
LeuLeu: 10.29 ± 1.012
LeuMet: 0.468 ± 0.347
LeuAsn: 3.742 ± 2.125
LeuPro: 4.677 ± 1.797
LeuGln: 1.871 ± 1.764
LeuArg: 8.887 ± 1.72
LeuSer: 11.693 ± 1.711
LeuThr: 6.548 ± 1.953
LeuVal: 7.951 ± 3.498
LeuTrp: 0.468 ± 0.347
LeuTyr: 4.21 ± 0.456
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.871 ± 0.634
MetCys: 0.468 ± 0.347
MetAsp: 0.468 ± 0.347
MetGlu: 0.468 ± 0.481
MetPhe: 0.468 ± 0.347
MetGly: 1.403 ± 0.242
MetHis: 0.468 ± 0.441
MetIle: 0.468 ± 0.441
MetLys: 0.0 ± 0.0
MetLeu: 1.871 ± 1.034
MetMet: 0.468 ± 0.481
MetAsn: 0.468 ± 0.481
MetPro: 2.339 ± 1.631
MetGln: 0.468 ± 0.347
MetArg: 0.935 ± 0.882
MetSer: 2.339 ± 0.413
MetThr: 0.0 ± 0.0
MetVal: 1.871 ± 0.306
MetTrp: 0.0 ± 0.0
MetTyr: 1.403 ± 0.744
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.806 ± 0.524
AsnCys: 0.935 ± 0.445
AsnAsp: 1.403 ± 0.893
AsnGlu: 0.468 ± 0.347
AsnPhe: 2.339 ± 1.804
AsnGly: 1.403 ± 0.893
AsnHis: 0.935 ± 0.694
AsnIle: 1.871 ± 0.306
AsnLys: 1.403 ± 0.744
AsnLeu: 3.274 ± 0.429
AsnMet: 0.0 ± 0.0
AsnAsn: 1.403 ± 0.893
AsnPro: 3.742 ± 0.95
AsnGln: 0.935 ± 0.963
AsnArg: 2.806 ± 0.664
AsnSer: 1.403 ± 0.637
AsnThr: 0.935 ± 0.445
AsnVal: 2.339 ± 1.463
AsnTrp: 1.403 ± 0.86
AsnTyr: 0.468 ± 0.441
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.806 ± 1.228
ProCys: 0.468 ± 0.441
ProAsp: 3.742 ± 1.231
ProGlu: 0.935 ± 0.378
ProPhe: 3.274 ± 0.733
ProGly: 2.339 ± 0.724
ProHis: 4.677 ± 1.222
ProIle: 2.806 ± 1.0
ProLys: 1.403 ± 0.637
ProLeu: 5.613 ± 2.645
ProMet: 1.871 ± 1.028
ProAsn: 3.274 ± 1.661
ProPro: 2.806 ± 0.484
ProGln: 1.403 ± 1.187
ProArg: 7.016 ± 2.786
ProSer: 6.08 ± 1.829
ProThr: 8.419 ± 2.197
ProVal: 3.274 ± 0.897
ProTrp: 1.403 ± 0.576
ProTyr: 1.403 ± 0.744
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.339 ± 2.019
GlnCys: 0.468 ± 0.441
GlnAsp: 0.468 ± 0.347
GlnGlu: 0.468 ± 0.347
GlnPhe: 0.0 ± 0.0
GlnGly: 1.403 ± 0.893
GlnHis: 0.468 ± 0.347
GlnIle: 1.871 ± 0.634
GlnLys: 0.468 ± 0.441
GlnLeu: 4.21 ± 1.396
GlnMet: 0.0 ± 0.0
GlnAsn: 0.935 ± 0.378
GlnPro: 1.403 ± 0.86
GlnGln: 1.871 ± 1.051
GlnArg: 1.871 ± 0.306
GlnSer: 2.806 ± 1.861
GlnThr: 1.871 ± 1.223
GlnVal: 1.403 ± 0.918
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.806 ± 1.786
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.484 ± 1.879
ArgCys: 3.274 ± 1.138
ArgAsp: 6.08 ± 1.607
ArgGlu: 1.871 ± 0.554
ArgPhe: 5.145 ± 1.547
ArgGly: 3.742 ± 1.556
ArgHis: 1.871 ± 0.554
ArgIle: 0.935 ± 0.445
ArgLys: 0.935 ± 0.694
ArgLeu: 6.548 ± 1.067
ArgMet: 0.0 ± 0.0
ArgAsn: 4.677 ± 1.533
ArgPro: 2.339 ± 1.132
ArgGln: 2.806 ± 0.563
ArgArg: 7.951 ± 2.722
ArgSer: 3.742 ± 0.393
ArgThr: 4.677 ± 1.321
ArgVal: 5.145 ± 1.629
ArgTrp: 1.403 ± 0.242
ArgTyr: 3.274 ± 1.138
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.419 ± 0.427
SerCys: 2.339 ± 0.962
SerAsp: 4.21 ± 0.898
SerGlu: 1.871 ± 0.306
SerPhe: 4.21 ± 1.501
SerGly: 3.742 ± 0.393
SerHis: 0.935 ± 0.694
SerIle: 2.339 ± 0.91
SerLys: 1.403 ± 0.242
SerLeu: 7.951 ± 1.308
SerMet: 2.806 ± 1.47
SerAsn: 1.871 ± 0.306
SerPro: 7.484 ± 1.389
SerGln: 1.871 ± 0.306
SerArg: 8.887 ± 1.523
SerSer: 11.693 ± 2.192
SerThr: 7.016 ± 0.489
SerVal: 7.951 ± 1.665
SerTrp: 2.339 ± 1.197
SerTyr: 2.806 ± 1.216
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.484 ± 0.733
ThrCys: 0.935 ± 0.445
ThrAsp: 0.468 ± 0.347
ThrGlu: 1.871 ± 1.034
ThrPhe: 3.742 ± 2.212
ThrGly: 3.274 ± 1.057
ThrHis: 0.935 ± 0.694
ThrIle: 0.0 ± 0.0
ThrLys: 1.403 ± 0.828
ThrLeu: 8.419 ± 2.007
ThrMet: 0.935 ± 0.774
ThrAsn: 2.339 ± 0.565
ThrPro: 3.274 ± 1.442
ThrGln: 2.339 ± 0.724
ThrArg: 3.742 ± 1.563
ThrSer: 6.08 ± 0.942
ThrThr: 6.548 ± 0.596
ThrVal: 5.145 ± 1.866
ThrTrp: 0.935 ± 0.882
ThrTyr: 5.145 ± 0.705
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.887 ± 1.0
ValCys: 2.339 ± 0.962
ValAsp: 3.274 ± 0.7
ValGlu: 3.742 ± 0.393
ValPhe: 3.274 ± 0.935
ValGly: 4.21 ± 1.501
ValHis: 2.339 ± 1.446
ValIle: 1.871 ± 1.034
ValLys: 2.806 ± 0.798
ValLeu: 6.08 ± 1.36
ValMet: 0.468 ± 0.347
ValAsn: 1.871 ± 0.924
ValPro: 5.145 ± 1.89
ValGln: 1.403 ± 0.893
ValArg: 6.08 ± 1.291
ValSer: 4.677 ± 0.771
ValThr: 4.677 ± 2.189
ValVal: 5.613 ± 2.441
ValTrp: 1.403 ± 0.787
ValTyr: 2.339 ± 0.564
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.935 ± 0.694
TrpCys: 0.468 ± 0.347
TrpAsp: 0.468 ± 0.347
TrpGlu: 0.935 ± 0.882
TrpPhe: 0.468 ± 0.347
TrpGly: 0.468 ± 0.481
TrpHis: 0.0 ± 0.0
TrpIle: 0.935 ± 0.694
TrpLys: 0.0 ± 0.0
TrpLeu: 2.806 ± 1.228
TrpMet: 0.935 ± 0.514
TrpAsn: 0.935 ± 0.445
TrpPro: 0.468 ± 0.441
TrpGln: 0.468 ± 0.441
TrpArg: 0.0 ± 0.0
TrpSer: 0.935 ± 0.445
TrpThr: 0.935 ± 0.445
TrpVal: 1.403 ± 0.744
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.935 ± 0.445
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.677 ± 0.329
TyrCys: 2.806 ± 0.524
TyrAsp: 4.677 ± 0.934
TyrGlu: 2.806 ± 1.488
TyrPhe: 1.403 ± 0.242
TyrGly: 1.403 ± 0.576
TyrHis: 2.339 ± 0.608
TyrIle: 1.403 ± 1.187
TyrLys: 0.0 ± 0.0
TyrLeu: 5.145 ± 2.152
TyrMet: 1.403 ± 0.637
TyrAsn: 1.871 ± 1.321
TyrPro: 1.871 ± 1.028
TyrGln: 2.339 ± 0.564
TyrArg: 3.274 ± 1.362
TyrSer: 5.145 ± 0.615
TyrThr: 0.468 ± 0.347
TyrVal: 0.468 ± 0.481
TyrTrp: 0.0 ± 0.0
TyrTyr: 4.677 ± 1.345
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2139 amino acids)
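For readers who want to work with the listed values programmatically, the entries of the form `AlaAla: 9.355 ± 2.768` can be parsed with a short regular expression. This is only a sketch: the function name is hypothetical, the pattern is an assumption based on how the entries appear on this page, and it tolerates a leading duplicate of the value before the dipeptide name.

```python
import re

# Two three-letter residue codes, a colon, the mean, "±", and the error.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")

def parse_entries(text):
    """Return {(first, second): (mean_per_mille, error_per_mille)}."""
    table = {}
    for first, second, mean, err in ENTRY.findall(text):
        table[(first, second)] = (float(mean), float(err))
    return table

# Three entries copied from the table above, in both spellings.
sample = """9.355AlaAla: 9.355 ± 2.768
LeuSer: 11.693 ± 1.711
0.0AlaXaa: 0.0 ± 0.0"""

table = parse_entries(sample)
# Dipeptide with the highest mean frequency among the parsed entries.
top = max(table, key=lambda pair: table[pair][0])
```

Row labels such as `Ala` and the column header line contain no colon, so the pattern skips them.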

Note: The error has been estimated by bootstrapping (×100) at the protein level.
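A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The page only states that bootstrapping (×100) was done at the protein level; reporting the sample standard deviation, the fixed seed, and the function names are assumptions made for this illustration.

```python
import random
from collections import Counter
from statistics import stdev

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(resample, pair))
    return stdev(estimates)

# Toy proteins, not the Lake Sinai virus sequences.
identical = bootstrap_error(["AAAA", "AAAA"], "AA")
mixed = bootstrap_error(["AAAA", "CCCC"], "AA")
```

With a proteome of only 4 proteins, resampling at this level yields few distinct combinations, which is consistent with the fairly large errors relative to the means seen in the table.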

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski