Amino acid dipepetide frequency for Lake Sinai virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.382AlaAla: 7.382 ± 1.256
1.969AlaCys: 1.969 ± 0.96
6.89AlaAsp: 6.89 ± 1.472
4.429AlaGlu: 4.429 ± 1.369
4.429AlaPhe: 4.429 ± 0.79
3.445AlaGly: 3.445 ± 0.093
0.984AlaHis: 0.984 ± 0.326
2.461AlaIle: 2.461 ± 0.575
2.953AlaLys: 2.953 ± 1.527
5.906AlaLeu: 5.906 ± 1.283
0.492AlaMet: 0.492 ± 0.282
0.984AlaAsn: 0.984 ± 0.395
5.906AlaPro: 5.906 ± 0.604
0.984AlaGln: 0.984 ± 0.807
5.413AlaArg: 5.413 ± 0.878
7.382AlaSer: 7.382 ± 2.286
4.921AlaThr: 4.921 ± 0.619
3.445AlaVal: 3.445 ± 0.77
1.969AlaTrp: 1.969 ± 1.159
4.921AlaTyr: 4.921 ± 0.779
0.0AlaXaa: 0.0 ± 0.0
Cys
2.461CysAla: 2.461 ± 0.309
0.984CysCys: 0.984 ± 0.908
1.476CysAsp: 1.476 ± 0.517
0.984CysGlu: 0.984 ± 0.326
0.0CysPhe: 0.0 ± 0.0
1.476CysGly: 1.476 ± 0.703
0.492CysHis: 0.492 ± 0.361
0.984CysIle: 0.984 ± 0.807
0.0CysLys: 0.0 ± 0.0
3.937CysLeu: 3.937 ± 1.172
0.0CysMet: 0.0 ± 0.0
0.492CysAsn: 0.492 ± 0.454
0.984CysPro: 0.984 ± 0.326
1.969CysGln: 1.969 ± 0.216
2.953CysArg: 2.953 ± 1.034
4.921CysSer: 4.921 ± 1.883
0.492CysThr: 0.492 ± 0.404
0.984CysVal: 0.984 ± 0.326
0.0CysTrp: 0.0 ± 0.0
0.492CysTyr: 0.492 ± 0.454
0.0CysXaa: 0.0 ± 0.0
Asp
3.445AspAla: 3.445 ± 1.158
1.476AspCys: 1.476 ± 1.361
3.445AspAsp: 3.445 ± 0.682
2.461AspGlu: 2.461 ± 0.575
2.953AspPhe: 2.953 ± 0.547
5.906AspGly: 5.906 ± 0.463
1.969AspHis: 1.969 ± 0.216
3.445AspIle: 3.445 ± 0.093
2.461AspLys: 2.461 ± 0.39
6.398AspLeu: 6.398 ± 1.321
0.492AspMet: 0.492 ± 0.361
1.969AspAsn: 1.969 ± 0.79
3.937AspPro: 3.937 ± 1.012
2.461AspGln: 2.461 ± 0.575
1.476AspArg: 1.476 ± 0.64
3.445AspSer: 3.445 ± 0.84
3.937AspThr: 3.937 ± 2.23
0.492AspVal: 0.492 ± 0.404
0.492AspTrp: 0.492 ± 0.404
2.953AspTyr: 2.953 ± 0.387
0.0AspXaa: 0.0 ± 0.0
Glu
3.445GluAla: 3.445 ± 2.006
0.0GluCys: 0.0 ± 0.0
0.492GluAsp: 0.492 ± 0.361
0.0GluGlu: 0.0 ± 0.0
0.492GluPhe: 0.492 ± 0.361
4.921GluGly: 4.921 ± 1.468
1.476GluHis: 1.476 ± 0.703
2.461GluIle: 2.461 ± 0.996
1.476GluLys: 1.476 ± 0.151
1.969GluLeu: 1.969 ± 0.653
0.492GluMet: 0.492 ± 0.454
0.492GluAsn: 0.492 ± 0.361
1.969GluPro: 1.969 ± 1.299
0.984GluGln: 0.984 ± 0.908
1.476GluArg: 1.476 ± 0.517
2.461GluSer: 2.461 ± 1.803
1.476GluThr: 1.476 ± 0.713
2.953GluVal: 2.953 ± 0.746
0.0GluTrp: 0.0 ± 0.0
1.969GluTyr: 1.969 ± 1.138
0.0GluXaa: 0.0 ± 0.0
Phe
1.476PheAla: 1.476 ± 0.151
1.969PheCys: 1.969 ± 1.138
3.937PheAsp: 3.937 ± 1.019
0.492PheGlu: 0.492 ± 0.361
1.969PhePhe: 1.969 ± 0.79
2.953PheGly: 2.953 ± 1.652
0.984PheHis: 0.984 ± 0.721
2.461PheIle: 2.461 ± 0.39
0.984PheLys: 0.984 ± 0.721
1.969PheLeu: 1.969 ± 0.549
1.969PheMet: 1.969 ± 0.829
2.461PheAsn: 2.461 ± 1.0
2.461PhePro: 2.461 ± 1.48
0.984PheGln: 0.984 ± 0.326
2.953PheArg: 2.953 ± 0.547
2.953PheSer: 2.953 ± 0.387
2.461PheThr: 2.461 ± 1.48
6.89PheVal: 6.89 ± 1.472
0.492PheTrp: 0.492 ± 0.454
1.476PheTyr: 1.476 ± 0.151
0.0PheXaa: 0.0 ± 0.0
Gly
3.937GlyAla: 3.937 ± 1.656
1.476GlyCys: 1.476 ± 0.876
5.906GlyAsp: 5.906 ± 0.604
0.492GlyGlu: 0.492 ± 0.361
5.413GlyPhe: 5.413 ± 0.959
1.476GlyGly: 1.476 ± 0.64
2.461GlyHis: 2.461 ± 1.803
4.429GlyIle: 4.429 ± 0.453
1.969GlyLys: 1.969 ± 1.089
5.906GlyLeu: 5.906 ± 0.229
0.492GlyMet: 0.492 ± 0.454
1.476GlyAsn: 1.476 ± 0.799
4.921GlyPro: 4.921 ± 1.478
0.984GlyGln: 0.984 ± 0.721
1.476GlyArg: 1.476 ± 1.082
2.953GlySer: 2.953 ± 0.944
2.461GlyThr: 2.461 ± 0.39
3.445GlyVal: 3.445 ± 0.84
0.984GlyTrp: 0.984 ± 0.509
1.476GlyTyr: 1.476 ± 0.64
0.0GlyXaa: 0.0 ± 0.0
His
2.461HisAla: 2.461 ± 0.575
0.492HisCys: 0.492 ± 0.454
2.461HisAsp: 2.461 ± 1.803
1.476HisGlu: 1.476 ± 0.64
0.492HisPhe: 0.492 ± 0.361
1.969HisGly: 1.969 ± 0.829
0.984HisHis: 0.984 ± 0.326
0.984HisIle: 0.984 ± 0.908
0.0HisLys: 0.0 ± 0.0
2.461HisLeu: 2.461 ± 0.575
0.492HisMet: 0.492 ± 0.361
0.0HisAsn: 0.0 ± 0.0
1.969HisPro: 1.969 ± 0.829
0.492HisGln: 0.492 ± 0.404
3.937HisArg: 3.937 ± 1.659
2.461HisSer: 2.461 ± 0.786
1.476HisThr: 1.476 ± 1.361
1.969HisVal: 1.969 ± 0.829
0.492HisTrp: 0.492 ± 0.361
1.969HisTyr: 1.969 ± 0.515
0.0HisXaa: 0.0 ± 0.0
Ile
1.476IleAla: 1.476 ± 0.713
1.476IleCys: 1.476 ± 0.64
2.953IleAsp: 2.953 ± 0.746
2.953IleGlu: 2.953 ± 0.979
0.492IlePhe: 0.492 ± 0.361
1.969IleGly: 1.969 ± 0.549
2.461IleHis: 2.461 ± 1.17
4.429IleIle: 4.429 ± 0.901
0.984IleLys: 0.984 ± 0.509
3.445IleLeu: 3.445 ± 0.642
1.476IleMet: 1.476 ± 0.151
1.476IleAsn: 1.476 ± 0.151
2.461IlePro: 2.461 ± 0.996
0.984IleGln: 0.984 ± 0.807
1.476IleArg: 1.476 ± 0.799
6.398IleSer: 6.398 ± 2.091
4.429IleThr: 4.429 ± 1.143
0.984IleVal: 0.984 ± 0.326
0.0IleTrp: 0.0 ± 0.0
1.969IleTyr: 1.969 ± 1.089
0.0IleXaa: 0.0 ± 0.0
Lys
0.984LysAla: 0.984 ± 0.908
0.492LysCys: 0.492 ± 0.454
0.984LysAsp: 0.984 ± 0.908
0.0LysGlu: 0.0 ± 0.0
2.461LysPhe: 2.461 ± 1.08
1.969LysGly: 1.969 ± 0.829
0.492LysHis: 0.492 ± 0.404
1.969LysIle: 1.969 ± 1.299
0.492LysLys: 0.492 ± 0.361
2.461LysLeu: 2.461 ± 0.309
0.984LysMet: 0.984 ± 0.477
0.492LysAsn: 0.492 ± 0.404
1.969LysPro: 1.969 ± 1.615
0.492LysGln: 0.492 ± 0.454
3.445LysArg: 3.445 ± 1.196
0.984LysSer: 0.984 ± 0.395
2.461LysThr: 2.461 ± 1.0
2.953LysVal: 2.953 ± 0.944
0.0LysTrp: 0.0 ± 0.0
0.492LysTyr: 0.492 ± 0.454
0.0LysXaa: 0.0 ± 0.0
Leu
8.858LeuAla: 8.858 ± 0.905
2.461LeuCys: 2.461 ± 0.575
4.921LeuAsp: 4.921 ± 0.778
2.953LeuGlu: 2.953 ± 0.302
3.445LeuPhe: 3.445 ± 0.84
4.921LeuGly: 4.921 ± 0.619
1.969LeuHis: 1.969 ± 0.216
3.445LeuIle: 3.445 ± 1.196
2.953LeuLys: 2.953 ± 1.007
9.35LeuLeu: 9.35 ± 2.229
0.492LeuMet: 0.492 ± 0.361
4.921LeuAsn: 4.921 ± 1.54
4.429LeuPro: 4.429 ± 1.143
2.953LeuGln: 2.953 ± 1.752
10.827LeuArg: 10.827 ± 1.743
11.811LeuSer: 11.811 ± 1.083
5.413LeuThr: 5.413 ± 2.442
6.398LeuVal: 6.398 ± 3.228
0.492LeuTrp: 0.492 ± 0.361
3.445LeuTyr: 3.445 ± 1.167
0.0LeuXaa: 0.0 ± 0.0
Met
0.984MetAla: 0.984 ± 0.908
0.492MetCys: 0.492 ± 0.361
0.492MetAsp: 0.492 ± 0.361
0.492MetGlu: 0.492 ± 0.404
0.492MetPhe: 0.492 ± 0.361
0.984MetGly: 0.984 ± 0.395
0.492MetHis: 0.492 ± 0.454
0.984MetIle: 0.984 ± 0.509
0.0MetLys: 0.0 ± 0.0
2.461MetLeu: 2.461 ± 1.277
1.476MetMet: 1.476 ± 0.151
0.984MetAsn: 0.984 ± 0.807
2.461MetPro: 2.461 ± 0.913
0.492MetGln: 0.492 ± 0.361
0.492MetArg: 0.492 ± 0.454
2.461MetSer: 2.461 ± 0.786
0.0MetThr: 0.0 ± 0.0
0.984MetVal: 0.984 ± 0.721
0.492MetTrp: 0.492 ± 0.361
1.476MetTyr: 1.476 ± 0.703
0.0MetXaa: 0.0 ± 0.0
Asn
0.984AsnAla: 0.984 ± 0.509
0.492AsnCys: 0.492 ± 0.404
2.461AsnAsp: 2.461 ± 0.658
0.984AsnGlu: 0.984 ± 0.721
1.969AsnPhe: 1.969 ± 0.515
2.953AsnGly: 2.953 ± 0.302
1.476AsnHis: 1.476 ± 0.517
0.984AsnIle: 0.984 ± 0.509
0.984AsnLys: 0.984 ± 0.326
2.461AsnLeu: 2.461 ± 0.309
0.492AsnMet: 0.492 ± 0.404
1.476AsnAsn: 1.476 ± 0.151
3.937AsnPro: 3.937 ± 1.669
0.492AsnGln: 0.492 ± 0.404
3.937AsnArg: 3.937 ± 0.361
1.969AsnSer: 1.969 ± 0.79
1.969AsnThr: 1.969 ± 0.79
2.461AsnVal: 2.461 ± 1.48
1.969AsnTrp: 1.969 ± 0.79
0.492AsnTyr: 0.492 ± 0.361
0.0AsnXaa: 0.0 ± 0.0
Pro
5.906ProAla: 5.906 ± 0.932
0.492ProCys: 0.492 ± 0.454
3.445ProAsp: 3.445 ± 0.618
0.984ProGlu: 0.984 ± 0.326
2.461ProPhe: 2.461 ± 0.658
2.461ProGly: 2.461 ± 0.658
3.937ProHis: 3.937 ± 0.683
4.429ProIle: 4.429 ± 1.543
1.476ProLys: 1.476 ± 0.713
7.382ProLeu: 7.382 ± 1.757
1.476ProMet: 1.476 ± 0.799
1.969ProAsn: 1.969 ± 1.159
4.921ProPro: 4.921 ± 0.793
0.492ProGln: 0.492 ± 0.404
4.921ProArg: 4.921 ± 1.992
8.858ProSer: 8.858 ± 1.638
5.413ProThr: 5.413 ± 1.047
3.445ProVal: 3.445 ± 0.642
0.984ProTrp: 0.984 ± 0.721
1.476ProTyr: 1.476 ± 0.151
0.0ProXaa: 0.0 ± 0.0
Gln
1.476GlnAla: 1.476 ± 0.713
0.492GlnCys: 0.492 ± 0.454
0.492GlnAsp: 0.492 ± 0.361
0.0GlnGlu: 0.0 ± 0.0
0.0GlnPhe: 0.0 ± 0.0
0.984GlnGly: 0.984 ± 0.509
1.476GlnHis: 1.476 ± 0.517
0.984GlnIle: 0.984 ± 0.395
0.492GlnLys: 0.492 ± 0.454
3.445GlnLeu: 3.445 ± 1.087
0.0GlnMet: 0.0 ± 0.0
0.984GlnAsn: 0.984 ± 0.326
1.476GlnPro: 1.476 ± 0.713
1.476GlnGln: 1.476 ± 0.713
1.969GlnArg: 1.969 ± 0.515
1.969GlnSer: 1.969 ± 1.089
3.445GlnThr: 3.445 ± 2.168
1.476GlnVal: 1.476 ± 0.64
0.0GlnTrp: 0.0 ± 0.0
3.937GlnTyr: 3.937 ± 1.427
0.0GlnXaa: 0.0 ± 0.0
Arg
4.921ArgAla: 4.921 ± 0.619
2.953ArgCys: 2.953 ± 1.407
4.429ArgAsp: 4.429 ± 0.484
3.937ArgGlu: 3.937 ± 0.683
4.921ArgPhe: 4.921 ± 1.867
2.953ArgGly: 2.953 ± 1.185
1.476ArgHis: 1.476 ± 0.151
1.969ArgIle: 1.969 ± 0.79
0.984ArgLys: 0.984 ± 0.326
6.89ArgLeu: 6.89 ± 1.103
0.492ArgMet: 0.492 ± 0.361
5.906ArgAsn: 5.906 ± 1.283
2.953ArgPro: 2.953 ± 0.764
2.461ArgGln: 2.461 ± 0.309
7.382ArgArg: 7.382 ± 2.891
6.89ArgSer: 6.89 ± 1.929
3.937ArgThr: 3.937 ± 1.012
6.89ArgVal: 6.89 ± 1.363
1.476ArgTrp: 1.476 ± 0.151
2.461ArgTyr: 2.461 ± 0.786
0.0ArgXaa: 0.0 ± 0.0
Ser
8.366SerAla: 8.366 ± 2.317
1.969SerCys: 1.969 ± 0.829
4.429SerAsp: 4.429 ± 0.484
1.476SerGlu: 1.476 ± 0.799
3.445SerPhe: 3.445 ± 0.84
4.429SerGly: 4.429 ± 0.839
1.476SerHis: 1.476 ± 1.082
2.953SerIle: 2.953 ± 0.935
2.953SerLys: 2.953 ± 1.007
9.35SerLeu: 9.35 ± 0.812
2.461SerMet: 2.461 ± 1.737
1.969SerAsn: 1.969 ± 0.515
8.366SerPro: 8.366 ± 2.45
2.461SerGln: 2.461 ± 0.309
9.35SerArg: 9.35 ± 0.724
10.827SerSer: 10.827 ± 1.113
8.366SerThr: 8.366 ± 0.164
8.366SerVal: 8.366 ± 2.445
2.461SerTrp: 2.461 ± 1.17
2.953SerTyr: 2.953 ± 1.034
0.0SerXaa: 0.0 ± 0.0
Thr
6.89ThrAla: 6.89 ± 1.109
0.492ThrCys: 0.492 ± 0.404
0.984ThrAsp: 0.984 ± 0.326
0.984ThrGlu: 0.984 ± 0.509
3.445ThrPhe: 3.445 ± 1.295
3.445ThrGly: 3.445 ± 0.77
1.476ThrHis: 1.476 ± 0.517
0.984ThrIle: 0.984 ± 0.509
1.969ThrLys: 1.969 ± 0.549
8.858ThrLeu: 8.858 ± 3.62
0.492ThrMet: 0.492 ± 0.361
1.969ThrAsn: 1.969 ± 0.96
3.937ThrPro: 3.937 ± 1.098
1.476ThrGln: 1.476 ± 0.713
5.906ThrArg: 5.906 ± 1.283
5.906ThrSer: 5.906 ± 1.384
6.398ThrThr: 6.398 ± 1.354
5.413ThrVal: 5.413 ± 1.553
0.492ThrTrp: 0.492 ± 0.454
3.937ThrTyr: 3.937 ± 0.439
0.0ThrXaa: 0.0 ± 0.0
Val
7.382ValAla: 7.382 ± 1.725
1.476ValCys: 1.476 ± 0.517
1.969ValAsp: 1.969 ± 0.79
2.461ValGlu: 2.461 ± 0.39
2.953ValPhe: 2.953 ± 0.302
3.445ValGly: 3.445 ± 1.295
1.476ValHis: 1.476 ± 0.517
1.969ValIle: 1.969 ± 0.549
2.953ValLys: 2.953 ± 0.746
4.429ValLeu: 4.429 ± 1.543
1.969ValMet: 1.969 ± 1.089
1.969ValAsn: 1.969 ± 0.79
5.413ValPro: 5.413 ± 1.51
1.476ValGln: 1.476 ± 0.799
3.937ValArg: 3.937 ± 2.23
7.382ValSer: 7.382 ± 0.268
4.429ValThr: 4.429 ± 0.28
5.413ValVal: 5.413 ± 1.798
0.984ValTrp: 0.984 ± 0.509
2.461ValTyr: 2.461 ± 0.658
0.0ValXaa: 0.0 ± 0.0
Trp
0.984TrpAla: 0.984 ± 0.721
1.476TrpCys: 1.476 ± 1.082
0.0TrpAsp: 0.0 ± 0.0
0.984TrpGlu: 0.984 ± 0.908
0.492TrpPhe: 0.492 ± 0.361
0.492TrpGly: 0.492 ± 0.404
0.0TrpHis: 0.0 ± 0.0
0.492TrpIle: 0.492 ± 0.361
0.0TrpLys: 0.0 ± 0.0
2.461TrpLeu: 2.461 ± 0.913
0.492TrpMet: 0.492 ± 0.404
1.476TrpAsn: 1.476 ± 0.64
0.492TrpPro: 0.492 ± 0.454
0.492TrpGln: 0.492 ± 0.454
0.0TrpArg: 0.0 ± 0.0
0.984TrpSer: 0.984 ± 0.395
0.984TrpThr: 0.984 ± 0.395
0.984TrpVal: 0.984 ± 0.908
0.0TrpTrp: 0.0 ± 0.0
0.984TrpTyr: 0.984 ± 0.395
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.937TyrAla: 3.937 ± 0.361
2.953TyrCys: 2.953 ± 0.547
3.445TyrAsp: 3.445 ± 0.84
2.461TyrGlu: 2.461 ± 1.583
1.969TyrPhe: 1.969 ± 0.549
0.984TyrGly: 0.984 ± 0.326
1.476TyrHis: 1.476 ± 0.151
1.476TyrIle: 1.476 ± 0.713
0.492TyrLys: 0.492 ± 0.404
4.921TyrLeu: 4.921 ± 1.54
1.969TyrMet: 1.969 ± 0.216
1.476TyrAsn: 1.476 ± 0.713
1.969TyrPro: 1.969 ± 0.549
1.969TyrGln: 1.969 ± 0.216
2.953TyrArg: 2.953 ± 0.764
5.413TyrSer: 5.413 ± 1.048
0.984TyrThr: 0.984 ± 0.807
0.0TyrVal: 0.0 ± 0.0
0.492TyrTrp: 0.492 ± 0.361
5.906TyrTyr: 5.906 ± 1.384
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2033 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski