Amino acid dipepetide frequency for Lishi spider virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.765AlaAla: 2.765 ± 1.788
1.536AlaCys: 1.536 ± 0.736
0.614AlaAsp: 0.614 ± 0.295
3.379AlaGlu: 3.379 ± 2.225
3.072AlaPhe: 3.072 ± 1.115
1.536AlaGly: 1.536 ± 0.475
0.922AlaHis: 0.922 ± 0.442
2.765AlaIle: 2.765 ± 0.584
2.765AlaLys: 2.765 ± 1.483
6.144AlaLeu: 6.144 ± 2.148
0.614AlaMet: 0.614 ± 0.295
3.072AlaAsn: 3.072 ± 1.683
3.072AlaPro: 3.072 ± 0.186
2.458AlaGln: 2.458 ± 1.282
2.151AlaArg: 2.151 ± 0.311
3.994AlaSer: 3.994 ± 2.032
2.765AlaThr: 2.765 ± 1.252
3.072AlaVal: 3.072 ± 2.314
0.307AlaTrp: 0.307 ± 0.147
4.301AlaTyr: 4.301 ± 0.496
0.0AlaXaa: 0.0 ± 0.0
Cys
0.922CysAla: 0.922 ± 1.0
0.0CysCys: 0.0 ± 0.0
0.922CysAsp: 0.922 ± 0.263
1.229CysGlu: 1.229 ± 0.317
0.614CysPhe: 0.614 ± 0.295
0.922CysGly: 0.922 ± 0.263
0.614CysHis: 0.614 ± 0.462
2.458CysIle: 2.458 ± 0.634
1.843CysLys: 1.843 ± 0.541
2.151CysLeu: 2.151 ± 0.674
0.614CysMet: 0.614 ± 0.636
1.843CysAsn: 1.843 ± 0.526
0.614CysPro: 0.614 ± 0.295
0.922CysGln: 0.922 ± 0.442
0.614CysArg: 0.614 ± 1.09
1.229CysSer: 1.229 ± 0.741
0.614CysThr: 0.614 ± 0.295
1.536CysVal: 1.536 ± 0.418
0.307CysTrp: 0.307 ± 0.371
1.229CysTyr: 1.229 ± 0.571
0.0CysXaa: 0.0 ± 0.0
Asp
2.151AspAla: 2.151 ± 1.327
0.922AspCys: 0.922 ± 0.645
1.843AspAsp: 1.843 ± 0.452
2.765AspGlu: 2.765 ± 0.886
0.922AspPhe: 0.922 ± 0.417
1.843AspGly: 1.843 ± 0.526
1.229AspHis: 1.229 ± 0.589
5.53AspIle: 5.53 ± 0.418
3.379AspLys: 3.379 ± 1.291
5.53AspLeu: 5.53 ± 0.991
0.614AspMet: 0.614 ± 0.295
1.536AspAsn: 1.536 ± 0.883
1.229AspPro: 1.229 ± 0.589
0.614AspGln: 0.614 ± 0.295
3.687AspArg: 3.687 ± 1.709
2.765AspSer: 2.765 ± 0.572
3.994AspThr: 3.994 ± 0.413
3.379AspVal: 3.379 ± 1.007
0.922AspTrp: 0.922 ± 0.417
1.843AspTyr: 1.843 ± 0.526
0.0AspXaa: 0.0 ± 0.0
Glu
2.765GluAla: 2.765 ± 1.77
1.536GluCys: 1.536 ± 1.382
2.458GluAsp: 2.458 ± 1.178
3.687GluGlu: 3.687 ± 1.082
2.458GluPhe: 2.458 ± 0.812
2.151GluGly: 2.151 ± 0.564
2.765GluHis: 2.765 ± 0.11
3.994GluIle: 3.994 ± 1.213
3.379GluLys: 3.379 ± 1.238
6.759GluLeu: 6.759 ± 1.448
0.922GluMet: 0.922 ± 0.442
3.687GluAsn: 3.687 ± 0.868
1.229GluPro: 1.229 ± 0.925
1.229GluGln: 1.229 ± 0.589
3.379GluArg: 3.379 ± 0.876
6.144GluSer: 6.144 ± 0.976
5.223GluThr: 5.223 ± 0.738
5.53GluVal: 5.53 ± 2.066
0.307GluTrp: 0.307 ± 0.545
1.229GluTyr: 1.229 ± 0.589
0.0GluXaa: 0.0 ± 0.0
Phe
2.151PheAla: 2.151 ± 0.793
0.614PheCys: 0.614 ± 0.295
3.072PheAsp: 3.072 ± 0.186
2.765PheGlu: 2.765 ± 1.326
0.614PhePhe: 0.614 ± 0.462
2.151PheGly: 2.151 ± 0.674
0.922PheHis: 0.922 ± 0.442
2.458PheIle: 2.458 ± 0.306
0.922PheLys: 0.922 ± 0.442
5.837PheLeu: 5.837 ± 0.565
0.614PheMet: 0.614 ± 0.286
1.536PheAsn: 1.536 ± 0.475
2.458PhePro: 2.458 ± 0.812
0.614PheGln: 0.614 ± 1.09
3.072PheArg: 3.072 ± 1.24
3.687PheSer: 3.687 ± 0.868
2.151PheThr: 2.151 ± 0.683
2.765PheVal: 2.765 ± 0.921
0.307PheTrp: 0.307 ± 0.147
2.765PheTyr: 2.765 ± 0.44
0.0PheXaa: 0.0 ± 0.0
Gly
1.843GlyAla: 1.843 ± 0.154
0.307GlyCys: 0.307 ± 0.147
1.843GlyAsp: 1.843 ± 0.775
3.072GlyGlu: 3.072 ± 0.186
3.072GlyPhe: 3.072 ± 0.186
3.072GlyGly: 3.072 ± 0.186
1.229GlyHis: 1.229 ± 0.317
3.687GlyIle: 3.687 ± 0.735
2.151GlyLys: 2.151 ± 1.279
4.301GlyLeu: 4.301 ± 1.357
1.536GlyMet: 1.536 ± 0.994
0.922GlyAsn: 0.922 ± 1.125
1.229GlyPro: 1.229 ± 1.649
0.922GlyGln: 0.922 ± 0.442
2.151GlyArg: 2.151 ± 0.826
3.072GlySer: 3.072 ± 0.465
3.687GlyThr: 3.687 ± 0.309
1.536GlyVal: 1.536 ± 0.232
0.922GlyTrp: 0.922 ± 0.645
3.072GlyTyr: 3.072 ± 0.453
0.0GlyXaa: 0.0 ± 0.0
His
0.922HisAla: 0.922 ± 1.634
0.0HisCys: 0.0 ± 0.0
0.922HisAsp: 0.922 ± 0.417
3.379HisGlu: 3.379 ± 0.387
1.536HisPhe: 1.536 ± 0.736
0.922HisGly: 0.922 ± 0.645
0.614HisHis: 0.614 ± 0.295
2.765HisIle: 2.765 ± 0.952
1.843HisLys: 1.843 ± 0.884
3.687HisLeu: 3.687 ± 0.469
0.614HisMet: 0.614 ± 1.09
0.922HisAsn: 0.922 ± 0.263
1.536HisPro: 1.536 ± 0.529
0.614HisGln: 0.614 ± 0.295
1.229HisArg: 1.229 ± 0.317
2.458HisSer: 2.458 ± 1.178
1.536HisThr: 1.536 ± 0.736
1.843HisVal: 1.843 ± 0.526
0.307HisTrp: 0.307 ± 0.147
1.229HisTyr: 1.229 ± 0.357
0.0HisXaa: 0.0 ± 0.0
Ile
3.687IleAla: 3.687 ± 0.868
1.536IleCys: 1.536 ± 0.418
2.458IleAsp: 2.458 ± 1.058
2.765IleGlu: 2.765 ± 0.727
2.458IlePhe: 2.458 ± 0.844
3.379IleGly: 3.379 ± 1.54
2.151IleHis: 2.151 ± 0.793
4.608IleIle: 4.608 ± 0.049
5.53IleLys: 5.53 ± 0.811
5.837IleLeu: 5.837 ± 1.56
1.843IleMet: 1.843 ± 0.526
3.687IleAsn: 3.687 ± 0.904
2.765IlePro: 2.765 ± 0.921
4.608IleGln: 4.608 ± 1.35
4.301IleArg: 4.301 ± 1.342
6.144IleSer: 6.144 ± 0.976
3.072IleThr: 3.072 ± 0.763
3.687IleVal: 3.687 ± 0.735
1.536IleTrp: 1.536 ± 0.529
4.301IleTyr: 4.301 ± 1.617
0.0IleXaa: 0.0 ± 0.0
Lys
0.922LysAla: 0.922 ± 0.442
1.843LysCys: 1.843 ± 0.857
2.458LysAsp: 2.458 ± 0.634
2.458LysGlu: 2.458 ± 0.792
3.379LysPhe: 3.379 ± 0.601
2.458LysGly: 2.458 ± 0.306
0.614LysHis: 0.614 ± 0.295
3.994LysIle: 3.994 ± 0.413
0.614LysLys: 0.614 ± 0.286
6.144LysLeu: 6.144 ± 1.83
0.0LysMet: 0.0 ± 0.0
3.687LysAsn: 3.687 ± 0.458
1.843LysPro: 1.843 ± 0.452
1.229LysGln: 1.229 ± 0.589
2.458LysArg: 2.458 ± 0.306
2.765LysSer: 2.765 ± 0.11
4.301LysThr: 4.301 ± 1.143
5.837LysVal: 5.837 ± 0.911
0.922LysTrp: 0.922 ± 1.0
2.151LysTyr: 2.151 ± 0.809
0.0LysXaa: 0.0 ± 0.0
Leu
6.452LeuAla: 6.452 ± 0.83
1.536LeuCys: 1.536 ± 0.475
7.988LeuAsp: 7.988 ± 2.509
5.53LeuGlu: 5.53 ± 1.454
3.994LeuPhe: 3.994 ± 1.081
4.301LeuGly: 4.301 ± 0.988
3.072LeuHis: 3.072 ± 1.055
7.066LeuIle: 7.066 ± 2.416
5.53LeuLys: 5.53 ± 0.991
11.367LeuLeu: 11.367 ± 1.65
3.379LeuMet: 3.379 ± 0.366
5.223LeuAsn: 5.223 ± 0.744
4.608LeuPro: 4.608 ± 1.379
2.765LeuGln: 2.765 ± 0.952
10.138LeuArg: 10.138 ± 2.55
9.831LeuSer: 9.831 ± 1.525
3.379LeuThr: 3.379 ± 1.007
6.452LeuVal: 6.452 ± 1.436
0.0LeuTrp: 0.0 ± 0.0
3.072LeuTyr: 3.072 ± 0.921
0.0LeuXaa: 0.0 ± 0.0
Met
1.843MetAla: 1.843 ± 1.352
0.0MetCys: 0.0 ± 0.0
0.614MetAsp: 0.614 ± 0.295
1.843MetGlu: 1.843 ± 0.526
0.922MetPhe: 0.922 ± 0.263
0.922MetGly: 0.922 ± 0.645
0.0MetHis: 0.0 ± 0.0
1.843MetIle: 1.843 ± 0.452
0.922MetLys: 0.922 ± 0.263
2.765MetLeu: 2.765 ± 0.55
0.0MetMet: 0.0 ± 0.0
1.843MetAsn: 1.843 ± 0.452
0.922MetPro: 0.922 ± 0.263
0.614MetGln: 0.614 ± 0.295
1.536MetArg: 1.536 ± 0.475
1.843MetSer: 1.843 ± 0.775
2.765MetThr: 2.765 ± 0.44
1.536MetVal: 1.536 ± 0.926
0.307MetTrp: 0.307 ± 0.147
1.843MetTyr: 1.843 ± 0.835
0.0MetXaa: 0.0 ± 0.0
Asn
2.151AsnAla: 2.151 ± 0.674
1.229AsnCys: 1.229 ± 0.357
1.843AsnAsp: 1.843 ± 1.423
2.151AsnGlu: 2.151 ± 0.193
2.765AsnPhe: 2.765 ± 0.886
1.536AsnGly: 1.536 ± 0.926
2.458AsnHis: 2.458 ± 0.306
2.765AsnIle: 2.765 ± 1.091
0.614AsnLys: 0.614 ± 0.295
6.144AsnLeu: 6.144 ± 0.615
1.229AsnMet: 1.229 ± 0.571
3.994AsnAsn: 3.994 ± 1.014
2.458AsnPro: 2.458 ± 0.634
2.151AsnGln: 2.151 ± 1.279
2.458AsnArg: 2.458 ± 0.634
3.994AsnSer: 3.994 ± 0.672
3.994AsnThr: 3.994 ± 0.323
3.687AsnVal: 3.687 ± 0.666
1.229AsnTrp: 1.229 ± 0.357
3.994AsnTyr: 3.994 ± 0.672
0.0AsnXaa: 0.0 ± 0.0
Pro
1.843ProAla: 1.843 ± 0.835
2.151ProCys: 2.151 ± 0.671
3.379ProAsp: 3.379 ± 0.317
1.843ProGlu: 1.843 ± 0.541
2.151ProPhe: 2.151 ± 0.671
1.536ProGly: 1.536 ± 0.418
0.922ProHis: 0.922 ± 0.442
2.765ProIle: 2.765 ± 0.584
2.151ProLys: 2.151 ± 0.564
1.843ProLeu: 1.843 ± 0.154
2.458ProMet: 2.458 ± 0.634
2.765ProAsn: 2.765 ± 1.494
1.229ProPro: 1.229 ± 0.422
1.843ProGln: 1.843 ± 0.452
1.536ProArg: 1.536 ± 0.418
3.379ProSer: 3.379 ± 1.192
3.994ProThr: 3.994 ± 0.323
1.843ProVal: 1.843 ± 0.154
0.307ProTrp: 0.307 ± 0.147
1.229ProTyr: 1.229 ± 0.317
0.0ProXaa: 0.0 ± 0.0
Gln
1.843GlnAla: 1.843 ± 0.989
0.614GlnCys: 0.614 ± 0.286
1.536GlnAsp: 1.536 ± 0.232
1.843GlnGlu: 1.843 ± 0.541
1.536GlnPhe: 1.536 ± 0.475
1.843GlnGly: 1.843 ± 0.775
1.229GlnHis: 1.229 ± 0.422
0.922GlnIle: 0.922 ± 0.442
1.536GlnLys: 1.536 ± 0.475
3.994GlnLeu: 3.994 ± 1.081
0.0GlnMet: 0.0 ± 0.0
0.307GlnAsn: 0.307 ± 0.147
0.307GlnPro: 0.307 ± 0.147
1.536GlnGln: 1.536 ± 1.13
1.536GlnArg: 1.536 ± 0.418
3.072GlnSer: 3.072 ± 0.763
3.687GlnThr: 3.687 ± 0.909
1.536GlnVal: 1.536 ± 0.736
1.536GlnTrp: 1.536 ± 0.232
1.536GlnTyr: 1.536 ± 0.418
0.0GlnXaa: 0.0 ± 0.0
Arg
4.301ArgAla: 4.301 ± 1.436
0.922ArgCys: 0.922 ± 0.263
3.379ArgAsp: 3.379 ± 0.317
3.072ArgGlu: 3.072 ± 1.095
2.151ArgPhe: 2.151 ± 0.674
2.151ArgGly: 2.151 ± 0.826
1.536ArgHis: 1.536 ± 0.596
3.379ArgIle: 3.379 ± 1.007
3.379ArgLys: 3.379 ± 0.876
6.759ArgLeu: 6.759 ± 1.368
1.536ArgMet: 1.536 ± 0.537
2.765ArgAsn: 2.765 ± 0.572
1.536ArgPro: 1.536 ± 0.475
1.843ArgGln: 1.843 ± 1.387
2.765ArgArg: 2.765 ± 0.55
4.608ArgSer: 4.608 ± 1.46
3.072ArgThr: 3.072 ± 0.836
3.072ArgVal: 3.072 ± 1.074
0.922ArgTrp: 0.922 ± 0.442
2.458ArgTyr: 2.458 ± 0.785
0.0ArgXaa: 0.0 ± 0.0
Ser
3.687SerAla: 3.687 ± 0.309
2.765SerCys: 2.765 ± 0.952
3.072SerAsp: 3.072 ± 0.186
5.223SerGlu: 5.223 ± 0.744
2.458SerPhe: 2.458 ± 0.181
2.765SerGly: 2.765 ± 1.483
2.765SerHis: 2.765 ± 0.44
6.759SerIle: 6.759 ± 0.798
3.994SerLys: 3.994 ± 0.413
8.602SerLeu: 8.602 ± 2.098
2.765SerMet: 2.765 ± 1.091
5.223SerAsn: 5.223 ± 1.383
4.301SerPro: 4.301 ± 1.143
1.843SerGln: 1.843 ± 0.526
2.151SerArg: 2.151 ± 0.671
5.223SerSer: 5.223 ± 1.214
6.759SerThr: 6.759 ± 0.678
4.301SerVal: 4.301 ± 1.019
1.843SerTrp: 1.843 ± 0.541
3.687SerTyr: 3.687 ± 0.372
0.0SerXaa: 0.0 ± 0.0
Thr
3.687ThrAla: 3.687 ± 2.197
1.536ThrCys: 1.536 ± 0.883
2.765ThrAsp: 2.765 ± 2.409
4.301ThrGlu: 4.301 ± 1.306
3.072ThrPhe: 3.072 ± 0.949
1.536ThrGly: 1.536 ± 0.418
1.536ThrHis: 1.536 ± 0.596
1.843ThrIle: 1.843 ± 0.452
3.379ThrLys: 3.379 ± 0.724
7.988ThrLeu: 7.988 ± 0.826
2.458ThrMet: 2.458 ± 0.612
3.687ThrAsn: 3.687 ± 1.053
3.072ThrPro: 3.072 ± 1.095
2.458ThrGln: 2.458 ± 0.181
5.223ThrArg: 5.223 ± 0.728
4.916ThrSer: 4.916 ± 0.129
6.452ThrThr: 6.452 ± 1.691
3.687ThrVal: 3.687 ± 1.124
0.922ThrTrp: 0.922 ± 0.263
4.608ThrTyr: 4.608 ± 0.49
0.0ThrXaa: 0.0 ± 0.0
Val
3.379ValAla: 3.379 ± 1.022
0.307ValCys: 0.307 ± 0.545
2.765ValAsp: 2.765 ± 1.143
5.223ValGlu: 5.223 ± 1.056
2.765ValPhe: 2.765 ± 0.952
3.072ValGly: 3.072 ± 3.541
2.458ValHis: 2.458 ± 0.667
4.916ValIle: 4.916 ± 0.726
2.458ValLys: 2.458 ± 0.306
5.53ValLeu: 5.53 ± 0.786
0.922ValMet: 0.922 ± 0.417
3.379ValAsn: 3.379 ± 0.82
3.072ValPro: 3.072 ± 0.186
1.843ValGln: 1.843 ± 0.541
2.151ValArg: 2.151 ± 0.671
6.452ValSer: 6.452 ± 2.228
3.687ValThr: 3.687 ± 1.337
3.072ValVal: 3.072 ± 0.453
1.229ValTrp: 1.229 ± 0.317
2.151ValTyr: 2.151 ± 0.671
0.0ValXaa: 0.0 ± 0.0
Trp
0.922TrpAla: 0.922 ± 0.417
0.0TrpCys: 0.0 ± 0.0
1.536TrpAsp: 1.536 ± 1.063
0.922TrpGlu: 0.922 ± 0.263
0.0TrpPhe: 0.0 ± 0.0
0.922TrpGly: 0.922 ± 0.442
0.0TrpHis: 0.0 ± 0.0
0.614TrpIle: 0.614 ± 0.295
1.229TrpLys: 1.229 ± 0.317
0.614TrpLeu: 0.614 ± 0.462
0.0TrpMet: 0.0 ± 0.0
1.229TrpAsn: 1.229 ± 0.317
0.922TrpPro: 0.922 ± 0.417
0.307TrpGln: 0.307 ± 0.147
1.536TrpArg: 1.536 ± 0.418
0.922TrpSer: 0.922 ± 0.645
0.614TrpThr: 0.614 ± 0.462
0.307TrpVal: 0.307 ± 0.147
0.0TrpTrp: 0.0 ± 0.0
1.536TrpTyr: 1.536 ± 0.232
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.072TyrAla: 3.072 ± 0.186
1.843TyrCys: 1.843 ± 0.154
1.229TyrAsp: 1.229 ± 1.012
3.379TyrGlu: 3.379 ± 0.317
1.536TyrPhe: 1.536 ± 0.736
4.608TyrGly: 4.608 ± 0.958
1.843TyrHis: 1.843 ± 0.541
5.223TyrIle: 5.223 ± 1.383
2.458TyrLys: 2.458 ± 0.306
3.687TyrLeu: 3.687 ± 1.082
2.458TyrMet: 2.458 ± 1.57
1.536TyrAsn: 1.536 ± 0.926
2.765TyrPro: 2.765 ± 0.44
1.536TyrGln: 1.536 ± 0.418
1.843TyrArg: 1.843 ± 0.452
3.687TyrSer: 3.687 ± 0.666
3.379TyrThr: 3.379 ± 0.82
2.151TyrVal: 2.151 ± 0.311
0.0TyrTrp: 0.0 ± 0.0
0.922TyrTyr: 0.922 ± 0.442
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3256 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski