Amino acid dipepetide frequency for Entoleuca phenui-like virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.502AlaAla: 4.502 ± 1.947
1.608AlaCys: 1.608 ± 1.368
1.929AlaAsp: 1.929 ± 0.944
3.859AlaGlu: 3.859 ± 1.519
2.572AlaPhe: 2.572 ± 1.012
1.929AlaGly: 1.929 ± 0.379
0.322AlaHis: 0.322 ± 0.734
2.572AlaIle: 2.572 ± 0.876
5.145AlaLys: 5.145 ± 0.833
4.823AlaLeu: 4.823 ± 2.569
2.251AlaMet: 2.251 ± 0.216
2.572AlaAsn: 2.572 ± 1.424
1.608AlaPro: 1.608 ± 1.368
1.608AlaGln: 1.608 ± 1.926
3.215AlaArg: 3.215 ± 2.142
6.109AlaSer: 6.109 ± 0.695
1.608AlaThr: 1.608 ± 0.466
4.823AlaVal: 4.823 ± 3.688
0.965AlaTrp: 0.965 ± 0.472
2.251AlaTyr: 2.251 ± 0.693
0.0AlaXaa: 0.0 ± 0.0
Cys
0.322CysAla: 0.322 ± 0.168
0.0CysCys: 0.0 ± 0.0
0.322CysAsp: 0.322 ± 0.168
0.322CysGlu: 0.322 ± 0.168
0.965CysPhe: 0.965 ± 0.504
1.286CysGly: 1.286 ± 0.672
0.965CysHis: 0.965 ± 0.504
0.322CysIle: 0.322 ± 0.168
1.286CysLys: 1.286 ± 0.506
1.929CysLeu: 1.929 ± 0.944
0.322CysMet: 0.322 ± 0.168
0.643CysAsn: 0.643 ± 0.623
0.322CysPro: 0.322 ± 0.674
0.965CysGln: 0.965 ± 0.542
0.643CysArg: 0.643 ± 0.336
1.608CysSer: 1.608 ± 0.524
0.643CysThr: 0.643 ± 0.336
0.643CysVal: 0.643 ± 0.557
0.322CysTrp: 0.322 ± 0.168
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.859AspAla: 3.859 ± 0.638
0.0AspCys: 0.0 ± 0.0
5.145AspAsp: 5.145 ± 0.728
4.823AspGlu: 4.823 ± 0.59
2.572AspPhe: 2.572 ± 0.807
1.608AspGly: 1.608 ± 0.84
0.965AspHis: 0.965 ± 0.504
5.145AspIle: 5.145 ± 0.833
6.109AspLys: 6.109 ± 0.926
3.859AspLeu: 3.859 ± 0.374
1.286AspMet: 1.286 ± 0.506
1.608AspAsn: 1.608 ± 1.019
2.251AspPro: 2.251 ± 0.96
3.215AspGln: 3.215 ± 1.101
2.572AspArg: 2.572 ± 0.876
5.145AspSer: 5.145 ± 3.229
2.572AspThr: 2.572 ± 1.798
3.859AspVal: 3.859 ± 2.015
0.965AspTrp: 0.965 ± 0.504
3.537AspTyr: 3.537 ± 1.003
0.0AspXaa: 0.0 ± 0.0
Glu
3.537GluAla: 3.537 ± 2.008
0.965GluCys: 0.965 ± 0.504
4.502GluAsp: 4.502 ± 0.432
7.717GluGlu: 7.717 ± 2.252
3.537GluPhe: 3.537 ± 2.584
2.572GluGly: 2.572 ± 1.012
0.965GluHis: 0.965 ± 0.504
4.502GluIle: 4.502 ± 1.728
5.145GluLys: 5.145 ± 1.308
7.074GluLeu: 7.074 ± 0.681
3.215GluMet: 3.215 ± 1.101
3.215GluAsn: 3.215 ± 0.306
1.608GluPro: 1.608 ± 0.466
3.859GluGln: 3.859 ± 2.015
3.215GluArg: 3.215 ± 0.532
5.788GluSer: 5.788 ± 0.294
3.215GluThr: 3.215 ± 0.532
5.145GluVal: 5.145 ± 1.005
1.608GluTrp: 1.608 ± 0.466
3.537GluTyr: 3.537 ± 0.471
0.0GluXaa: 0.0 ± 0.0
Phe
1.608PheAla: 1.608 ± 2.619
0.643PheCys: 0.643 ± 0.557
3.215PheAsp: 3.215 ± 1.894
1.286PheGlu: 1.286 ± 0.672
1.286PhePhe: 1.286 ± 0.438
1.608PheGly: 1.608 ± 0.466
0.965PheHis: 0.965 ± 0.504
3.537PheIle: 3.537 ± 0.619
1.286PheLys: 1.286 ± 0.438
4.823PheLeu: 4.823 ± 1.14
2.251PheMet: 2.251 ± 0.292
1.286PheAsn: 1.286 ± 0.672
1.286PhePro: 1.286 ± 0.712
3.215PheGln: 3.215 ± 0.306
1.286PheArg: 1.286 ± 0.438
2.894PheSer: 2.894 ± 1.257
3.215PheThr: 3.215 ± 1.091
2.251PheVal: 2.251 ± 0.216
0.643PheTrp: 0.643 ± 0.336
2.572PheTyr: 2.572 ± 1.343
0.0PheXaa: 0.0 ± 0.0
Gly
2.894GlyAla: 2.894 ± 1.471
0.0GlyCys: 0.0 ± 0.0
2.251GlyAsp: 2.251 ± 1.176
3.537GlyGlu: 3.537 ± 1.003
1.608GlyPhe: 1.608 ± 0.524
2.572GlyGly: 2.572 ± 1.798
0.643GlyHis: 0.643 ± 0.336
4.18GlyIle: 4.18 ± 1.202
5.145GlyLys: 5.145 ± 1.59
3.215GlyLeu: 3.215 ± 0.7
2.251GlyMet: 2.251 ± 0.216
2.894GlyAsn: 2.894 ± 1.017
3.215GlyPro: 3.215 ± 0.532
0.965GlyGln: 0.965 ± 1.352
3.537GlyArg: 3.537 ± 1.427
5.788GlySer: 5.788 ± 4.659
2.251GlyThr: 2.251 ± 0.661
4.18GlyVal: 4.18 ± 0.982
0.0GlyTrp: 0.0 ± 0.0
1.286GlyTyr: 1.286 ± 0.672
0.0GlyXaa: 0.0 ± 0.0
His
0.965HisAla: 0.965 ± 0.542
0.322HisCys: 0.322 ± 0.168
2.572HisAsp: 2.572 ± 0.795
1.286HisGlu: 1.286 ± 0.438
0.965HisPhe: 0.965 ± 0.472
0.643HisGly: 0.643 ± 0.336
0.322HisHis: 0.322 ± 0.168
0.965HisIle: 0.965 ± 0.504
1.608HisLys: 1.608 ± 0.466
0.965HisLeu: 0.965 ± 0.504
0.322HisMet: 0.322 ± 0.168
1.929HisAsn: 1.929 ± 1.008
1.608HisPro: 1.608 ± 0.84
0.643HisGln: 0.643 ± 0.336
0.965HisArg: 0.965 ± 0.504
0.965HisSer: 0.965 ± 0.504
0.965HisThr: 0.965 ± 0.504
1.608HisVal: 1.608 ± 0.466
0.0HisTrp: 0.0 ± 0.0
1.286HisTyr: 1.286 ± 0.672
0.0HisXaa: 0.0 ± 0.0
Ile
3.537IleAla: 3.537 ± 0.924
1.286IleCys: 1.286 ± 0.672
2.894IleAsp: 2.894 ± 0.94
5.788IleGlu: 5.788 ± 1.102
2.251IlePhe: 2.251 ± 0.216
4.502IleGly: 4.502 ± 1.728
1.608IleHis: 1.608 ± 0.466
0.965IleIle: 0.965 ± 0.504
2.572IleLys: 2.572 ± 1.343
6.752IleLeu: 6.752 ± 2.337
1.929IleMet: 1.929 ± 0.379
1.286IleAsn: 1.286 ± 0.672
2.251IlePro: 2.251 ± 0.895
2.894IleGln: 2.894 ± 0.889
2.894IleArg: 2.894 ± 1.257
6.109IleSer: 6.109 ± 2.058
3.215IleThr: 3.215 ± 0.532
2.894IleVal: 2.894 ± 1.511
1.286IleTrp: 1.286 ± 0.506
2.572IleTyr: 2.572 ± 1.343
0.0IleXaa: 0.0 ± 0.0
Lys
4.823LysAla: 4.823 ± 0.277
0.643LysCys: 0.643 ± 0.336
2.894LysAsp: 2.894 ± 0.663
4.823LysGlu: 4.823 ± 0.971
1.286LysPhe: 1.286 ± 1.438
5.788LysGly: 5.788 ± 1.88
2.894LysHis: 2.894 ± 0.889
4.502LysIle: 4.502 ± 1.536
5.788LysLys: 5.788 ± 1.774
7.717LysLeu: 7.717 ± 0.231
1.929LysMet: 1.929 ± 0.944
4.18LysAsn: 4.18 ± 1.322
2.572LysPro: 2.572 ± 0.795
0.965LysGln: 0.965 ± 0.542
3.859LysArg: 3.859 ± 0.638
6.752LysSer: 6.752 ± 2.147
4.502LysThr: 4.502 ± 1.726
5.466LysVal: 5.466 ± 0.962
0.965LysTrp: 0.965 ± 0.504
1.929LysTyr: 1.929 ± 0.379
0.0LysXaa: 0.0 ± 0.0
Leu
4.18LeuAla: 4.18 ± 1.43
1.286LeuCys: 1.286 ± 0.506
6.431LeuAsp: 6.431 ± 1.865
9.003LeuGlu: 9.003 ± 1.046
1.608LeuPhe: 1.608 ± 1.019
2.572LeuGly: 2.572 ± 0.077
1.929LeuHis: 1.929 ± 1.008
4.502LeuIle: 4.502 ± 1.323
6.752LeuLys: 6.752 ± 2.337
8.36LeuLeu: 8.36 ± 1.108
3.537LeuMet: 3.537 ± 0.619
4.823LeuAsn: 4.823 ± 0.59
1.929LeuPro: 1.929 ± 0.591
2.251LeuGln: 2.251 ± 1.592
8.36LeuArg: 8.36 ± 1.257
8.039LeuSer: 8.039 ± 0.667
6.109LeuThr: 6.109 ± 1.61
4.18LeuVal: 4.18 ± 1.278
2.251LeuTrp: 2.251 ± 1.176
3.215LeuTyr: 3.215 ± 1.101
0.0LeuXaa: 0.0 ± 0.0
Met
2.894MetAla: 2.894 ± 1.471
0.0MetCys: 0.0 ± 0.0
3.537MetAsp: 3.537 ± 0.924
2.894MetGlu: 2.894 ± 0.147
1.286MetPhe: 1.286 ± 0.672
2.894MetGly: 2.894 ± 1.667
0.643MetHis: 0.643 ± 0.336
1.286MetIle: 1.286 ± 0.672
3.859MetLys: 3.859 ± 0.374
1.929MetLeu: 1.929 ± 0.944
2.251MetMet: 2.251 ± 1.176
1.608MetAsn: 1.608 ± 0.466
1.286MetPro: 1.286 ± 0.712
0.322MetGln: 0.322 ± 0.168
1.286MetArg: 1.286 ± 0.506
4.502MetSer: 4.502 ± 0.432
1.608MetThr: 1.608 ± 0.466
1.608MetVal: 1.608 ± 1.157
0.0MetTrp: 0.0 ± 0.0
0.322MetTyr: 0.322 ± 0.168
0.0MetXaa: 0.0 ± 0.0
Asn
3.537AsnAla: 3.537 ± 0.428
0.643AsnCys: 0.643 ± 0.336
4.18AsnAsp: 4.18 ± 0.982
1.286AsnGlu: 1.286 ± 0.672
1.608AsnPhe: 1.608 ± 1.019
1.286AsnGly: 1.286 ± 0.672
1.286AsnHis: 1.286 ± 0.438
2.572AsnIle: 2.572 ± 0.077
3.859AsnLys: 3.859 ± 1.087
5.145AsnLeu: 5.145 ± 1.082
0.643AsnMet: 0.643 ± 0.557
2.251AsnAsn: 2.251 ± 0.661
2.894AsnPro: 2.894 ± 0.955
1.929AsnGln: 1.929 ± 0.591
1.929AsnArg: 1.929 ± 1.008
2.894AsnSer: 2.894 ± 0.94
2.894AsnThr: 2.894 ± 0.663
2.572AsnVal: 2.572 ± 0.077
0.965AsnTrp: 0.965 ± 0.504
1.929AsnTyr: 1.929 ± 0.379
0.0AsnXaa: 0.0 ± 0.0
Pro
1.608ProAla: 1.608 ± 1.157
1.286ProCys: 1.286 ± 0.672
3.537ProAsp: 3.537 ± 1.247
4.18ProGlu: 4.18 ± 1.566
2.251ProPhe: 2.251 ± 0.216
1.929ProGly: 1.929 ± 2.45
0.643ProHis: 0.643 ± 0.336
1.608ProIle: 1.608 ± 0.466
2.251ProLys: 2.251 ± 0.216
4.18ProLeu: 4.18 ± 0.805
0.643ProMet: 0.643 ± 0.623
0.322ProAsn: 0.322 ± 0.168
0.643ProPro: 0.643 ± 0.623
1.286ProGln: 1.286 ± 1.438
2.251ProArg: 2.251 ± 0.96
3.215ProSer: 3.215 ± 2.471
1.286ProThr: 1.286 ± 0.438
3.215ProVal: 3.215 ± 1.09
0.322ProTrp: 0.322 ± 0.168
0.643ProTyr: 0.643 ± 0.336
0.0ProXaa: 0.0 ± 0.0
Gln
2.894GlnAla: 2.894 ± 2.713
0.322GlnCys: 0.322 ± 0.168
1.286GlnAsp: 1.286 ± 0.672
3.537GlnGlu: 3.537 ± 1.253
0.965GlnPhe: 0.965 ± 0.504
2.572GlnGly: 2.572 ± 2.662
0.965GlnHis: 0.965 ± 0.504
1.608GlnIle: 1.608 ± 0.84
1.608GlnLys: 1.608 ± 1.019
3.215GlnLeu: 3.215 ± 0.7
1.929GlnMet: 1.929 ± 0.878
2.572GlnAsn: 2.572 ± 0.077
1.608GlnPro: 1.608 ± 1.019
0.965GlnGln: 0.965 ± 0.504
1.929GlnArg: 1.929 ± 0.547
2.894GlnSer: 2.894 ± 1.017
1.608GlnThr: 1.608 ± 0.524
1.929GlnVal: 1.929 ± 0.944
0.643GlnTrp: 0.643 ± 0.623
1.286GlnTyr: 1.286 ± 0.506
0.0GlnXaa: 0.0 ± 0.0
Arg
2.894ArgAla: 2.894 ± 1.627
1.286ArgCys: 1.286 ± 0.506
2.572ArgAsp: 2.572 ± 0.077
3.859ArgGlu: 3.859 ± 0.374
2.894ArgPhe: 2.894 ± 0.955
4.18ArgGly: 4.18 ± 1.757
1.286ArgHis: 1.286 ± 0.672
4.502ArgIle: 4.502 ± 1.726
4.18ArgLys: 4.18 ± 0.594
5.145ArgLeu: 5.145 ± 0.728
3.215ArgMet: 3.215 ± 1.363
2.894ArgAsn: 2.894 ± 1.471
1.608ArgPro: 1.608 ± 0.466
2.894ArgGln: 2.894 ± 1.417
1.929ArgArg: 1.929 ± 1.912
2.894ArgSer: 2.894 ± 0.808
2.894ArgThr: 2.894 ± 1.954
2.572ArgVal: 2.572 ± 0.077
0.643ArgTrp: 0.643 ± 0.336
1.286ArgTyr: 1.286 ± 0.672
0.0ArgXaa: 0.0 ± 0.0
Ser
5.788SerAla: 5.788 ± 1.643
0.965SerCys: 0.965 ± 0.504
5.788SerAsp: 5.788 ± 1.616
6.109SerGlu: 6.109 ± 2.35
5.788SerPhe: 5.788 ± 1.779
4.18SerGly: 4.18 ± 1.969
1.929SerHis: 1.929 ± 0.547
6.109SerIle: 6.109 ± 1.502
4.18SerLys: 4.18 ± 0.805
6.109SerLeu: 6.109 ± 1.189
1.929SerMet: 1.929 ± 1.116
2.894SerAsn: 2.894 ± 0.147
1.929SerPro: 1.929 ± 2.537
2.894SerGln: 2.894 ± 1.667
6.431SerArg: 6.431 ± 0.315
9.325SerSer: 9.325 ± 3.181
3.859SerThr: 3.859 ± 2.434
4.502SerVal: 4.502 ± 0.471
0.322SerTrp: 0.322 ± 0.168
3.537SerTyr: 3.537 ± 0.471
0.0SerXaa: 0.0 ± 0.0
Thr
1.608ThrAla: 1.608 ± 1.157
0.322ThrCys: 0.322 ± 0.168
1.286ThrAsp: 1.286 ± 0.438
2.894ThrGlu: 2.894 ± 0.663
3.859ThrPhe: 3.859 ± 0.638
3.859ThrGly: 3.859 ± 2.233
0.965ThrHis: 0.965 ± 0.504
2.894ThrIle: 2.894 ± 0.663
2.894ThrLys: 2.894 ± 1.627
5.788ThrLeu: 5.788 ± 0.294
1.929ThrMet: 1.929 ± 1.342
3.215ThrAsn: 3.215 ± 1.101
3.537ThrPro: 3.537 ± 1.253
1.608ThrGln: 1.608 ± 0.545
3.215ThrArg: 3.215 ± 1.101
1.929ThrSer: 1.929 ± 1.085
2.894ThrThr: 2.894 ± 1.017
3.215ThrVal: 3.215 ± 1.334
0.322ThrTrp: 0.322 ± 0.168
1.286ThrTyr: 1.286 ± 0.672
0.0ThrXaa: 0.0 ± 0.0
Val
3.537ValAla: 3.537 ± 2.304
0.322ValCys: 0.322 ± 0.168
2.894ValAsp: 2.894 ± 0.147
4.18ValGlu: 4.18 ± 0.391
1.608ValPhe: 1.608 ± 0.524
3.859ValGly: 3.859 ± 0.638
0.643ValHis: 0.643 ± 0.336
4.18ValIle: 4.18 ± 1.322
5.788ValLys: 5.788 ± 1.137
5.466ValLeu: 5.466 ± 1.468
2.572ValMet: 2.572 ± 0.818
3.537ValAsn: 3.537 ± 1.003
3.537ValPro: 3.537 ± 0.924
2.572ValGln: 2.572 ± 0.818
3.215ValArg: 3.215 ± 0.532
4.18ValSer: 4.18 ± 0.582
3.537ValThr: 3.537 ± 0.471
3.215ValVal: 3.215 ± 1.101
1.286ValTrp: 1.286 ± 0.506
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.965TrpAla: 0.965 ± 0.542
0.965TrpCys: 0.965 ± 0.504
1.929TrpAsp: 1.929 ± 0.547
1.929TrpGlu: 1.929 ± 1.008
0.643TrpPhe: 0.643 ± 0.336
0.965TrpGly: 0.965 ± 0.472
0.0TrpHis: 0.0 ± 0.0
0.322TrpIle: 0.322 ± 0.168
1.286TrpLys: 1.286 ± 0.672
0.965TrpLeu: 0.965 ± 0.542
0.0TrpMet: 0.0 ± 0.0
0.322TrpAsn: 0.322 ± 0.168
0.322TrpPro: 0.322 ± 0.168
0.0TrpGln: 0.0 ± 0.0
0.643TrpArg: 0.643 ± 0.623
1.286TrpSer: 1.286 ± 0.672
0.322TrpThr: 0.322 ± 0.168
0.965TrpVal: 0.965 ± 0.504
0.322TrpTrp: 0.322 ± 0.168
0.322TrpTyr: 0.322 ± 0.168
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.322TyrAla: 0.322 ± 0.674
0.965TyrCys: 0.965 ± 1.225
1.929TyrAsp: 1.929 ± 1.008
1.608TyrGlu: 1.608 ± 0.545
1.608TyrPhe: 1.608 ± 0.84
1.608TyrGly: 1.608 ± 0.84
0.965TyrHis: 0.965 ± 0.504
3.215TyrIle: 3.215 ± 1.679
3.537TyrLys: 3.537 ± 0.619
3.859TyrLeu: 3.859 ± 1.408
1.286TyrMet: 1.286 ± 0.506
2.251TyrAsn: 2.251 ± 0.216
1.286TyrPro: 1.286 ± 0.672
1.286TyrGln: 1.286 ± 0.672
2.251TyrArg: 2.251 ± 0.693
2.251TyrSer: 2.251 ± 1.176
0.643TyrThr: 0.643 ± 0.336
1.286TyrVal: 1.286 ± 0.672
0.643TyrTrp: 0.643 ± 0.336
1.286TyrTyr: 1.286 ± 1.247
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3111 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski