Amino acid dipepetide frequency for Nome phantom orthophasmavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.439AlaAla: 3.439 ± 1.42
1.911AlaCys: 1.911 ± 0.208
1.146AlaAsp: 1.146 ± 0.864
3.821AlaGlu: 3.821 ± 0.392
2.675AlaPhe: 2.675 ± 1.073
2.293AlaGly: 2.293 ± 2.818
1.911AlaHis: 1.911 ± 0.934
3.821AlaIle: 3.821 ± 0.882
3.439AlaLys: 3.439 ± 0.756
5.35AlaLeu: 5.35 ± 1.944
0.764AlaMet: 0.764 ± 0.442
4.585AlaAsn: 4.585 ± 0.789
0.764AlaPro: 0.764 ± 0.487
0.382AlaGln: 0.382 ± 0.221
2.293AlaArg: 2.293 ± 0.879
1.911AlaSer: 1.911 ± 1.106
3.057AlaThr: 3.057 ± 1.947
1.911AlaVal: 1.911 ± 0.687
0.764AlaTrp: 0.764 ± 0.442
1.528AlaTyr: 1.528 ± 0.263
0.382AlaXaa: 0.382 ± 0.221
Cys
2.293CysAla: 2.293 ± 0.879
0.0CysCys: 0.0 ± 0.0
1.146CysAsp: 1.146 ± 0.327
2.293CysGlu: 2.293 ± 2.186
0.0CysPhe: 0.0 ± 0.0
1.146CysGly: 1.146 ± 0.599
0.382CysHis: 0.382 ± 0.221
0.0CysIle: 0.0 ± 0.0
2.293CysLys: 2.293 ± 0.746
2.293CysLeu: 2.293 ± 1.198
0.0CysMet: 0.0 ± 0.0
1.911CysAsn: 1.911 ± 1.822
0.764CysPro: 0.764 ± 0.644
0.764CysGln: 0.764 ± 0.487
0.764CysArg: 0.764 ± 0.442
1.911CysSer: 1.911 ± 0.852
1.528CysThr: 1.528 ± 0.533
1.528CysVal: 1.528 ± 0.644
0.0CysTrp: 0.0 ± 0.0
1.146CysTyr: 1.146 ± 0.327
0.0CysXaa: 0.0 ± 0.0
Asp
2.293AspAla: 2.293 ± 0.895
1.528AspCys: 1.528 ± 0.533
5.35AspAsp: 5.35 ± 1.347
4.585AspGlu: 4.585 ± 0.158
3.057AspPhe: 3.057 ± 0.743
2.293AspGly: 2.293 ± 0.34
0.382AspHis: 0.382 ± 0.364
6.496AspIle: 6.496 ± 2.713
5.35AspLys: 5.35 ± 2.123
5.35AspLeu: 5.35 ± 0.134
2.293AspMet: 2.293 ± 0.736
5.35AspAsn: 5.35 ± 0.578
2.293AspPro: 2.293 ± 0.878
2.293AspGln: 2.293 ± 0.746
3.821AspArg: 3.821 ± 0.392
3.821AspSer: 3.821 ± 0.646
2.293AspThr: 2.293 ± 0.654
4.585AspVal: 4.585 ± 1.756
0.0AspTrp: 0.0 ± 0.0
3.439AspTyr: 3.439 ± 0.469
0.0AspXaa: 0.0 ± 0.0
Glu
2.293GluAla: 2.293 ± 0.947
0.764GluCys: 0.764 ± 0.729
3.821GluAsp: 3.821 ± 1.175
3.439GluGlu: 3.439 ± 0.42
3.439GluPhe: 3.439 ± 1.793
3.439GluGly: 3.439 ± 0.604
1.911GluHis: 1.911 ± 0.701
5.35GluIle: 5.35 ± 0.871
5.35GluLys: 5.35 ± 1.628
4.203GluLeu: 4.203 ± 1.326
1.911GluMet: 1.911 ± 0.934
3.439GluAsn: 3.439 ± 0.761
1.528GluPro: 1.528 ± 0.263
1.146GluGln: 1.146 ± 1.179
0.764GluArg: 0.764 ± 0.487
3.821GluSer: 3.821 ± 0.646
5.35GluThr: 5.35 ± 1.61
1.911GluVal: 1.911 ± 0.701
0.764GluTrp: 0.764 ± 0.442
2.293GluTyr: 2.293 ± 0.654
0.0GluXaa: 0.0 ± 0.0
Phe
3.439PheAla: 3.439 ± 0.42
0.764PheCys: 0.764 ± 0.644
3.057PheAsp: 3.057 ± 0.743
3.057PheGlu: 3.057 ± 0.681
0.382PhePhe: 0.382 ± 0.59
1.146PheGly: 1.146 ± 1.093
0.382PheHis: 0.382 ± 0.221
4.203PheIle: 4.203 ± 0.201
3.057PheLys: 3.057 ± 0.981
3.821PheLeu: 3.821 ± 0.992
0.764PheMet: 0.764 ± 0.266
2.293PheAsn: 2.293 ± 1.198
0.382PhePro: 0.382 ± 0.221
1.146PheGln: 1.146 ± 0.663
1.528PheArg: 1.528 ± 0.49
2.675PheSer: 2.675 ± 1.317
0.764PheThr: 0.764 ± 0.487
2.293PheVal: 2.293 ± 0.208
0.0PheTrp: 0.0 ± 0.0
3.821PheTyr: 3.821 ± 1.704
0.0PheXaa: 0.0 ± 0.0
Gly
1.911GlyAla: 1.911 ± 0.934
2.675GlyCys: 2.675 ± 1.635
3.439GlyAsp: 3.439 ± 1.157
2.293GlyGlu: 2.293 ± 0.878
1.146GlyPhe: 1.146 ± 1.093
1.146GlyGly: 1.146 ± 1.058
2.293GlyHis: 2.293 ± 0.208
1.528GlyIle: 1.528 ± 0.885
4.203GlyLys: 4.203 ± 0.594
4.968GlyLeu: 4.968 ± 1.563
1.911GlyMet: 1.911 ± 0.424
2.293GlyAsn: 2.293 ± 0.34
1.528GlyPro: 1.528 ± 0.556
0.382GlyGln: 0.382 ± 0.59
0.382GlyArg: 0.382 ± 0.221
2.293GlySer: 2.293 ± 1.442
3.439GlyThr: 3.439 ± 0.761
3.057GlyVal: 3.057 ± 0.399
0.382GlyTrp: 0.382 ± 0.221
3.057GlyTyr: 3.057 ± 0.87
0.0GlyXaa: 0.0 ± 0.0
His
0.382HisAla: 0.382 ± 0.221
0.382HisCys: 0.382 ± 0.221
0.764HisAsp: 0.764 ± 0.266
1.911HisGlu: 1.911 ± 0.687
1.146HisPhe: 1.146 ± 0.327
2.293HisGly: 2.293 ± 0.654
0.0HisHis: 0.0 ± 0.0
2.293HisIle: 2.293 ± 0.746
2.293HisLys: 2.293 ± 0.736
2.675HisLeu: 2.675 ± 1.073
0.382HisMet: 0.382 ± 0.364
1.911HisAsn: 1.911 ± 0.208
0.0HisPro: 0.0 ± 0.0
0.764HisGln: 0.764 ± 0.442
0.0HisArg: 0.0 ± 0.0
1.528HisSer: 1.528 ± 0.49
1.528HisThr: 1.528 ± 0.885
0.382HisVal: 0.382 ± 0.59
0.0HisTrp: 0.0 ± 0.0
0.764HisTyr: 0.764 ± 0.487
0.0HisXaa: 0.0 ± 0.0
Ile
4.585IleAla: 4.585 ± 0.679
1.146IleCys: 1.146 ± 0.599
4.968IleAsp: 4.968 ± 1.837
6.114IleGlu: 6.114 ± 0.504
1.528IlePhe: 1.528 ± 1.005
4.203IleGly: 4.203 ± 0.996
2.293IleHis: 2.293 ± 1.327
4.968IleIle: 4.968 ± 1.244
5.35IleLys: 5.35 ± 1.147
5.35IleLeu: 5.35 ± 1.378
2.293IleMet: 2.293 ± 0.654
6.496IleAsn: 6.496 ± 1.392
1.146IlePro: 1.146 ± 0.663
0.764IleGln: 0.764 ± 0.266
1.911IleArg: 1.911 ± 0.687
5.35IleSer: 5.35 ± 1.165
6.114IleThr: 6.114 ± 1.045
5.732IleVal: 5.732 ± 1.472
1.911IleTrp: 1.911 ± 0.852
3.821IleTyr: 3.821 ± 1.374
0.382IleXaa: 0.382 ± 0.221
Lys
3.057LysAla: 3.057 ± 0.681
2.293LysCys: 2.293 ± 0.799
6.496LysAsp: 6.496 ± 2.309
3.439LysGlu: 3.439 ± 1.42
3.439LysPhe: 3.439 ± 0.761
1.911LysGly: 1.911 ± 0.208
0.382LysHis: 0.382 ± 0.221
5.35LysIle: 5.35 ± 0.578
8.024LysLys: 8.024 ± 1.188
8.024LysLeu: 8.024 ± 2.255
2.675LysMet: 2.675 ± 2.031
5.732LysAsn: 5.732 ± 1.126
2.675LysPro: 2.675 ± 0.556
1.146LysGln: 1.146 ± 0.439
2.675LysArg: 2.675 ± 1.317
8.789LysSer: 8.789 ± 1.274
5.35LysThr: 5.35 ± 1.068
8.407LysVal: 8.407 ± 2.289
0.382LysTrp: 0.382 ± 0.364
4.968LysTyr: 4.968 ± 0.867
0.0LysXaa: 0.0 ± 0.0
Leu
5.732LeuAla: 5.732 ± 2.367
1.528LeuCys: 1.528 ± 0.885
3.821LeuAsp: 3.821 ± 0.646
3.057LeuGlu: 3.057 ± 0.743
4.203LeuPhe: 4.203 ± 0.201
6.114LeuGly: 6.114 ± 1.09
1.911LeuHis: 1.911 ± 0.554
6.878LeuIle: 6.878 ± 1.04
6.496LeuLys: 6.496 ± 1.914
8.789LeuLeu: 8.789 ± 1.336
3.439LeuMet: 3.439 ± 0.593
6.114LeuAsn: 6.114 ± 0.391
3.821LeuPro: 3.821 ± 1.134
3.821LeuGln: 3.821 ± 1.695
4.203LeuArg: 4.203 ± 0.594
7.26LeuSer: 7.26 ± 0.726
7.26LeuThr: 7.26 ± 1.915
3.821LeuVal: 3.821 ± 0.511
0.382LeuTrp: 0.382 ± 0.221
2.293LeuTyr: 2.293 ± 1.327
0.382LeuXaa: 0.382 ± 0.221
Met
1.911MetAla: 1.911 ± 1.538
0.0MetCys: 0.0 ± 0.0
1.911MetAsp: 1.911 ± 0.687
2.293MetGlu: 2.293 ± 0.879
0.764MetPhe: 0.764 ± 0.487
0.0MetGly: 0.0 ± 0.0
0.382MetHis: 0.382 ± 0.221
1.528MetIle: 1.528 ± 0.955
2.293MetLys: 2.293 ± 0.878
3.821MetLeu: 3.821 ± 0.392
0.382MetMet: 0.382 ± 0.221
2.675MetAsn: 2.675 ± 0.556
3.057MetPro: 3.057 ± 0.526
0.382MetGln: 0.382 ± 0.221
1.911MetArg: 1.911 ± 0.852
3.057MetSer: 3.057 ± 1.22
2.675MetThr: 2.675 ± 0.556
1.146MetVal: 1.146 ± 0.327
0.0MetTrp: 0.0 ± 0.0
0.764MetTyr: 0.764 ± 0.442
0.0MetXaa: 0.0 ± 0.0
Asn
1.911AsnAla: 1.911 ± 0.701
0.764AsnCys: 0.764 ± 0.266
5.35AsnAsp: 5.35 ± 0.578
4.585AsnGlu: 4.585 ± 1.452
2.293AsnPhe: 2.293 ± 0.34
3.439AsnGly: 3.439 ± 1.796
1.146AsnHis: 1.146 ± 0.473
5.35AsnIle: 5.35 ± 0.863
8.024AsnLys: 8.024 ± 0.938
4.203AsnLeu: 4.203 ± 0.518
3.057AsnMet: 3.057 ± 0.399
2.675AsnAsn: 2.675 ± 1.548
0.382AsnPro: 0.382 ± 0.59
1.146AsnGln: 1.146 ± 0.599
1.528AsnArg: 1.528 ± 0.49
5.732AsnSer: 5.732 ± 0.79
4.968AsnThr: 4.968 ± 1.589
4.203AsnVal: 4.203 ± 1.191
0.764AsnTrp: 0.764 ± 0.442
4.968AsnTyr: 4.968 ± 1.714
0.0AsnXaa: 0.0 ± 0.0
Pro
1.528ProAla: 1.528 ± 0.556
0.764ProCys: 0.764 ± 0.266
1.146ProAsp: 1.146 ± 0.473
1.146ProGlu: 1.146 ± 0.473
1.528ProPhe: 1.528 ± 0.49
1.146ProGly: 1.146 ± 0.473
0.764ProHis: 0.764 ± 0.487
1.528ProIle: 1.528 ± 0.644
3.439ProLys: 3.439 ± 1.793
3.057ProLeu: 3.057 ± 1.22
0.382ProMet: 0.382 ± 0.221
1.528ProAsn: 1.528 ± 0.49
1.528ProPro: 1.528 ± 1.643
0.764ProGln: 0.764 ± 0.442
0.382ProArg: 0.382 ± 0.221
1.528ProSer: 1.528 ± 0.533
2.293ProThr: 2.293 ± 0.34
1.911ProVal: 1.911 ± 0.424
0.0ProTrp: 0.0 ± 0.0
1.528ProTyr: 1.528 ± 0.556
0.382ProXaa: 0.382 ± 0.221
Gln
1.528GlnAla: 1.528 ± 0.556
0.382GlnCys: 0.382 ± 0.221
1.146GlnAsp: 1.146 ± 0.473
0.382GlnGlu: 0.382 ± 0.221
0.764GlnPhe: 0.764 ± 0.266
1.146GlnGly: 1.146 ± 0.473
0.382GlnHis: 0.382 ± 0.364
1.146GlnIle: 1.146 ± 0.327
1.911GlnLys: 1.911 ± 0.949
1.911GlnLeu: 1.911 ± 1.106
1.146GlnMet: 1.146 ± 0.777
1.911GlnAsn: 1.911 ± 0.687
0.764GlnPro: 0.764 ± 0.442
1.146GlnGln: 1.146 ± 0.663
1.528GlnArg: 1.528 ± 0.644
3.057GlnSer: 3.057 ± 0.681
2.675GlnThr: 2.675 ± 0.556
1.911GlnVal: 1.911 ± 0.945
0.0GlnTrp: 0.0 ± 0.0
0.764GlnTyr: 0.764 ± 0.487
0.382GlnXaa: 0.382 ± 0.221
Arg
1.528ArgAla: 1.528 ± 1.005
0.382ArgCys: 0.382 ± 0.364
2.675ArgAsp: 2.675 ± 0.674
0.764ArgGlu: 0.764 ± 0.487
3.057ArgPhe: 3.057 ± 0.743
2.293ArgGly: 2.293 ± 1.327
0.382ArgHis: 0.382 ± 0.221
2.675ArgIle: 2.675 ± 0.534
2.675ArgLys: 2.675 ± 1.073
3.057ArgLeu: 3.057 ± 0.916
1.146ArgMet: 1.146 ± 0.473
1.528ArgAsn: 1.528 ± 0.533
0.764ArgPro: 0.764 ± 0.442
0.382ArgGln: 0.382 ± 0.221
1.528ArgArg: 1.528 ± 0.556
2.675ArgSer: 2.675 ± 0.805
3.821ArgThr: 3.821 ± 0.848
1.146ArgVal: 1.146 ± 0.327
0.0ArgTrp: 0.0 ± 0.0
2.293ArgTyr: 2.293 ± 0.947
0.0ArgXaa: 0.0 ± 0.0
Ser
4.203SerAla: 4.203 ± 2.432
2.293SerCys: 2.293 ± 1.286
6.496SerAsp: 6.496 ± 0.72
5.732SerGlu: 5.732 ± 1.152
2.675SerPhe: 2.675 ± 1.095
3.439SerGly: 3.439 ± 1.317
1.528SerHis: 1.528 ± 0.955
6.114SerIle: 6.114 ± 2.323
5.732SerLys: 5.732 ± 1.126
5.35SerLeu: 5.35 ± 1.147
3.821SerMet: 3.821 ± 1.108
3.057SerAsn: 3.057 ± 0.399
0.382SerPro: 0.382 ± 0.221
4.968SerGln: 4.968 ± 0.937
2.675SerArg: 2.675 ± 0.689
3.821SerSer: 3.821 ± 0.848
3.439SerThr: 3.439 ± 0.469
4.968SerVal: 4.968 ± 0.867
0.0SerTrp: 0.0 ± 0.0
3.439SerTyr: 3.439 ± 1.796
0.0SerXaa: 0.0 ± 0.0
Thr
1.528ThrAla: 1.528 ± 0.263
1.146ThrCys: 1.146 ± 0.599
4.968ThrAsp: 4.968 ± 1.244
2.293ThrGlu: 2.293 ± 0.895
3.439ThrPhe: 3.439 ± 0.761
3.057ThrGly: 3.057 ± 1.266
2.293ThrHis: 2.293 ± 0.208
6.496ThrIle: 6.496 ± 2.118
6.496ThrLys: 6.496 ± 1.425
6.878ThrLeu: 6.878 ± 0.603
1.911ThrMet: 1.911 ± 0.208
4.585ThrAsn: 4.585 ± 0.679
1.911ThrPro: 1.911 ± 0.852
1.146ThrGln: 1.146 ± 0.439
4.203ThrArg: 4.203 ± 0.545
4.585ThrSer: 4.585 ± 1.598
2.675ThrThr: 2.675 ± 0.804
3.439ThrVal: 3.439 ± 0.42
0.382ThrTrp: 0.382 ± 0.221
3.057ThrTyr: 3.057 ± 0.87
0.0ThrXaa: 0.0 ± 0.0
Val
2.675ValAla: 2.675 ± 1.317
1.911ValCys: 1.911 ± 0.208
3.821ValAsp: 3.821 ± 0.992
2.675ValGlu: 2.675 ± 0.067
1.911ValPhe: 1.911 ± 0.554
1.528ValGly: 1.528 ± 1.159
1.528ValHis: 1.528 ± 0.885
4.585ValIle: 4.585 ± 1.354
4.585ValLys: 4.585 ± 0.416
6.878ValLeu: 6.878 ± 2.207
0.764ValMet: 0.764 ± 0.442
2.293ValAsn: 2.293 ± 0.34
2.675ValPro: 2.675 ± 1.073
0.764ValGln: 0.764 ± 0.266
1.911ValArg: 1.911 ± 1.106
4.968ValSer: 4.968 ± 1.714
4.968ValThr: 4.968 ± 0.756
3.057ValVal: 3.057 ± 1.22
0.382ValTrp: 0.382 ± 0.59
3.821ValTyr: 3.821 ± 1.374
0.764ValXaa: 0.764 ± 0.266
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.146TrpAsp: 1.146 ± 0.599
0.382TrpGlu: 0.382 ± 0.364
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.146TrpIle: 1.146 ± 0.327
0.0TrpLys: 0.0 ± 0.0
1.528TrpLeu: 1.528 ± 0.533
0.0TrpMet: 0.0 ± 0.0
0.764TrpAsn: 0.764 ± 0.442
0.382TrpPro: 0.382 ± 0.221
0.382TrpGln: 0.382 ± 0.59
0.382TrpArg: 0.382 ± 0.221
0.382TrpSer: 0.382 ± 0.221
0.0TrpThr: 0.0 ± 0.0
0.382TrpVal: 0.382 ± 0.221
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.911TyrAla: 1.911 ± 1.316
1.528TyrCys: 1.528 ± 1.458
4.203TyrAsp: 4.203 ± 1.581
2.675TyrGlu: 2.675 ± 0.805
1.146TyrPhe: 1.146 ± 0.599
2.293TyrGly: 2.293 ± 0.895
1.146TyrHis: 1.146 ± 0.327
4.203TyrIle: 4.203 ± 0.91
4.203TyrLys: 4.203 ± 0.594
3.439TyrLeu: 3.439 ± 1.246
1.528TyrMet: 1.528 ± 0.263
4.968TyrAsn: 4.968 ± 0.756
1.528TyrPro: 1.528 ± 0.49
2.293TyrGln: 2.293 ± 0.879
0.382TyrArg: 0.382 ± 0.221
4.585TyrSer: 4.585 ± 1.13
2.293TyrThr: 2.293 ± 0.746
2.675TyrVal: 2.675 ± 1.548
0.764TyrTrp: 0.764 ± 0.729
1.528TyrTyr: 1.528 ± 0.955
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.382XaaPhe: 0.382 ± 0.221
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.764XaaIle: 0.764 ± 0.266
0.0XaaLys: 0.0 ± 0.0
0.764XaaLeu: 0.764 ± 0.442
0.0XaaMet: 0.0 ± 0.0
0.382XaaAsn: 0.382 ± 0.221
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.382XaaSer: 0.382 ± 0.221
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
3.821XaaXaa: 3.821 ± 1.125
Statistics based on 3 proteins (2618 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski