Amino acid dipepetide frequency for Beihai sipunculid worm virus 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.508AlaAla: 7.508 ± 4.619
1.669AlaCys: 1.669 ± 0.28
3.615AlaAsp: 3.615 ± 0.913
4.449AlaGlu: 4.449 ± 0.54
1.112AlaPhe: 1.112 ± 0.344
5.006AlaGly: 5.006 ± 4.98
1.669AlaHis: 1.669 ± 0.699
4.727AlaIle: 4.727 ± 1.401
3.337AlaLys: 3.337 ± 1.16
5.562AlaLeu: 5.562 ± 3.122
2.225AlaMet: 2.225 ± 0.716
1.112AlaAsn: 1.112 ± 0.892
2.225AlaPro: 2.225 ± 0.467
3.337AlaGln: 3.337 ± 0.826
3.337AlaArg: 3.337 ± 1.023
3.059AlaSer: 3.059 ± 0.996
3.337AlaThr: 3.337 ± 0.881
5.84AlaVal: 5.84 ± 0.687
0.834AlaTrp: 0.834 ± 0.278
2.225AlaTyr: 2.225 ± 0.646
0.0AlaXaa: 0.0 ± 0.0
Cys
0.556CysAla: 0.556 ± 0.287
0.0CysCys: 0.0 ± 0.0
0.556CysAsp: 0.556 ± 0.405
1.39CysGlu: 1.39 ± 0.501
1.669CysPhe: 1.669 ± 0.767
1.39CysGly: 1.39 ± 0.756
0.556CysHis: 0.556 ± 0.303
1.669CysIle: 1.669 ± 0.767
1.112CysLys: 1.112 ± 0.419
1.39CysLeu: 1.39 ± 0.603
0.0CysMet: 0.0 ± 0.0
0.278CysAsn: 0.278 ± 0.151
1.112CysPro: 1.112 ± 0.349
0.0CysGln: 0.0 ± 0.0
1.947CysArg: 1.947 ± 0.789
2.503CysSer: 2.503 ± 1.029
0.834CysThr: 0.834 ± 0.454
1.669CysVal: 1.669 ± 0.28
0.278CysTrp: 0.278 ± 0.151
0.834CysTyr: 0.834 ± 0.383
0.0CysXaa: 0.0 ± 0.0
Asp
3.059AspAla: 3.059 ± 0.946
0.556AspCys: 0.556 ± 0.303
4.449AspAsp: 4.449 ± 1.088
4.449AspGlu: 4.449 ± 0.752
2.781AspPhe: 2.781 ± 0.735
2.781AspGly: 2.781 ± 0.879
1.39AspHis: 1.39 ± 0.452
2.225AspIle: 2.225 ± 0.431
4.171AspLys: 4.171 ± 1.688
3.615AspLeu: 3.615 ± 1.027
1.39AspMet: 1.39 ± 0.756
1.947AspAsn: 1.947 ± 1.059
3.893AspPro: 3.893 ± 1.239
1.947AspGln: 1.947 ± 0.607
2.503AspArg: 2.503 ± 1.751
2.225AspSer: 2.225 ± 0.866
3.893AspThr: 3.893 ± 1.213
3.893AspVal: 3.893 ± 1.016
0.556AspTrp: 0.556 ± 0.303
1.947AspTyr: 1.947 ± 0.825
0.0AspXaa: 0.0 ± 0.0
Glu
3.893GluAla: 3.893 ± 1.311
1.947GluCys: 1.947 ± 0.354
4.727GluAsp: 4.727 ± 1.442
5.284GluGlu: 5.284 ± 1.106
1.669GluPhe: 1.669 ± 0.908
3.893GluGly: 3.893 ± 1.239
1.39GluHis: 1.39 ± 0.452
3.337GluIle: 3.337 ± 0.895
4.449GluLys: 4.449 ± 0.752
7.508GluLeu: 7.508 ± 0.476
1.112GluMet: 1.112 ± 0.605
3.615GluAsn: 3.615 ± 0.927
3.059GluPro: 3.059 ± 1.289
2.225GluGln: 2.225 ± 0.646
2.503GluArg: 2.503 ± 1.361
5.284GluSer: 5.284 ± 1.626
4.171GluThr: 4.171 ± 0.919
2.781GluVal: 2.781 ± 0.615
3.059GluTrp: 3.059 ± 1.329
2.503GluTyr: 2.503 ± 0.597
0.0GluXaa: 0.0 ± 0.0
Phe
2.781PheAla: 2.781 ± 1.117
1.112PheCys: 1.112 ± 0.809
3.615PheAsp: 3.615 ± 1.296
5.006PheGlu: 5.006 ± 0.968
1.669PhePhe: 1.669 ± 0.609
1.669PheGly: 1.669 ± 1.252
1.39PheHis: 1.39 ± 0.545
2.225PheIle: 2.225 ± 2.439
2.781PheLys: 2.781 ± 0.904
3.893PheLeu: 3.893 ± 1.047
0.556PheMet: 0.556 ± 0.287
1.669PheAsn: 1.669 ± 0.609
1.112PhePro: 1.112 ± 0.419
2.503PheGln: 2.503 ± 0.406
2.781PheArg: 2.781 ± 0.808
2.503PheSer: 2.503 ± 0.378
1.947PheThr: 1.947 ± 0.354
1.39PheVal: 1.39 ± 0.278
0.278PheTrp: 0.278 ± 0.476
1.112PheTyr: 1.112 ± 0.809
0.0PheXaa: 0.0 ± 0.0
Gly
3.059GlyAla: 3.059 ± 0.892
1.112GlyCys: 1.112 ± 0.574
4.171GlyAsp: 4.171 ± 1.102
5.006GlyGlu: 5.006 ± 1.547
3.893GlyPhe: 3.893 ± 1.565
6.396GlyGly: 6.396 ± 5.239
0.556GlyHis: 0.556 ± 0.303
3.337GlyIle: 3.337 ± 1.961
4.171GlyLys: 4.171 ± 1.875
3.337GlyLeu: 3.337 ± 2.084
1.669GlyMet: 1.669 ± 0.695
3.337GlyAsn: 3.337 ± 0.477
3.337GlyPro: 3.337 ± 0.447
2.503GlyGln: 2.503 ± 0.911
1.669GlyArg: 1.669 ± 0.28
4.727GlySer: 4.727 ± 2.119
4.727GlyThr: 4.727 ± 1.767
2.503GlyVal: 2.503 ± 0.597
0.556GlyTrp: 0.556 ± 0.303
1.39GlyTyr: 1.39 ± 0.452
0.0GlyXaa: 0.0 ± 0.0
His
1.112HisAla: 1.112 ± 0.419
0.278HisCys: 0.278 ± 0.151
1.947HisAsp: 1.947 ± 0.717
0.834HisGlu: 0.834 ± 0.454
0.278HisPhe: 0.278 ± 0.365
2.503HisGly: 2.503 ± 0.658
0.556HisHis: 0.556 ± 0.303
1.947HisIle: 1.947 ± 0.789
1.669HisLys: 1.669 ± 0.908
3.337HisLeu: 3.337 ± 1.048
0.278HisMet: 0.278 ± 0.151
1.112HisAsn: 1.112 ± 0.727
1.947HisPro: 1.947 ± 0.804
1.112HisGln: 1.112 ± 0.419
2.781HisArg: 2.781 ± 0.879
0.834HisSer: 0.834 ± 0.278
1.669HisThr: 1.669 ± 0.609
0.834HisVal: 0.834 ± 0.454
0.0HisTrp: 0.0 ± 0.0
0.834HisTyr: 0.834 ± 0.278
0.0HisXaa: 0.0 ± 0.0
Ile
3.615IleAla: 3.615 ± 0.835
0.556IleCys: 0.556 ± 0.405
1.669IleAsp: 1.669 ± 0.556
4.171IleGlu: 4.171 ± 0.814
2.781IlePhe: 2.781 ± 0.467
0.834IleGly: 0.834 ± 0.454
1.39IleHis: 1.39 ± 0.278
2.225IleIle: 2.225 ± 0.687
3.615IleLys: 3.615 ± 1.106
4.727IleLeu: 4.727 ± 1.763
0.556IleMet: 0.556 ± 0.287
1.947IleAsn: 1.947 ± 1.312
2.503IlePro: 2.503 ± 1.029
1.39IleGln: 1.39 ± 0.501
1.669IleArg: 1.669 ± 0.28
2.781IleSer: 2.781 ± 1.568
2.503IleThr: 2.503 ± 1.256
4.171IleVal: 4.171 ± 0.866
0.834IleTrp: 0.834 ± 1.427
3.615IleTyr: 3.615 ± 0.62
0.0IleXaa: 0.0 ± 0.0
Lys
3.615LysAla: 3.615 ± 0.992
1.112LysCys: 1.112 ± 1.342
3.337LysAsp: 3.337 ± 1.365
5.562LysGlu: 5.562 ± 0.762
3.615LysPhe: 3.615 ± 1.34
3.615LysGly: 3.615 ± 0.835
1.669LysHis: 1.669 ± 0.58
3.893LysIle: 3.893 ± 0.972
5.006LysLys: 5.006 ± 0.964
4.449LysLeu: 4.449 ± 0.734
2.503LysMet: 2.503 ± 0.641
3.893LysAsn: 3.893 ± 0.548
2.225LysPro: 2.225 ± 0.467
5.562LysGln: 5.562 ± 3.403
3.337LysArg: 3.337 ± 0.795
3.893LysSer: 3.893 ± 1.047
4.449LysThr: 4.449 ± 1.732
6.674LysVal: 6.674 ± 2.02
0.834LysTrp: 0.834 ± 0.383
2.225LysTyr: 2.225 ± 0.872
0.0LysXaa: 0.0 ± 0.0
Leu
8.065LeuAla: 8.065 ± 0.688
2.503LeuCys: 2.503 ± 0.611
3.615LeuAsp: 3.615 ± 0.608
3.893LeuGlu: 3.893 ± 0.605
2.503LeuPhe: 2.503 ± 0.67
5.006LeuGly: 5.006 ± 2.214
3.893LeuHis: 3.893 ± 0.972
3.059LeuIle: 3.059 ± 1.105
6.118LeuLys: 6.118 ± 1.247
4.171LeuLeu: 4.171 ± 0.919
2.225LeuMet: 2.225 ± 0.859
5.006LeuAsn: 5.006 ± 3.486
3.893LeuPro: 3.893 ± 1.652
4.727LeuGln: 4.727 ± 1.993
4.449LeuArg: 4.449 ± 0.764
4.171LeuSer: 4.171 ± 1.713
6.396LeuThr: 6.396 ± 1.694
4.727LeuVal: 4.727 ± 1.434
1.39LeuTrp: 1.39 ± 0.278
2.225LeuTyr: 2.225 ± 0.705
0.0LeuXaa: 0.0 ± 0.0
Met
0.834MetAla: 0.834 ± 0.64
0.556MetCys: 0.556 ± 0.303
1.947MetAsp: 1.947 ± 0.789
1.947MetGlu: 1.947 ± 0.607
1.669MetPhe: 1.669 ± 0.556
1.39MetGly: 1.39 ± 0.452
0.556MetHis: 0.556 ± 0.303
0.278MetIle: 0.278 ± 0.476
2.503MetLys: 2.503 ± 0.641
2.503MetLeu: 2.503 ± 1.15
1.112MetMet: 1.112 ± 0.419
0.834MetAsn: 0.834 ± 0.64
1.39MetPro: 1.39 ± 1.054
1.112MetGln: 1.112 ± 0.344
2.225MetArg: 2.225 ± 0.866
2.503MetSer: 2.503 ± 0.834
1.112MetThr: 1.112 ± 0.419
0.556MetVal: 0.556 ± 0.303
0.278MetTrp: 0.278 ± 0.476
0.834MetTyr: 0.834 ± 0.278
0.0MetXaa: 0.0 ± 0.0
Asn
1.669AsnAla: 1.669 ± 0.616
1.669AsnCys: 1.669 ± 0.609
1.112AsnAsp: 1.112 ± 0.349
2.503AsnGlu: 2.503 ± 0.378
1.39AsnPhe: 1.39 ± 0.773
2.225AsnGly: 2.225 ± 1.149
0.834AsnHis: 0.834 ± 0.383
2.781AsnIle: 2.781 ± 1.305
3.615AsnLys: 3.615 ± 0.992
3.337AsnLeu: 3.337 ± 2.219
0.834AsnMet: 0.834 ± 0.278
2.225AsnAsn: 2.225 ± 0.687
2.781AsnPro: 2.781 ± 1.434
1.947AsnGln: 1.947 ± 0.613
2.225AsnArg: 2.225 ± 1.396
2.503AsnSer: 2.503 ± 0.406
1.947AsnThr: 1.947 ± 0.636
4.449AsnVal: 4.449 ± 0.764
1.112AsnTrp: 1.112 ± 0.419
3.337AsnTyr: 3.337 ± 2.219
0.0AsnXaa: 0.0 ± 0.0
Pro
4.171ProAla: 4.171 ± 1.484
0.0ProCys: 0.0 ± 0.0
2.225ProAsp: 2.225 ± 0.467
3.337ProGlu: 3.337 ± 1.023
0.834ProPhe: 0.834 ± 1.427
2.503ProGly: 2.503 ± 0.711
1.39ProHis: 1.39 ± 0.773
2.225ProIle: 2.225 ± 0.467
1.669ProLys: 1.669 ± 0.64
6.118ProLeu: 6.118 ± 1.41
0.834ProMet: 0.834 ± 0.87
1.947ProAsn: 1.947 ± 0.354
2.503ProPro: 2.503 ± 1.912
1.669ProGln: 1.669 ± 0.98
2.503ProArg: 2.503 ± 1.15
4.449ProSer: 4.449 ± 1.639
1.947ProThr: 1.947 ± 0.717
3.615ProVal: 3.615 ± 2.497
0.556ProTrp: 0.556 ± 0.287
1.947ProTyr: 1.947 ± 0.717
0.0ProXaa: 0.0 ± 0.0
Gln
2.781GlnAla: 2.781 ± 0.556
1.39GlnCys: 1.39 ± 0.501
2.503GlnAsp: 2.503 ± 1.225
3.059GlnGlu: 3.059 ± 0.741
1.947GlnPhe: 1.947 ± 0.84
3.893GlnGly: 3.893 ± 1.157
1.669GlnHis: 1.669 ± 0.556
2.503GlnIle: 2.503 ± 0.911
3.615GlnLys: 3.615 ± 0.697
3.893GlnLeu: 3.893 ± 1.016
0.834GlnMet: 0.834 ± 0.639
2.225GlnAsn: 2.225 ± 2.549
2.503GlnPro: 2.503 ± 0.597
2.781GlnGln: 2.781 ± 1.286
1.669GlnArg: 1.669 ± 0.64
3.059GlnSer: 3.059 ± 2.062
2.503GlnThr: 2.503 ± 0.834
3.893GlnVal: 3.893 ± 0.881
0.556GlnTrp: 0.556 ± 0.303
1.39GlnTyr: 1.39 ± 0.617
0.0GlnXaa: 0.0 ± 0.0
Arg
4.449ArgAla: 4.449 ± 0.752
0.834ArgCys: 0.834 ± 0.383
2.225ArgAsp: 2.225 ± 0.687
2.781ArgGlu: 2.781 ± 0.719
3.059ArgPhe: 3.059 ± 2.496
3.337ArgGly: 3.337 ± 0.826
1.39ArgHis: 1.39 ± 0.501
1.112ArgIle: 1.112 ± 0.344
5.284ArgLys: 5.284 ± 1.105
5.284ArgLeu: 5.284 ± 1.709
1.39ArgMet: 1.39 ± 0.773
2.225ArgAsn: 2.225 ± 0.467
0.834ArgPro: 0.834 ± 0.454
1.947ArgGln: 1.947 ± 0.354
4.171ArgArg: 4.171 ± 1.717
3.059ArgSer: 3.059 ± 1.105
2.503ArgThr: 2.503 ± 1.193
2.781ArgVal: 2.781 ± 0.653
0.834ArgTrp: 0.834 ± 0.278
0.834ArgTyr: 0.834 ± 0.461
0.0ArgXaa: 0.0 ± 0.0
Ser
3.337SerAla: 3.337 ± 1.386
0.556SerCys: 0.556 ± 0.303
2.225SerAsp: 2.225 ± 0.839
3.893SerGlu: 3.893 ± 1.061
3.893SerPhe: 3.893 ± 1.466
4.449SerGly: 4.449 ± 1.938
1.112SerHis: 1.112 ± 0.349
2.781SerIle: 2.781 ± 0.381
4.727SerLys: 4.727 ± 1.484
4.449SerLeu: 4.449 ± 1.908
3.059SerMet: 3.059 ± 0.639
2.781SerAsn: 2.781 ± 0.925
3.059SerPro: 3.059 ± 1.105
4.449SerGln: 4.449 ± 2.703
1.669SerArg: 1.669 ± 0.767
6.674SerSer: 6.674 ± 1.37
6.118SerThr: 6.118 ± 0.801
4.171SerVal: 4.171 ± 0.866
0.0SerTrp: 0.0 ± 0.0
1.669SerTyr: 1.669 ± 1.053
0.0SerXaa: 0.0 ± 0.0
Thr
5.562ThrAla: 5.562 ± 2.415
1.39ThrCys: 1.39 ± 0.773
2.503ThrAsp: 2.503 ± 0.916
4.727ThrGlu: 4.727 ± 1.442
2.225ThrPhe: 2.225 ± 1.21
3.337ThrGly: 3.337 ± 1.16
1.669ThrHis: 1.669 ± 0.58
2.503ThrIle: 2.503 ± 0.789
5.006ThrLys: 5.006 ± 1.476
3.615ThrLeu: 3.615 ± 0.835
1.669ThrMet: 1.669 ± 0.454
2.225ThrAsn: 2.225 ± 0.705
3.615ThrPro: 3.615 ± 0.697
3.337ThrGln: 3.337 ± 1.021
2.781ThrArg: 2.781 ± 0.808
4.449ThrSer: 4.449 ± 1.43
4.449ThrThr: 4.449 ± 1.75
4.727ThrVal: 4.727 ± 0.837
0.556ThrTrp: 0.556 ± 0.303
3.059ThrTyr: 3.059 ± 0.725
0.0ThrXaa: 0.0 ± 0.0
Val
3.893ValAla: 3.893 ± 0.972
1.112ValCys: 1.112 ± 0.605
3.893ValAsp: 3.893 ± 1.225
3.893ValGlu: 3.893 ± 1.239
2.503ValPhe: 2.503 ± 1.741
3.893ValGly: 3.893 ± 1.466
1.669ValHis: 1.669 ± 0.699
2.503ValIle: 2.503 ± 0.914
5.284ValLys: 5.284 ± 0.893
5.562ValLeu: 5.562 ± 2.022
1.669ValMet: 1.669 ± 0.28
2.781ValAsn: 2.781 ± 0.904
2.503ValPro: 2.503 ± 1.675
3.893ValGln: 3.893 ± 0.864
3.059ValArg: 3.059 ± 0.878
4.727ValSer: 4.727 ± 1.335
4.727ValThr: 4.727 ± 0.362
3.615ValVal: 3.615 ± 0.835
0.834ValTrp: 0.834 ± 0.461
3.337ValTyr: 3.337 ± 0.991
0.0ValXaa: 0.0 ± 0.0
Trp
0.556TrpAla: 0.556 ± 0.287
0.834TrpCys: 0.834 ± 0.383
1.39TrpAsp: 1.39 ± 0.642
0.556TrpGlu: 0.556 ± 0.303
0.278TrpPhe: 0.278 ± 0.151
0.834TrpGly: 0.834 ± 0.454
0.0TrpHis: 0.0 ± 0.0
0.278TrpIle: 0.278 ± 0.151
1.39TrpLys: 1.39 ± 0.642
1.669TrpLeu: 1.669 ± 0.28
0.834TrpMet: 0.834 ± 0.454
0.556TrpAsn: 0.556 ± 0.303
0.556TrpPro: 0.556 ± 0.405
0.278TrpGln: 0.278 ± 0.476
1.39TrpArg: 1.39 ± 0.787
1.112TrpSer: 1.112 ± 0.605
1.39TrpThr: 1.39 ± 0.787
0.556TrpVal: 0.556 ± 0.405
0.0TrpTrp: 0.0 ± 0.0
0.556TrpTyr: 0.556 ± 0.591
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.669TyrAla: 1.669 ± 0.495
0.556TyrCys: 0.556 ± 0.287
1.947TyrAsp: 1.947 ± 0.607
1.669TyrGlu: 1.669 ± 0.58
2.225TyrPhe: 2.225 ± 0.467
2.503TyrGly: 2.503 ± 0.641
1.112TyrHis: 1.112 ± 1.181
1.947TyrIle: 1.947 ± 0.789
1.947TyrLys: 1.947 ± 0.636
3.615TyrLeu: 3.615 ± 1.04
1.112TyrMet: 1.112 ± 0.892
2.781TyrAsn: 2.781 ± 1.618
1.39TyrPro: 1.39 ± 0.452
1.947TyrGln: 1.947 ± 0.551
1.669TyrArg: 1.669 ± 0.565
0.834TyrSer: 0.834 ± 0.278
2.781TyrThr: 2.781 ± 0.615
2.503TyrVal: 2.503 ± 0.579
1.669TyrTrp: 1.669 ± 0.609
0.834TyrTyr: 0.834 ± 0.64
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3597 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski