Amino acid dipeptide frequency for Tetraselmis viridis virus S1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 6.9‰ = 0.0069 = 0.69%).
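
The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names (`dipeptide_permille`, `classify`), the use of one-letter codes, the counting of overlapping pairs within each protein, and the normalization by the total number of counted pairs are all assumptions.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted inside each protein and the
    counts are divided by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown letter to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_permille > expected:
        return "over"   # would be marked red
    if freq_permille < expected:
        return "under"  # would be marked blue
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKKLAA", "AAGX"])
    print(round(freqs["AA"], 1), classify(freqs["AA"]))
```

Keeping the expected value as a parameter makes it easy to switch to the 441-pair expectation (1000/441 ‰) when Xaa is counted as a 21st amino acid.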

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.9 ± 1.112
AlaCys: 0.242 ± 0.142
AlaAsp: 5.447 ± 0.553
AlaGlu: 4.237 ± 0.858
AlaPhe: 1.937 ± 0.539
AlaGly: 5.084 ± 0.762
AlaHis: 2.3 ± 0.389
AlaIle: 3.995 ± 0.567
AlaLys: 2.905 ± 0.617
AlaLeu: 5.326 ± 0.832
AlaMet: 1.453 ± 0.44
AlaAsn: 3.874 ± 0.651
AlaPro: 3.995 ± 0.699
AlaGln: 1.937 ± 0.444
AlaArg: 3.147 ± 0.666
AlaSer: 3.632 ± 0.703
AlaThr: 4.842 ± 0.823
AlaVal: 4.479 ± 1.19
AlaTrp: 0.484 ± 0.208
AlaTyr: 2.058 ± 0.616
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.242 ± 0.189
CysCys: 0.484 ± 0.245
CysAsp: 0.847 ± 0.333
CysGlu: 0.726 ± 0.345
CysPhe: 0.242 ± 0.123
CysGly: 1.089 ± 0.357
CysHis: 0.484 ± 0.307
CysIle: 0.726 ± 0.303
CysLys: 1.089 ± 0.479
CysLeu: 0.605 ± 0.242
CysMet: 0.363 ± 0.23
CysAsn: 0.121 ± 0.134
CysPro: 0.847 ± 0.302
CysGln: 0.242 ± 0.139
CysArg: 0.726 ± 0.347
CysSer: 0.363 ± 0.283
CysThr: 0.363 ± 0.187
CysVal: 0.484 ± 0.243
CysTrp: 0.0 ± 0.0
CysTyr: 0.363 ± 0.21
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.6 ± 0.675
AspCys: 0.726 ± 0.253
AspAsp: 6.658 ± 1.306
AspGlu: 5.81 ± 1.177
AspPhe: 3.632 ± 0.647
AspGly: 6.295 ± 0.644
AspHis: 1.089 ± 0.43
AspIle: 4.116 ± 0.693
AspLys: 3.632 ± 0.654
AspLeu: 4.6 ± 0.691
AspMet: 2.058 ± 0.622
AspAsn: 2.663 ± 0.441
AspPro: 5.689 ± 1.083
AspGln: 2.179 ± 0.527
AspArg: 2.663 ± 0.521
AspSer: 6.537 ± 1.143
AspThr: 3.51 ± 0.936
AspVal: 4.842 ± 0.561
AspTrp: 1.332 ± 0.442
AspTyr: 3.874 ± 0.572
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.6 ± 0.674
GluCys: 0.847 ± 0.484
GluAsp: 4.116 ± 0.76
GluGlu: 5.689 ± 1.298
GluPhe: 2.905 ± 0.569
GluGly: 3.51 ± 0.794
GluHis: 1.211 ± 0.311
GluIle: 3.874 ± 0.675
GluLys: 2.058 ± 0.456
GluLeu: 5.326 ± 0.545
GluMet: 2.421 ± 0.585
GluAsn: 2.905 ± 0.506
GluPro: 2.784 ± 0.78
GluGln: 1.937 ± 0.649
GluArg: 3.026 ± 0.759
GluSer: 4.721 ± 0.785
GluThr: 3.51 ± 0.747
GluVal: 2.905 ± 0.531
GluTrp: 0.968 ± 0.409
GluTyr: 3.268 ± 0.528
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.695 ± 0.442
PheCys: 0.363 ± 0.255
PheAsp: 3.632 ± 0.787
PheGlu: 2.179 ± 0.466
PhePhe: 0.726 ± 0.248
PheGly: 1.816 ± 0.445
PheHis: 0.605 ± 0.241
PheIle: 1.937 ± 0.343
PheLys: 2.542 ± 0.706
PheLeu: 2.542 ± 0.597
PheMet: 1.453 ± 0.345
PheAsn: 2.3 ± 0.382
PhePro: 2.542 ± 0.491
PheGln: 0.968 ± 0.304
PheArg: 2.421 ± 0.52
PheSer: 2.663 ± 0.435
PheThr: 2.3 ± 0.489
PheVal: 1.695 ± 0.482
PheTrp: 0.121 ± 0.134
PheTyr: 0.968 ± 0.442
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.568 ± 1.557
GlyCys: 0.484 ± 0.217
GlyAsp: 5.568 ± 1.002
GlyGlu: 3.632 ± 0.522
GlyPhe: 3.026 ± 0.828
GlyGly: 6.295 ± 1.499
GlyHis: 1.695 ± 0.395
GlyIle: 4.479 ± 0.723
GlyLys: 3.026 ± 0.424
GlyLeu: 4.116 ± 0.53
GlyMet: 1.332 ± 0.514
GlyAsn: 4.116 ± 0.705
GlyPro: 3.753 ± 1.748
GlyGln: 2.3 ± 0.492
GlyArg: 4.116 ± 0.68
GlySer: 6.779 ± 1.065
GlyThr: 5.084 ± 0.831
GlyVal: 4.358 ± 0.802
GlyTrp: 0.847 ± 0.29
GlyTyr: 1.937 ± 0.509
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.574 ± 0.346
HisCys: 0.363 ± 0.256
HisAsp: 1.211 ± 0.401
HisGlu: 1.695 ± 0.494
HisPhe: 1.211 ± 0.392
HisGly: 2.3 ± 0.608
HisHis: 0.847 ± 0.286
HisIle: 0.847 ± 0.229
HisLys: 1.332 ± 0.511
HisLeu: 1.937 ± 0.505
HisMet: 1.211 ± 0.285
HisAsn: 0.847 ± 0.354
HisPro: 1.211 ± 0.349
HisGln: 0.847 ± 0.495
HisArg: 1.332 ± 0.381
HisSer: 1.574 ± 0.448
HisThr: 1.816 ± 0.537
HisVal: 1.816 ± 0.426
HisTrp: 0.484 ± 0.265
HisTyr: 0.242 ± 0.144
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.663 ± 0.65
IleCys: 0.726 ± 0.393
IleAsp: 4.479 ± 0.747
IleGlu: 4.116 ± 0.83
IlePhe: 0.968 ± 0.319
IleGly: 3.753 ± 0.5
IleHis: 1.089 ± 0.315
IleIle: 3.389 ± 0.579
IleLys: 2.784 ± 0.69
IleLeu: 3.874 ± 0.506
IleMet: 1.332 ± 0.429
IleAsn: 3.389 ± 0.731
IlePro: 3.026 ± 0.615
IleGln: 2.179 ± 0.506
IleArg: 3.389 ± 0.486
IleSer: 4.116 ± 0.49
IleThr: 4.116 ± 1.06
IleVal: 2.784 ± 0.676
IleTrp: 0.726 ± 0.446
IleTyr: 2.179 ± 0.481
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.179 ± 0.524
LysCys: 0.726 ± 0.305
LysAsp: 3.632 ± 0.804
LysGlu: 3.147 ± 0.728
LysPhe: 2.179 ± 0.464
LysGly: 3.026 ± 0.642
LysHis: 1.695 ± 0.429
LysIle: 2.179 ± 0.449
LysLys: 4.116 ± 1.058
LysLeu: 2.905 ± 0.709
LysMet: 1.695 ± 0.448
LysAsn: 2.058 ± 0.676
LysPro: 2.784 ± 0.704
LysGln: 1.089 ± 0.343
LysArg: 5.568 ± 1.294
LysSer: 3.51 ± 0.644
LysThr: 3.026 ± 0.684
LysVal: 2.058 ± 0.482
LysTrp: 0.484 ± 0.216
LysTyr: 1.937 ± 0.443
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.963 ± 1.017
LeuCys: 0.484 ± 0.294
LeuAsp: 4.6 ± 0.725
LeuGlu: 3.995 ± 0.59
LeuPhe: 1.937 ± 0.517
LeuGly: 4.116 ± 0.599
LeuHis: 1.453 ± 0.371
LeuIle: 3.51 ± 0.432
LeuLys: 3.632 ± 0.542
LeuLeu: 3.874 ± 0.571
LeuMet: 1.937 ± 0.465
LeuAsn: 3.874 ± 0.932
LeuPro: 3.995 ± 0.701
LeuGln: 2.179 ± 0.683
LeuArg: 4.842 ± 0.697
LeuSer: 5.689 ± 0.798
LeuThr: 4.237 ± 0.683
LeuVal: 4.6 ± 0.851
LeuTrp: 0.605 ± 0.342
LeuTyr: 2.542 ± 0.527
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.905 ± 0.628
MetCys: 0.363 ± 0.199
MetAsp: 1.574 ± 0.405
MetGlu: 1.937 ± 0.46
MetPhe: 0.726 ± 0.225
MetGly: 2.905 ± 0.731
MetHis: 0.605 ± 0.279
MetIle: 1.453 ± 0.46
MetLys: 0.968 ± 0.404
MetLeu: 1.453 ± 0.315
MetMet: 1.211 ± 0.352
MetAsn: 1.211 ± 0.442
MetPro: 1.089 ± 0.321
MetGln: 0.847 ± 0.382
MetArg: 1.089 ± 0.372
MetSer: 2.179 ± 0.739
MetThr: 1.937 ± 0.592
MetVal: 1.453 ± 0.366
MetTrp: 0.484 ± 0.22
MetTyr: 1.453 ± 0.462
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.389 ± 0.817
AsnCys: 0.484 ± 0.318
AsnAsp: 2.663 ± 0.514
AsnGlu: 2.542 ± 0.523
AsnPhe: 3.147 ± 0.496
AsnGly: 3.268 ± 0.61
AsnHis: 1.695 ± 0.348
AsnIle: 2.784 ± 0.532
AsnLys: 4.116 ± 0.835
AsnLeu: 3.874 ± 0.64
AsnMet: 1.211 ± 0.471
AsnAsn: 2.421 ± 0.733
AsnPro: 3.632 ± 0.519
AsnGln: 1.574 ± 0.483
AsnArg: 3.268 ± 0.776
AsnSer: 4.479 ± 0.801
AsnThr: 3.632 ± 0.694
AsnVal: 2.3 ± 0.673
AsnTrp: 0.605 ± 0.275
AsnTyr: 1.695 ± 0.389
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.358 ± 0.84
ProCys: 0.484 ± 0.281
ProAsp: 5.205 ± 0.793
ProGlu: 5.205 ± 0.874
ProPhe: 0.847 ± 0.218
ProGly: 3.874 ± 0.527
ProHis: 1.089 ± 0.311
ProIle: 2.542 ± 0.522
ProLys: 1.937 ± 0.525
ProLeu: 3.389 ± 1.052
ProMet: 1.574 ± 0.366
ProAsn: 2.058 ± 0.58
ProPro: 3.268 ± 0.741
ProGln: 0.605 ± 0.377
ProArg: 3.268 ± 0.545
ProSer: 4.116 ± 0.853
ProThr: 3.995 ± 1.394
ProVal: 4.842 ± 0.654
ProTrp: 0.605 ± 0.219
ProTyr: 1.816 ± 0.563
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.3 ± 0.551
GlnCys: 0.121 ± 0.092
GlnAsp: 1.816 ± 0.471
GlnGlu: 1.089 ± 0.332
GlnPhe: 1.332 ± 0.375
GlnGly: 1.332 ± 0.353
GlnHis: 0.363 ± 0.275
GlnIle: 2.058 ± 0.52
GlnLys: 1.332 ± 0.554
GlnLeu: 2.421 ± 0.417
GlnMet: 1.211 ± 0.506
GlnAsn: 2.058 ± 0.428
GlnPro: 1.332 ± 0.366
GlnGln: 0.484 ± 0.227
GlnArg: 1.453 ± 0.414
GlnSer: 3.026 ± 0.638
GlnThr: 2.542 ± 0.668
GlnVal: 0.847 ± 0.319
GlnTrp: 0.242 ± 0.195
GlnTyr: 2.663 ± 0.784
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.995 ± 0.942
ArgCys: 0.242 ± 0.163
ArgAsp: 4.358 ± 0.848
ArgGlu: 4.116 ± 0.88
ArgPhe: 2.058 ± 0.492
ArgGly: 3.51 ± 0.585
ArgHis: 1.695 ± 0.408
ArgIle: 2.784 ± 0.461
ArgLys: 3.995 ± 0.855
ArgLeu: 4.116 ± 0.719
ArgMet: 1.453 ± 0.39
ArgAsn: 3.026 ± 0.8
ArgPro: 2.663 ± 0.627
ArgGln: 2.542 ± 0.728
ArgArg: 4.237 ± 0.792
ArgSer: 3.026 ± 0.806
ArgThr: 2.421 ± 0.52
ArgVal: 5.326 ± 0.726
ArgTrp: 0.968 ± 0.351
ArgTyr: 2.421 ± 0.491
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.237 ± 0.543
SerCys: 0.484 ± 0.275
SerAsp: 6.658 ± 1.11
SerGlu: 4.116 ± 0.807
SerPhe: 2.542 ± 0.647
SerGly: 7.868 ± 1.204
SerHis: 1.816 ± 0.49
SerIle: 3.632 ± 0.747
SerLys: 3.753 ± 0.724
SerLeu: 5.205 ± 0.719
SerMet: 1.332 ± 0.39
SerAsn: 4.842 ± 0.681
SerPro: 2.663 ± 0.847
SerGln: 2.179 ± 0.507
SerArg: 4.479 ± 0.762
SerSer: 5.447 ± 1.089
SerThr: 5.084 ± 0.866
SerVal: 3.995 ± 0.526
SerTrp: 0.484 ± 0.198
SerTyr: 2.058 ± 0.377
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.116 ± 0.781
ThrCys: 0.726 ± 0.287
ThrAsp: 4.6 ± 0.659
ThrGlu: 2.179 ± 0.353
ThrPhe: 1.816 ± 0.623
ThrGly: 5.447 ± 1.125
ThrHis: 2.179 ± 0.456
ThrIle: 4.237 ± 0.791
ThrLys: 2.421 ± 0.517
ThrLeu: 4.237 ± 0.688
ThrMet: 1.089 ± 0.359
ThrAsn: 4.237 ± 0.685
ThrPro: 4.237 ± 0.583
ThrGln: 2.421 ± 0.601
ThrArg: 3.632 ± 0.745
ThrSer: 3.147 ± 0.499
ThrThr: 4.6 ± 0.707
ThrVal: 5.689 ± 1.159
ThrTrp: 0.968 ± 0.442
ThrTyr: 3.389 ± 0.655
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.116 ± 0.588
ValCys: 1.332 ± 0.346
ValAsp: 5.084 ± 0.824
ValGlu: 3.389 ± 0.801
ValPhe: 2.3 ± 0.523
ValGly: 3.995 ± 0.7
ValHis: 1.453 ± 0.379
ValIle: 3.026 ± 0.639
ValLys: 2.663 ± 0.483
ValLeu: 3.753 ± 0.644
ValMet: 1.332 ± 0.417
ValAsn: 4.479 ± 0.916
ValPro: 3.51 ± 0.786
ValGln: 2.058 ± 0.402
ValArg: 2.542 ± 0.597
ValSer: 4.237 ± 0.942
ValThr: 4.721 ± 0.854
ValVal: 3.268 ± 0.595
ValTrp: 1.089 ± 0.448
ValTyr: 4.116 ± 0.809
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.968 ± 0.286
TrpCys: 0.242 ± 0.199
TrpAsp: 0.726 ± 0.213
TrpGlu: 0.968 ± 0.429
TrpPhe: 0.363 ± 0.231
TrpGly: 0.605 ± 0.23
TrpHis: 0.242 ± 0.152
TrpIle: 0.605 ± 0.33
TrpLys: 0.484 ± 0.293
TrpLeu: 0.726 ± 0.337
TrpMet: 0.121 ± 0.137
TrpAsn: 0.605 ± 0.367
TrpPro: 0.121 ± 0.095
TrpGln: 0.484 ± 0.273
TrpArg: 0.847 ± 0.268
TrpSer: 1.332 ± 0.39
TrpThr: 0.605 ± 0.318
TrpVal: 1.453 ± 0.338
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.484 ± 0.203
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.905 ± 0.652
TyrCys: 0.484 ± 0.226
TyrAsp: 3.632 ± 0.69
TyrGlu: 1.816 ± 0.465
TyrPhe: 1.574 ± 0.456
TyrGly: 2.421 ± 0.667
TyrHis: 1.211 ± 0.322
TyrIle: 2.905 ± 0.382
TyrLys: 1.211 ± 0.329
TyrLeu: 2.784 ± 0.597
TyrMet: 1.937 ± 0.539
TyrAsn: 1.695 ± 0.487
TyrPro: 1.937 ± 0.357
TyrGln: 0.968 ± 0.311
TyrArg: 2.905 ± 0.738
TyrSer: 2.3 ± 0.582
TyrThr: 3.026 ± 0.603
TyrVal: 3.268 ± 0.702
TyrTrp: 0.484 ± 0.226
TyrTyr: 0.605 ± 0.264
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (8262 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
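
A protein-level bootstrap of the kind mentioned in the note can be sketched as follows. The note only states that bootstrapping was done 100 times at the protein level; everything else here is an assumption made for illustration: the helper names (`pair_permille`, `bootstrap_error`), resampling whole proteins with replacement, and reporting the sample standard deviation of the replicate frequencies as the error.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille of all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) whole proteins with replacement and
    recomputes the frequency; the returned error is the sample standard
    deviation over the replicates (an assumed definition of the error).
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    replicates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample, pair))
    return statistics.stdev(replicates)


if __name__ == "__main__":
    toy = ["MKKLAAG", "AAKKR", "GSGSAA", "MLLKK"]
    print(pair_permille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps the pairs inside each sequence intact, so the spread reflects protein-to-protein variation in composition.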

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski