Amino acid dipeptide frequency for Sulfolobus turreted icosahedral virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
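
As a minimal sketch of the idea, the following Python snippet counts overlapping pairs of adjacent residues and reports them in per mille. The function name dipeptide_per_mille, the one-letter residue codes, and the toy sequences are assumptions made for illustration; the page does not state how Proteome-pI itself performs the counting or normalisation.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences.

    Dipeptides are overlapping pairs of adjacent residues. Pairs are counted
    within each protein only, never across two proteins (an assumption).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code, not taken from the actual proteome.
toy_proteins = ["MKKLE", "AKK"]
toy_freqs = dipeptide_per_mille(toy_proteins)
```

Here "MKKLE" contributes MK, KK, KL and LE, and "AKK" contributes AK and KK, so KK accounts for 2 of the 6 counted dipeptides.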

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
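
The comparison against the uniform expectation can be sketched as follows. The function names are hypothetical, and the plain greater-than or less-than test is an assumption: the page does not say whether the colouring also takes the error margin into account.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, expected=None):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if expected is None:
        expected = uniform_expectation_per_mille()
    if value_per_mille > expected:
        return "over"   # would be marked red
    if value_per_mille < expected:
        return "under"  # would be marked blue
    return "as expected"


# Values for AlaAla and AlaCys as listed in the table (in per mille).
ala_ala_label = classify(4.742)
ala_cys_label = classify(0.198)
```

With 20 amino acids the expectation is 1000 / 400 = 2.5‰; passing alphabet_size=21 gives 1000 / 441, roughly 2.27‰, for the variant that includes Xaa.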

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰). Entries are grouped by first residue; within each group the second residue runs Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.
Ala
AlaAla: 4.742 ± 1.36
AlaCys: 0.198 ± 0.183
AlaAsp: 2.173 ± 0.893
AlaGlu: 4.742 ± 1.159
AlaPhe: 1.383 ± 0.617
AlaGly: 5.137 ± 1.168
AlaHis: 0.593 ± 0.355
AlaIle: 5.137 ± 0.95
AlaLys: 4.742 ± 1.147
AlaLeu: 8.892 ± 1.537
AlaMet: 0.593 ± 0.367
AlaAsn: 1.976 ± 0.764
AlaPro: 2.766 ± 1.097
AlaGln: 2.964 ± 0.72
AlaArg: 2.371 ± 0.574
AlaSer: 2.569 ± 0.818
AlaThr: 2.173 ± 0.539
AlaVal: 1.581 ± 0.554
AlaTrp: 0.79 ± 0.398
AlaTyr: 1.581 ± 0.418
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.79 ± 0.592
CysPhe: 0.198 ± 0.194
CysGly: 1.581 ± 0.623
CysHis: 0.0 ± 0.0
CysIle: 0.79 ± 0.35
CysLys: 1.186 ± 0.633
CysLeu: 0.395 ± 0.268
CysMet: 0.198 ± 0.196
CysAsn: 0.79 ± 0.337
CysPro: 0.0 ± 0.0
CysGln: 0.593 ± 0.313
CysArg: 0.0 ± 0.0
CysSer: 0.593 ± 0.325
CysThr: 0.0 ± 0.0
CysVal: 0.395 ± 0.249
CysTrp: 0.0 ± 0.0
CysTyr: 0.198 ± 0.208
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.976 ± 0.586
AspCys: 0.198 ± 0.196
AspAsp: 1.186 ± 0.661
AspGlu: 5.73 ± 1.222
AspPhe: 2.371 ± 0.703
AspGly: 1.581 ± 0.587
AspHis: 0.395 ± 0.279
AspIle: 4.545 ± 1.104
AspLys: 1.383 ± 0.541
AspLeu: 6.718 ± 1.472
AspMet: 0.988 ± 0.539
AspAsn: 1.186 ± 0.544
AspPro: 0.988 ± 0.338
AspGln: 0.988 ± 0.413
AspArg: 0.988 ± 0.426
AspSer: 2.371 ± 0.901
AspThr: 1.383 ± 0.614
AspVal: 3.952 ± 0.874
AspTrp: 0.593 ± 0.342
AspTyr: 0.79 ± 0.345
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.533 ± 1.491
GluCys: 0.593 ± 0.317
GluAsp: 3.754 ± 0.901
GluGlu: 14.424 ± 3.337
GluPhe: 3.754 ± 0.838
GluGly: 5.73 ± 0.882
GluHis: 0.395 ± 0.292
GluIle: 9.287 ± 2.152
GluLys: 11.46 ± 2.705
GluLeu: 7.508 ± 1.379
GluMet: 3.359 ± 0.975
GluAsn: 2.964 ± 0.839
GluPro: 0.988 ± 0.414
GluGln: 3.952 ± 1.453
GluArg: 4.545 ± 1.028
GluSer: 3.754 ± 0.959
GluThr: 2.569 ± 0.732
GluVal: 4.545 ± 1.273
GluTrp: 0.0 ± 0.0
GluTyr: 3.952 ± 1.077
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.79 ± 0.38
PheCys: 0.988 ± 0.48
PheAsp: 1.581 ± 0.503
PheGlu: 3.161 ± 0.847
PhePhe: 3.557 ± 0.66
PheGly: 2.766 ± 0.792
PheHis: 0.79 ± 0.324
PheIle: 3.161 ± 0.796
PheLys: 3.161 ± 0.761
PheLeu: 4.347 ± 1.221
PheMet: 0.395 ± 0.281
PheAsn: 2.964 ± 0.63
PhePro: 1.778 ± 0.504
PheGln: 1.383 ± 0.529
PheArg: 1.383 ± 0.5
PheSer: 3.557 ± 1.04
PheThr: 1.778 ± 0.742
PheVal: 1.186 ± 0.626
PheTrp: 0.198 ± 0.173
PheTyr: 1.778 ± 0.678
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.149 ± 1.102
GlyCys: 0.198 ± 0.186
GlyAsp: 1.581 ± 0.706
GlyGlu: 5.533 ± 1.126
GlyPhe: 1.976 ± 0.572
GlyGly: 4.545 ± 1.309
GlyHis: 1.186 ± 0.579
GlyIle: 5.73 ± 1.54
GlyLys: 5.137 ± 1.273
GlyLeu: 6.718 ± 1.161
GlyMet: 1.383 ± 0.602
GlyAsn: 2.371 ± 0.777
GlyPro: 0.198 ± 0.207
GlyGln: 2.569 ± 0.876
GlyArg: 2.964 ± 0.829
GlySer: 4.742 ± 1.58
GlyThr: 4.347 ± 1.142
GlyVal: 2.766 ± 0.645
GlyTrp: 0.198 ± 0.188
GlyTyr: 1.976 ± 0.717
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.186 ± 0.697
HisCys: 0.0 ± 0.0
HisAsp: 0.395 ± 0.244
HisGlu: 0.988 ± 0.44
HisPhe: 1.581 ± 0.534
HisGly: 0.79 ± 0.378
HisHis: 0.395 ± 0.255
HisIle: 0.79 ± 0.4
HisLys: 1.581 ± 0.648
HisLeu: 2.371 ± 0.858
HisMet: 0.0 ± 0.0
HisAsn: 0.395 ± 0.29
HisPro: 1.778 ± 0.883
HisGln: 0.395 ± 0.375
HisArg: 0.0 ± 0.0
HisSer: 0.593 ± 0.334
HisThr: 0.395 ± 0.277
HisVal: 1.383 ± 0.44
HisTrp: 0.198 ± 0.186
HisTyr: 0.79 ± 0.376
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.73 ± 1.283
IleCys: 0.79 ± 0.436
IleAsp: 6.52 ± 1.323
IleGlu: 8.496 ± 1.598
IlePhe: 1.778 ± 0.524
IleGly: 4.94 ± 1.027
IleHis: 2.173 ± 0.548
IleIle: 9.089 ± 1.521
IleLys: 6.718 ± 1.173
IleLeu: 7.508 ± 1.343
IleMet: 2.173 ± 0.675
IleAsn: 4.347 ± 0.987
IlePro: 4.545 ± 1.003
IleGln: 2.964 ± 0.93
IleArg: 3.952 ± 0.835
IleSer: 5.533 ± 0.903
IleThr: 4.347 ± 0.942
IleVal: 4.545 ± 0.861
IleTrp: 0.79 ± 0.412
IleTyr: 3.161 ± 0.861
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.359 ± 0.954
LysCys: 0.79 ± 0.419
LysAsp: 4.347 ± 1.284
LysGlu: 10.275 ± 2.68
LysPhe: 3.557 ± 0.793
LysGly: 3.754 ± 0.682
LysHis: 1.383 ± 0.593
LysIle: 8.694 ± 1.354
LysLys: 10.077 ± 3.753
LysLeu: 6.718 ± 1.402
LysMet: 1.976 ± 0.556
LysAsn: 3.161 ± 0.972
LysPro: 1.186 ± 0.604
LysGln: 4.742 ± 1.244
LysArg: 3.952 ± 1.1
LysSer: 3.557 ± 1.183
LysThr: 3.161 ± 1.08
LysVal: 6.718 ± 0.924
LysTrp: 0.79 ± 0.615
LysTyr: 2.964 ± 0.602
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.101 ± 1.438
LeuCys: 0.79 ± 0.486
LeuAsp: 4.149 ± 0.76
LeuGlu: 6.718 ± 1.107
LeuPhe: 4.545 ± 1.024
LeuGly: 6.718 ± 1.219
LeuHis: 1.976 ± 0.716
LeuIle: 8.101 ± 1.329
LeuLys: 7.706 ± 1.215
LeuLeu: 8.101 ± 1.394
LeuMet: 1.581 ± 0.618
LeuAsn: 5.137 ± 0.888
LeuPro: 5.137 ± 1.892
LeuGln: 4.94 ± 1.122
LeuArg: 4.94 ± 1.13
LeuSer: 6.916 ± 0.965
LeuThr: 4.347 ± 0.791
LeuVal: 6.323 ± 1.311
LeuTrp: 0.198 ± 0.219
LeuTyr: 3.754 ± 0.987
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.383 ± 0.522
MetCys: 0.0 ± 0.0
MetAsp: 0.395 ± 0.284
MetGlu: 1.383 ± 0.683
MetPhe: 0.395 ± 0.279
MetGly: 1.383 ± 0.496
MetHis: 0.198 ± 0.201
MetIle: 1.186 ± 0.713
MetLys: 2.371 ± 0.793
MetLeu: 1.186 ± 0.494
MetMet: 0.395 ± 0.272
MetAsn: 1.581 ± 0.731
MetPro: 1.383 ± 0.477
MetGln: 0.0 ± 0.0
MetArg: 1.976 ± 0.634
MetSer: 1.976 ± 0.586
MetThr: 1.186 ± 0.489
MetVal: 0.79 ± 0.389
MetTrp: 0.198 ± 0.169
MetTyr: 0.593 ± 0.318
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.371 ± 0.671
AsnCys: 0.988 ± 0.498
AsnAsp: 2.371 ± 0.827
AsnGlu: 5.335 ± 1.103
AsnPhe: 1.383 ± 0.52
AsnGly: 2.371 ± 0.592
AsnHis: 0.79 ± 0.35
AsnIle: 3.754 ± 0.832
AsnLys: 2.964 ± 0.82
AsnLeu: 5.73 ± 0.97
AsnMet: 1.383 ± 0.544
AsnAsn: 1.778 ± 0.865
AsnPro: 2.766 ± 0.793
AsnGln: 1.976 ± 0.64
AsnArg: 1.976 ± 0.651
AsnSer: 1.581 ± 0.943
AsnThr: 3.952 ± 1.196
AsnVal: 3.359 ± 0.954
AsnTrp: 0.198 ± 0.187
AsnTyr: 1.186 ± 0.41
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.964 ± 1.045
ProCys: 0.198 ± 0.201
ProAsp: 0.988 ± 0.397
ProGlu: 2.173 ± 0.888
ProPhe: 1.976 ± 0.689
ProGly: 2.766 ± 0.943
ProHis: 0.593 ± 0.328
ProIle: 2.569 ± 0.646
ProLys: 1.976 ± 0.697
ProLeu: 3.557 ± 0.696
ProMet: 0.0 ± 0.0
ProAsn: 1.778 ± 0.736
ProPro: 2.766 ± 1.399
ProGln: 0.988 ± 0.43
ProArg: 1.186 ± 0.42
ProSer: 6.52 ± 3.022
ProThr: 1.581 ± 0.606
ProVal: 2.569 ± 0.972
ProTrp: 0.198 ± 0.201
ProTyr: 1.383 ± 0.607
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.964 ± 0.807
GlnCys: 0.198 ± 0.207
GlnAsp: 1.778 ± 0.466
GlnGlu: 1.778 ± 0.468
GlnPhe: 1.581 ± 0.619
GlnGly: 1.383 ± 0.511
GlnHis: 0.593 ± 0.303
GlnIle: 3.557 ± 0.871
GlnLys: 6.718 ± 1.606
GlnLeu: 5.533 ± 0.762
GlnMet: 0.198 ± 0.185
GlnAsn: 2.766 ± 0.724
GlnPro: 1.186 ± 0.481
GlnGln: 1.383 ± 0.648
GlnArg: 1.778 ± 0.634
GlnSer: 1.778 ± 0.664
GlnThr: 2.173 ± 0.833
GlnVal: 1.778 ± 0.509
GlnTrp: 0.395 ± 0.242
GlnTyr: 1.383 ± 0.585
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.976 ± 0.852
ArgCys: 0.395 ± 0.267
ArgAsp: 0.988 ± 0.389
ArgGlu: 3.754 ± 0.658
ArgPhe: 1.383 ± 0.548
ArgGly: 1.581 ± 0.739
ArgHis: 0.593 ± 0.342
ArgIle: 4.545 ± 1.029
ArgLys: 4.94 ± 1.138
ArgLeu: 3.952 ± 0.888
ArgMet: 0.79 ± 0.326
ArgAsn: 1.778 ± 0.561
ArgPro: 0.988 ± 0.323
ArgGln: 1.186 ± 0.463
ArgArg: 2.173 ± 0.846
ArgSer: 2.371 ± 0.679
ArgThr: 0.79 ± 0.416
ArgVal: 3.161 ± 0.809
ArgTrp: 0.395 ± 0.283
ArgTyr: 1.976 ± 0.628
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.569 ± 0.641
SerCys: 0.198 ± 0.201
SerAsp: 2.371 ± 0.554
SerGlu: 4.347 ± 0.807
SerPhe: 2.766 ± 0.783
SerGly: 5.533 ± 1.461
SerHis: 0.79 ± 0.363
SerIle: 6.323 ± 1.594
SerLys: 3.754 ± 1.119
SerLeu: 5.137 ± 0.86
SerMet: 1.186 ± 0.469
SerAsn: 3.161 ± 0.904
SerPro: 4.545 ± 2.099
SerGln: 3.754 ± 0.705
SerArg: 1.581 ± 0.563
SerSer: 8.299 ± 2.676
SerThr: 3.359 ± 1.231
SerVal: 3.952 ± 1.437
SerTrp: 0.593 ± 0.294
SerTyr: 2.964 ± 0.84
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.778 ± 0.687
ThrCys: 0.395 ± 0.286
ThrAsp: 1.186 ± 0.525
ThrGlu: 3.359 ± 0.87
ThrPhe: 3.359 ± 1.088
ThrGly: 3.161 ± 0.866
ThrHis: 1.581 ± 0.515
ThrIle: 3.754 ± 0.939
ThrLys: 2.173 ± 0.615
ThrLeu: 4.94 ± 1.104
ThrMet: 0.988 ± 0.421
ThrAsn: 3.754 ± 1.139
ThrPro: 1.581 ± 0.53
ThrGln: 2.173 ± 0.843
ThrArg: 1.581 ± 0.525
ThrSer: 2.964 ± 0.932
ThrThr: 2.766 ± 1.018
ThrVal: 2.766 ± 0.869
ThrTrp: 0.198 ± 0.187
ThrTyr: 2.766 ± 0.871
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.557 ± 0.845
ValCys: 0.593 ± 0.316
ValAsp: 1.581 ± 0.67
ValGlu: 4.94 ± 1.375
ValPhe: 2.173 ± 0.563
ValGly: 1.581 ± 0.546
ValHis: 0.593 ± 0.562
ValIle: 5.335 ± 1.16
ValLys: 4.149 ± 1.042
ValLeu: 5.73 ± 1.709
ValMet: 1.186 ± 0.422
ValAsn: 2.569 ± 0.654
ValPro: 2.964 ± 0.946
ValGln: 2.371 ± 0.715
ValArg: 1.383 ± 0.493
ValSer: 4.347 ± 1.208
ValThr: 5.335 ± 1.668
ValVal: 4.149 ± 1.204
ValTrp: 1.186 ± 0.573
ValTyr: 3.754 ± 0.882
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.198 ± 0.169
TrpCys: 0.0 ± 0.0
TrpAsp: 0.395 ± 0.414
TrpGlu: 1.383 ± 0.631
TrpPhe: 0.0 ± 0.0
TrpGly: 0.593 ± 0.33
TrpHis: 0.0 ± 0.0
TrpIle: 0.79 ± 0.542
TrpLys: 0.593 ± 0.458
TrpLeu: 0.395 ± 0.241
TrpMet: 0.198 ± 0.177
TrpAsn: 0.79 ± 0.38
TrpPro: 0.0 ± 0.0
TrpGln: 0.593 ± 0.397
TrpArg: 0.198 ± 0.187
TrpSer: 0.198 ± 0.187
TrpThr: 0.395 ± 0.29
TrpVal: 0.395 ± 0.278
TrpTrp: 0.198 ± 0.183
TrpTyr: 0.198 ± 0.187
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.778 ± 0.576
TyrCys: 0.198 ± 0.208
TyrAsp: 2.569 ± 0.653
TyrGlu: 3.754 ± 0.958
TyrPhe: 0.988 ± 0.36
TyrGly: 2.371 ± 0.607
TyrHis: 0.988 ± 0.48
TyrIle: 2.964 ± 0.909
TyrLys: 2.371 ± 0.538
TyrLeu: 4.742 ± 0.539
TyrMet: 0.79 ± 0.473
TyrAsn: 2.964 ± 0.654
TyrPro: 1.186 ± 0.374
TyrGln: 0.988 ± 0.389
TyrArg: 0.988 ± 0.413
TyrSer: 2.964 ± 0.886
TyrThr: 1.186 ± 0.55
TyrVal: 3.161 ± 0.768
TyrTrp: 0.198 ± 0.187
TyrTyr: 3.359 ± 0.822
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 36 proteins (5062 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
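
A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the reported error are all assumptions; the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Per mille frequency of each overlapping residue pair (within proteins)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of a dipeptide frequency over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so resampling happens at the protein level, not the residue level.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not taken from the actual proteome.
toy_proteins = ["MKKLE", "AKK", "GGAKK"]
kk_error = bootstrap_error(toy_proteins, "KK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.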

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski