Amino acid dipepetide frequency for Tortoise microvirus 82

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.948AlaAla: 13.948 ± 3.543
0.536AlaCys: 0.536 ± 0.584
3.755AlaAsp: 3.755 ± 1.497
9.657AlaGlu: 9.657 ± 0.831
6.438AlaPhe: 6.438 ± 1.572
11.803AlaGly: 11.803 ± 4.475
3.755AlaHis: 3.755 ± 1.587
2.146AlaIle: 2.146 ± 1.287
4.292AlaLys: 4.292 ± 1.071
6.438AlaLeu: 6.438 ± 1.419
1.073AlaMet: 1.073 ± 0.855
3.219AlaAsn: 3.219 ± 1.568
5.365AlaPro: 5.365 ± 0.764
4.292AlaGln: 4.292 ± 1.689
8.047AlaArg: 8.047 ± 3.233
3.219AlaSer: 3.219 ± 1.075
5.365AlaThr: 5.365 ± 1.689
8.584AlaVal: 8.584 ± 2.203
1.609AlaTrp: 1.609 ± 1.179
4.828AlaTyr: 4.828 ± 1.479
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.536CysGlu: 0.536 ± 0.557
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.536CysIle: 0.536 ± 0.557
1.073CysLys: 1.073 ± 0.717
1.073CysLeu: 1.073 ± 1.115
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.536CysPro: 0.536 ± 0.4
0.536CysGln: 0.536 ± 0.584
2.682CysArg: 2.682 ± 2.786
0.536CysSer: 0.536 ± 0.582
0.536CysThr: 0.536 ± 0.4
0.536CysVal: 0.536 ± 0.592
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.292AspAla: 4.292 ± 1.423
0.536AspCys: 0.536 ± 0.557
1.609AspAsp: 1.609 ± 0.67
3.219AspGlu: 3.219 ± 0.984
2.682AspPhe: 2.682 ± 0.903
4.828AspGly: 4.828 ± 1.604
1.609AspHis: 1.609 ± 0.606
3.219AspIle: 3.219 ± 1.309
1.609AspLys: 1.609 ± 1.331
4.292AspLeu: 4.292 ± 1.416
1.073AspMet: 1.073 ± 0.652
2.146AspAsn: 2.146 ± 1.239
5.365AspPro: 5.365 ± 0.888
2.682AspGln: 2.682 ± 0.988
2.682AspArg: 2.682 ± 0.976
0.0AspSer: 0.0 ± 0.0
1.609AspThr: 1.609 ± 0.539
4.828AspVal: 4.828 ± 1.809
1.073AspTrp: 1.073 ± 0.662
1.609AspTyr: 1.609 ± 0.786
0.0AspXaa: 0.0 ± 0.0
Glu
6.438GluAla: 6.438 ± 0.779
0.0GluCys: 0.0 ± 0.0
2.682GluAsp: 2.682 ± 0.975
0.536GluGlu: 0.536 ± 0.584
0.536GluPhe: 0.536 ± 0.582
3.219GluGly: 3.219 ± 0.899
0.536GluHis: 0.536 ± 0.557
2.146GluIle: 2.146 ± 0.705
1.609GluLys: 1.609 ± 0.67
4.292GluLeu: 4.292 ± 0.89
2.146GluMet: 2.146 ± 1.133
0.536GluAsn: 0.536 ± 0.557
4.292GluPro: 4.292 ± 1.722
3.755GluGln: 3.755 ± 0.707
6.438GluArg: 6.438 ± 2.29
2.146GluSer: 2.146 ± 0.866
2.682GluThr: 2.682 ± 1.059
6.438GluVal: 6.438 ± 2.021
0.536GluTrp: 0.536 ± 0.4
1.073GluTyr: 1.073 ± 0.552
0.0GluXaa: 0.0 ± 0.0
Phe
3.755PheAla: 3.755 ± 1.036
0.536PheCys: 0.536 ± 0.557
1.073PheAsp: 1.073 ± 0.8
2.682PheGlu: 2.682 ± 1.186
1.073PhePhe: 1.073 ± 0.552
3.755PheGly: 3.755 ± 0.944
0.0PheHis: 0.0 ± 0.0
1.609PheIle: 1.609 ± 0.694
1.609PheLys: 1.609 ± 1.126
2.682PheLeu: 2.682 ± 1.439
0.536PheMet: 0.536 ± 0.557
1.073PheAsn: 1.073 ± 0.694
1.073PhePro: 1.073 ± 0.8
2.682PheGln: 2.682 ± 0.903
2.146PheArg: 2.146 ± 0.94
2.146PheSer: 2.146 ± 1.031
1.609PheThr: 1.609 ± 1.009
3.219PheVal: 3.219 ± 1.444
1.073PheTrp: 1.073 ± 0.552
2.682PheTyr: 2.682 ± 0.976
0.0PheXaa: 0.0 ± 0.0
Gly
8.584GlyAla: 8.584 ± 1.502
0.0GlyCys: 0.0 ± 0.0
5.365GlyAsp: 5.365 ± 1.531
4.828GlyGlu: 4.828 ± 2.096
1.609GlyPhe: 1.609 ± 1.187
9.657GlyGly: 9.657 ± 1.647
4.292GlyHis: 4.292 ± 1.088
6.438GlyIle: 6.438 ± 2.48
2.682GlyLys: 2.682 ± 1.525
4.828GlyLeu: 4.828 ± 2.07
3.219GlyMet: 3.219 ± 0.893
3.219GlyAsn: 3.219 ± 1.009
2.682GlyPro: 2.682 ± 1.865
2.682GlyGln: 2.682 ± 0.472
4.828GlyArg: 4.828 ± 1.874
4.292GlySer: 4.292 ± 1.608
7.511GlyThr: 7.511 ± 1.817
7.511GlyVal: 7.511 ± 1.411
1.609GlyTrp: 1.609 ± 0.539
2.682GlyTyr: 2.682 ± 0.976
0.0GlyXaa: 0.0 ± 0.0
His
2.146HisAla: 2.146 ± 0.757
1.073HisCys: 1.073 ± 0.662
3.219HisAsp: 3.219 ± 0.988
0.0HisGlu: 0.0 ± 0.0
1.609HisPhe: 1.609 ± 0.732
4.292HisGly: 4.292 ± 2.469
0.536HisHis: 0.536 ± 0.557
3.219HisIle: 3.219 ± 0.984
0.0HisLys: 0.0 ± 0.0
1.073HisLeu: 1.073 ± 0.618
0.0HisMet: 0.0 ± 0.0
1.609HisAsn: 1.609 ± 0.712
0.536HisPro: 0.536 ± 0.557
2.146HisGln: 2.146 ± 0.832
1.073HisArg: 1.073 ± 0.662
1.073HisSer: 1.073 ± 0.598
0.0HisThr: 0.0 ± 0.0
1.073HisVal: 1.073 ± 0.654
2.146HisTrp: 2.146 ± 1.324
0.536HisTyr: 0.536 ± 0.592
0.0HisXaa: 0.0 ± 0.0
Ile
3.755IleAla: 3.755 ± 1.254
0.0IleCys: 0.0 ± 0.0
2.682IleAsp: 2.682 ± 1.557
3.219IleGlu: 3.219 ± 0.813
0.536IlePhe: 0.536 ± 0.486
4.828IleGly: 4.828 ± 2.08
0.0IleHis: 0.0 ± 0.0
1.073IleIle: 1.073 ± 0.8
1.609IleLys: 1.609 ± 0.694
2.146IleLeu: 2.146 ± 0.533
0.0IleMet: 0.0 ± 0.0
3.755IleAsn: 3.755 ± 1.455
2.146IlePro: 2.146 ± 0.805
0.0IleGln: 0.0 ± 0.0
3.219IleArg: 3.219 ± 1.086
4.828IleSer: 4.828 ± 1.606
5.901IleThr: 5.901 ± 1.188
1.609IleVal: 1.609 ± 0.786
1.073IleTrp: 1.073 ± 0.8
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.219LysAla: 3.219 ± 1.6
0.536LysCys: 0.536 ± 0.557
1.609LysAsp: 1.609 ± 0.918
2.682LysGlu: 2.682 ± 1.931
1.073LysPhe: 1.073 ± 0.955
3.755LysGly: 3.755 ± 1.884
1.073LysHis: 1.073 ± 0.552
1.609LysIle: 1.609 ± 1.034
1.609LysLys: 1.609 ± 1.009
1.609LysLeu: 1.609 ± 0.933
1.073LysMet: 1.073 ± 0.695
2.682LysAsn: 2.682 ± 0.84
4.292LysPro: 4.292 ± 1.023
1.073LysGln: 1.073 ± 0.552
2.682LysArg: 2.682 ± 1.801
2.682LysSer: 2.682 ± 1.81
1.073LysThr: 1.073 ± 0.654
2.682LysVal: 2.682 ± 1.325
0.536LysTrp: 0.536 ± 0.584
1.609LysTyr: 1.609 ± 1.672
0.0LysXaa: 0.0 ± 0.0
Leu
10.193LeuAla: 10.193 ± 1.92
1.073LeuCys: 1.073 ± 0.552
3.219LeuAsp: 3.219 ± 0.984
3.219LeuGlu: 3.219 ± 2.267
1.609LeuPhe: 1.609 ± 0.626
7.511LeuGly: 7.511 ± 2.148
1.073LeuHis: 1.073 ± 0.598
2.146LeuIle: 2.146 ± 0.94
2.146LeuLys: 2.146 ± 0.852
4.292LeuLeu: 4.292 ± 1.407
2.146LeuMet: 2.146 ± 0.789
3.755LeuAsn: 3.755 ± 0.823
1.609LeuPro: 1.609 ± 0.835
4.292LeuGln: 4.292 ± 2.183
6.974LeuArg: 6.974 ± 1.91
3.755LeuSer: 3.755 ± 1.287
3.755LeuThr: 3.755 ± 1.619
3.755LeuVal: 3.755 ± 1.576
0.536LeuTrp: 0.536 ± 0.584
1.073LeuTyr: 1.073 ± 0.662
0.0LeuXaa: 0.0 ± 0.0
Met
3.755MetAla: 3.755 ± 0.878
0.536MetCys: 0.536 ± 0.557
1.073MetAsp: 1.073 ± 0.8
2.146MetGlu: 2.146 ± 1.194
0.536MetPhe: 0.536 ± 0.592
1.609MetGly: 1.609 ± 0.539
1.073MetHis: 1.073 ± 0.515
1.073MetIle: 1.073 ± 0.762
2.146MetLys: 2.146 ± 1.483
3.219MetLeu: 3.219 ± 1.44
1.073MetMet: 1.073 ± 0.694
1.073MetAsn: 1.073 ± 0.8
1.073MetPro: 1.073 ± 0.552
1.609MetGln: 1.609 ± 1.133
1.073MetArg: 1.073 ± 0.762
1.609MetSer: 1.609 ± 1.2
1.073MetThr: 1.073 ± 0.8
1.073MetVal: 1.073 ± 0.515
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
5.365AsnAla: 5.365 ± 2.056
0.0AsnCys: 0.0 ± 0.0
3.219AsnAsp: 3.219 ± 1.655
0.536AsnGlu: 0.536 ± 0.557
2.682AsnPhe: 2.682 ± 0.617
2.682AsnGly: 2.682 ± 1.145
1.073AsnHis: 1.073 ± 0.869
2.146AsnIle: 2.146 ± 0.789
2.146AsnLys: 2.146 ± 0.968
1.073AsnLeu: 1.073 ± 0.8
0.536AsnMet: 0.536 ± 0.584
0.536AsnAsn: 0.536 ± 0.4
3.755AsnPro: 3.755 ± 0.738
3.755AsnGln: 3.755 ± 1.035
3.219AsnArg: 3.219 ± 1.136
3.755AsnSer: 3.755 ± 1.23
1.609AsnThr: 1.609 ± 1.2
3.219AsnVal: 3.219 ± 1.253
0.536AsnTrp: 0.536 ± 0.557
0.536AsnTyr: 0.536 ± 0.486
0.0AsnXaa: 0.0 ± 0.0
Pro
5.901ProAla: 5.901 ± 2.628
1.073ProCys: 1.073 ± 1.115
2.146ProAsp: 2.146 ± 1.196
4.292ProGlu: 4.292 ± 1.451
3.219ProPhe: 3.219 ± 1.23
5.901ProGly: 5.901 ± 1.37
2.682ProHis: 2.682 ± 0.699
2.146ProIle: 2.146 ± 1.023
3.219ProLys: 3.219 ± 1.824
4.292ProLeu: 4.292 ± 1.299
1.609ProMet: 1.609 ± 0.875
2.682ProAsn: 2.682 ± 1.072
1.073ProPro: 1.073 ± 1.115
3.219ProGln: 3.219 ± 0.716
4.292ProArg: 4.292 ± 1.862
1.609ProSer: 1.609 ± 0.784
5.365ProThr: 5.365 ± 1.827
4.828ProVal: 4.828 ± 1.451
0.536ProTrp: 0.536 ± 0.582
1.609ProTyr: 1.609 ± 0.539
0.0ProXaa: 0.0 ± 0.0
Gln
7.511GlnAla: 7.511 ± 2.285
0.536GlnCys: 0.536 ± 0.4
2.146GlnAsp: 2.146 ± 1.119
1.609GlnGlu: 1.609 ± 0.626
1.073GlnPhe: 1.073 ± 0.723
5.901GlnGly: 5.901 ± 1.821
1.073GlnHis: 1.073 ± 1.184
2.682GlnIle: 2.682 ± 0.842
2.146GlnLys: 2.146 ± 0.968
5.365GlnLeu: 5.365 ± 1.411
0.536GlnMet: 0.536 ± 0.4
1.609GlnAsn: 1.609 ± 0.786
2.146GlnPro: 2.146 ± 0.97
3.219GlnGln: 3.219 ± 0.837
4.292GlnArg: 4.292 ± 1.092
2.682GlnSer: 2.682 ± 1.4
1.609GlnThr: 1.609 ± 0.539
2.146GlnVal: 2.146 ± 0.789
0.536GlnTrp: 0.536 ± 0.592
0.536GlnTyr: 0.536 ± 0.486
0.0GlnXaa: 0.0 ± 0.0
Arg
6.438ArgAla: 6.438 ± 2.628
0.536ArgCys: 0.536 ± 0.557
6.974ArgAsp: 6.974 ± 2.12
3.755ArgGlu: 3.755 ± 1.455
3.219ArgPhe: 3.219 ± 0.73
3.755ArgGly: 3.755 ± 1.779
3.219ArgHis: 3.219 ± 1.533
2.146ArgIle: 2.146 ± 1.6
1.609ArgLys: 1.609 ± 0.842
2.682ArgLeu: 2.682 ± 1.439
3.219ArgMet: 3.219 ± 0.712
3.219ArgAsn: 3.219 ± 1.641
4.828ArgPro: 4.828 ± 2.346
3.219ArgGln: 3.219 ± 0.893
8.584ArgArg: 8.584 ± 2.514
5.901ArgSer: 5.901 ± 1.561
3.219ArgThr: 3.219 ± 0.984
4.828ArgVal: 4.828 ± 2.003
1.073ArgTrp: 1.073 ± 0.734
3.755ArgTyr: 3.755 ± 1.231
0.0ArgXaa: 0.0 ± 0.0
Ser
3.755SerAla: 3.755 ± 1.329
0.536SerCys: 0.536 ± 0.582
2.682SerAsp: 2.682 ± 0.94
2.146SerGlu: 2.146 ± 0.617
1.609SerPhe: 1.609 ± 0.773
5.365SerGly: 5.365 ± 2.647
2.146SerHis: 2.146 ± 0.787
2.682SerIle: 2.682 ± 1.295
3.755SerLys: 3.755 ± 1.603
3.219SerLeu: 3.219 ± 0.716
2.146SerMet: 2.146 ± 0.857
1.073SerAsn: 1.073 ± 0.618
4.292SerPro: 4.292 ± 1.088
2.146SerGln: 2.146 ± 0.758
3.755SerArg: 3.755 ± 1.612
1.073SerSer: 1.073 ± 0.723
3.219SerThr: 3.219 ± 1.242
3.755SerVal: 3.755 ± 0.921
0.536SerTrp: 0.536 ± 0.582
1.073SerTyr: 1.073 ± 0.552
0.0SerXaa: 0.0 ± 0.0
Thr
7.511ThrAla: 7.511 ± 1.603
0.0ThrCys: 0.0 ± 0.0
1.609ThrAsp: 1.609 ± 1.034
1.073ThrGlu: 1.073 ± 0.515
2.682ThrPhe: 2.682 ± 1.265
2.146ThrGly: 2.146 ± 1.193
0.0ThrHis: 0.0 ± 0.0
2.146ThrIle: 2.146 ± 1.164
0.536ThrLys: 0.536 ± 0.557
3.219ThrLeu: 3.219 ± 0.939
2.682ThrMet: 2.682 ± 1.153
2.682ThrAsn: 2.682 ± 0.801
4.828ThrPro: 4.828 ± 1.376
4.292ThrGln: 4.292 ± 1.471
3.755ThrArg: 3.755 ± 1.041
2.682ThrSer: 2.682 ± 1.13
4.292ThrThr: 4.292 ± 1.246
4.292ThrVal: 4.292 ± 1.643
1.073ThrTrp: 1.073 ± 0.654
1.609ThrTyr: 1.609 ± 1.034
0.0ThrXaa: 0.0 ± 0.0
Val
6.438ValAla: 6.438 ± 1.434
0.536ValCys: 0.536 ± 0.557
2.682ValAsp: 2.682 ± 0.617
3.219ValGlu: 3.219 ± 1.159
2.146ValPhe: 2.146 ± 0.802
5.365ValGly: 5.365 ± 2.554
2.146ValHis: 2.146 ± 1.145
1.073ValIle: 1.073 ± 0.8
4.828ValLys: 4.828 ± 1.983
7.511ValLeu: 7.511 ± 2.527
2.682ValMet: 2.682 ± 1.05
4.292ValAsn: 4.292 ± 1.578
9.12ValPro: 9.12 ± 2.558
3.755ValGln: 3.755 ± 1.196
3.219ValArg: 3.219 ± 1.193
5.365ValSer: 5.365 ± 1.767
2.146ValThr: 2.146 ± 1.376
5.365ValVal: 5.365 ± 2.538
1.609ValTrp: 1.609 ± 0.655
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
2.146TrpAla: 2.146 ± 1.238
0.536TrpCys: 0.536 ± 0.582
1.609TrpAsp: 1.609 ± 1.242
1.073TrpGlu: 1.073 ± 0.662
0.536TrpPhe: 0.536 ± 0.557
0.536TrpGly: 0.536 ± 0.592
0.536TrpHis: 0.536 ± 0.557
0.0TrpIle: 0.0 ± 0.0
0.536TrpLys: 0.536 ± 0.486
0.536TrpLeu: 0.536 ± 0.582
0.0TrpMet: 0.0 ± 0.0
1.073TrpAsn: 1.073 ± 0.8
0.536TrpPro: 0.536 ± 0.557
0.0TrpGln: 0.0 ± 0.0
2.682TrpArg: 2.682 ± 0.841
1.073TrpSer: 1.073 ± 0.552
0.536TrpThr: 0.536 ± 0.584
2.146TrpVal: 2.146 ± 0.975
0.536TrpTrp: 0.536 ± 0.557
0.536TrpTyr: 0.536 ± 0.584
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.219TyrAla: 3.219 ± 1.052
0.0TyrCys: 0.0 ± 0.0
1.609TyrAsp: 1.609 ± 0.694
1.609TyrGlu: 1.609 ± 0.862
2.146TyrPhe: 2.146 ± 1.635
1.609TyrGly: 1.609 ± 1.114
0.536TyrHis: 0.536 ± 0.557
2.146TyrIle: 2.146 ± 1.103
0.0TyrLys: 0.0 ± 0.0
4.292TyrLeu: 4.292 ± 1.579
0.536TyrMet: 0.536 ± 0.4
2.146TyrAsn: 2.146 ± 1.031
2.146TyrPro: 2.146 ± 1.047
0.0TyrGln: 0.0 ± 0.0
0.536TyrArg: 0.536 ± 0.4
0.536TyrSer: 0.536 ± 0.557
0.0TyrThr: 0.0 ± 0.0
2.146TyrVal: 2.146 ± 0.968
0.536TyrTrp: 0.536 ± 0.557
0.536TyrTyr: 0.536 ± 0.557
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (1865 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski