Amino acid dipepetide frequency for Culex tritaeniorhynchus totivirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.871AlaAla: 6.871 ± 2.888
2.021AlaCys: 2.021 ± 0.39
3.638AlaAsp: 3.638 ± 0.572
3.638AlaGlu: 3.638 ± 1.223
1.617AlaPhe: 1.617 ± 0.182
4.042AlaGly: 4.042 ± 1.431
0.808AlaHis: 0.808 ± 0.416
4.042AlaIle: 4.042 ± 0.521
1.617AlaLys: 1.617 ± 0.469
8.488AlaLeu: 8.488 ± 1.118
1.213AlaMet: 1.213 ± 0.749
4.85AlaAsn: 4.85 ± 0.546
3.234AlaPro: 3.234 ± 1.665
2.425AlaGln: 2.425 ± 0.052
5.255AlaArg: 5.255 ± 1.848
7.276AlaSer: 7.276 ± 1.795
8.084AlaThr: 8.084 ± 1.561
4.85AlaVal: 4.85 ± 0.755
0.808AlaTrp: 0.808 ± 0.416
3.234AlaTyr: 3.234 ± 0.287
0.0AlaXaa: 0.0 ± 0.0
Cys
0.404CysAla: 0.404 ± 0.208
0.404CysCys: 0.404 ± 0.442
1.213CysAsp: 1.213 ± 0.624
0.404CysGlu: 0.404 ± 0.442
0.0CysPhe: 0.0 ± 0.0
0.404CysGly: 0.404 ± 0.208
0.808CysHis: 0.808 ± 0.234
0.404CysIle: 0.404 ± 0.208
1.213CysLys: 1.213 ± 0.677
1.213CysLeu: 1.213 ± 0.624
0.0CysMet: 0.0 ± 0.0
1.617CysAsn: 1.617 ± 0.469
0.404CysPro: 0.404 ± 0.208
0.808CysGln: 0.808 ± 0.416
0.808CysArg: 0.808 ± 0.416
0.0CysSer: 0.0 ± 0.0
0.404CysThr: 0.404 ± 0.208
0.808CysVal: 0.808 ± 0.416
0.0CysTrp: 0.0 ± 0.0
0.808CysTyr: 0.808 ± 0.234
0.0CysXaa: 0.0 ± 0.0
Asp
2.829AspAla: 2.829 ± 0.495
1.617AspCys: 1.617 ± 0.182
3.638AspAsp: 3.638 ± 0.079
2.829AspGlu: 2.829 ± 0.156
1.617AspPhe: 1.617 ± 0.182
3.234AspGly: 3.234 ± 0.287
1.213AspHis: 1.213 ± 0.026
6.871AspIle: 6.871 ± 1.016
2.425AspLys: 2.425 ± 1.249
5.255AspLeu: 5.255 ± 0.547
0.404AspMet: 0.404 ± 0.208
2.021AspAsn: 2.021 ± 0.26
1.617AspPro: 1.617 ± 0.833
2.021AspGln: 2.021 ± 0.39
2.425AspArg: 2.425 ± 0.703
2.425AspSer: 2.425 ± 0.598
1.617AspThr: 1.617 ± 0.182
2.021AspVal: 2.021 ± 0.39
0.404AspTrp: 0.404 ± 0.442
2.021AspTyr: 2.021 ± 0.39
0.0AspXaa: 0.0 ± 0.0
Glu
3.234GluAla: 3.234 ± 0.287
0.404GluCys: 0.404 ± 0.208
0.808GluAsp: 0.808 ± 0.234
3.638GluGlu: 3.638 ± 0.729
2.829GluPhe: 2.829 ± 1.796
4.042GluGly: 4.042 ± 0.13
1.617GluHis: 1.617 ± 0.469
1.617GluIle: 1.617 ± 1.119
1.617GluLys: 1.617 ± 0.182
4.85GluLeu: 4.85 ± 0.755
2.021GluMet: 2.021 ± 1.041
2.021GluAsn: 2.021 ± 0.39
4.446GluPro: 4.446 ± 1.639
2.425GluGln: 2.425 ± 0.703
2.425GluArg: 2.425 ± 0.598
2.021GluSer: 2.021 ± 1.041
4.446GluThr: 4.446 ± 0.313
2.021GluVal: 2.021 ± 0.26
1.617GluTrp: 1.617 ± 0.469
2.425GluTyr: 2.425 ± 0.598
0.0GluXaa: 0.0 ± 0.0
Phe
3.638PheAla: 3.638 ± 0.079
0.808PheCys: 0.808 ± 0.416
5.659PheAsp: 5.659 ± 0.312
3.638PheGlu: 3.638 ± 0.572
0.808PhePhe: 0.808 ± 0.234
3.234PheGly: 3.234 ± 0.287
0.404PheHis: 0.404 ± 0.208
1.213PheIle: 1.213 ± 0.677
2.425PheLys: 2.425 ± 1.354
1.213PheLeu: 1.213 ± 0.026
0.404PheMet: 0.404 ± 0.272
0.808PheAsn: 0.808 ± 0.234
2.829PhePro: 2.829 ± 0.495
0.404PheGln: 0.404 ± 0.208
1.617PheArg: 1.617 ± 0.182
3.234PheSer: 3.234 ± 0.364
2.425PheThr: 2.425 ± 0.598
3.638PheVal: 3.638 ± 0.079
0.808PheTrp: 0.808 ± 0.234
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.638GlyAla: 3.638 ± 1.223
0.404GlyCys: 0.404 ± 0.208
4.042GlyAsp: 4.042 ± 1.172
2.021GlyGlu: 2.021 ± 0.26
2.425GlyPhe: 2.425 ± 0.052
5.255GlyGly: 5.255 ± 0.547
0.404GlyHis: 0.404 ± 0.442
4.446GlyIle: 4.446 ± 0.313
3.234GlyLys: 3.234 ± 0.937
4.446GlyLeu: 4.446 ± 2.265
0.404GlyMet: 0.404 ± 0.208
4.042GlyAsn: 4.042 ± 0.78
4.042GlyPro: 4.042 ± 0.13
0.808GlyGln: 0.808 ± 0.234
3.234GlyArg: 3.234 ± 1.015
4.85GlySer: 4.85 ± 0.546
3.638GlyThr: 3.638 ± 1.223
4.042GlyVal: 4.042 ± 0.78
2.425GlyTrp: 2.425 ± 2.004
1.617GlyTyr: 1.617 ± 0.469
0.0GlyXaa: 0.0 ± 0.0
His
2.021HisAla: 2.021 ± 0.39
0.0HisCys: 0.0 ± 0.0
0.808HisAsp: 0.808 ± 0.234
0.808HisGlu: 0.808 ± 0.234
0.0HisPhe: 0.0 ± 0.0
0.808HisGly: 0.808 ± 0.234
0.404HisHis: 0.404 ± 0.208
0.404HisIle: 0.404 ± 0.208
0.0HisLys: 0.0 ± 0.0
3.638HisLeu: 3.638 ± 1.38
0.404HisMet: 0.404 ± 0.442
0.808HisAsn: 0.808 ± 0.416
0.404HisPro: 0.404 ± 0.442
0.808HisGln: 0.808 ± 0.885
0.404HisArg: 0.404 ± 0.442
1.617HisSer: 1.617 ± 0.833
1.617HisThr: 1.617 ± 0.833
0.808HisVal: 0.808 ± 0.234
0.0HisTrp: 0.0 ± 0.0
0.404HisTyr: 0.404 ± 0.208
0.0HisXaa: 0.0 ± 0.0
Ile
5.255IleAla: 5.255 ± 0.547
0.0IleCys: 0.0 ± 0.0
1.213IleAsp: 1.213 ± 0.026
4.85IleGlu: 4.85 ± 1.197
2.425IlePhe: 2.425 ± 0.598
3.234IleGly: 3.234 ± 1.588
0.404IleHis: 0.404 ± 0.208
2.021IleIle: 2.021 ± 0.39
3.638IleLys: 3.638 ± 0.729
5.255IleLeu: 5.255 ± 1.848
2.829IleMet: 2.829 ± 0.495
2.829IleAsn: 2.829 ± 0.495
4.446IlePro: 4.446 ± 0.338
2.021IleGln: 2.021 ± 0.911
2.829IleArg: 2.829 ± 0.495
4.042IleSer: 4.042 ± 0.521
6.467IleThr: 6.467 ± 1.875
4.446IleVal: 4.446 ± 0.338
1.213IleTrp: 1.213 ± 0.677
2.021IleTyr: 2.021 ± 1.562
0.0IleXaa: 0.0 ± 0.0
Lys
4.446LysAla: 4.446 ± 0.338
0.0LysCys: 0.0 ± 0.0
2.021LysAsp: 2.021 ± 0.26
2.829LysGlu: 2.829 ± 0.495
1.213LysPhe: 1.213 ± 0.677
2.021LysGly: 2.021 ± 0.911
1.213LysHis: 1.213 ± 0.677
3.234LysIle: 3.234 ± 2.239
1.213LysLys: 1.213 ± 0.026
5.255LysLeu: 5.255 ± 1.198
2.021LysMet: 2.021 ± 0.39
1.617LysAsn: 1.617 ± 1.119
3.234LysPro: 3.234 ± 0.287
1.617LysGln: 1.617 ± 0.469
3.638LysArg: 3.638 ± 0.729
2.425LysSer: 2.425 ± 0.703
3.234LysThr: 3.234 ± 0.287
3.638LysVal: 3.638 ± 0.729
1.213LysTrp: 1.213 ± 1.327
2.425LysTyr: 2.425 ± 0.703
0.0LysXaa: 0.0 ± 0.0
Leu
6.063LeuAla: 6.063 ± 0.131
2.021LeuCys: 2.021 ± 0.39
5.255LeuAsp: 5.255 ± 1.198
5.255LeuGlu: 5.255 ± 1.198
1.617LeuPhe: 1.617 ± 0.182
4.446LeuGly: 4.446 ± 0.338
1.213LeuHis: 1.213 ± 0.677
2.829LeuIle: 2.829 ± 0.495
3.638LeuLys: 3.638 ± 1.38
6.871LeuLeu: 6.871 ± 0.365
1.617LeuMet: 1.617 ± 1.119
5.659LeuAsn: 5.659 ± 0.339
9.297LeuPro: 9.297 ± 0.418
4.042LeuGln: 4.042 ± 1.431
4.85LeuArg: 4.85 ± 1.406
8.084LeuSer: 8.084 ± 0.259
5.659LeuThr: 5.659 ± 2.264
2.829LeuVal: 2.829 ± 0.156
2.021LeuTrp: 2.021 ± 0.26
4.042LeuTyr: 4.042 ± 1.172
0.0LeuXaa: 0.0 ± 0.0
Met
2.829MetAla: 2.829 ± 0.156
0.404MetCys: 0.404 ± 0.208
1.213MetAsp: 1.213 ± 0.677
0.808MetGlu: 0.808 ± 0.234
0.404MetPhe: 0.404 ± 0.208
0.404MetGly: 0.404 ± 0.208
0.808MetHis: 0.808 ± 0.416
1.213MetIle: 1.213 ± 0.026
0.404MetLys: 0.404 ± 0.208
1.617MetLeu: 1.617 ± 0.182
0.404MetMet: 0.404 ± 0.208
1.213MetAsn: 1.213 ± 0.624
0.404MetPro: 0.404 ± 0.208
1.213MetGln: 1.213 ± 0.026
0.808MetArg: 0.808 ± 0.416
1.213MetSer: 1.213 ± 0.624
2.425MetThr: 2.425 ± 0.598
1.213MetVal: 1.213 ± 0.677
0.0MetTrp: 0.0 ± 0.0
0.404MetTyr: 0.404 ± 0.442
0.0MetXaa: 0.0 ± 0.0
Asn
3.234AsnAla: 3.234 ± 1.015
1.617AsnCys: 1.617 ± 0.469
2.829AsnAsp: 2.829 ± 1.457
2.021AsnGlu: 2.021 ± 0.26
2.829AsnPhe: 2.829 ± 0.806
2.021AsnGly: 2.021 ± 1.041
0.808AsnHis: 0.808 ± 0.234
4.042AsnIle: 4.042 ± 0.521
2.829AsnLys: 2.829 ± 0.156
2.829AsnLeu: 2.829 ± 0.156
1.617AsnMet: 1.617 ± 0.182
6.467AsnAsn: 6.467 ± 1.379
4.85AsnPro: 4.85 ± 0.105
2.021AsnGln: 2.021 ± 0.39
2.021AsnArg: 2.021 ± 0.26
6.467AsnSer: 6.467 ± 0.728
2.425AsnThr: 2.425 ± 0.598
4.85AsnVal: 4.85 ± 0.546
1.617AsnTrp: 1.617 ± 0.182
1.213AsnTyr: 1.213 ± 0.677
0.0AsnXaa: 0.0 ± 0.0
Pro
6.063ProAla: 6.063 ± 3.122
0.0ProCys: 0.0 ± 0.0
1.617ProAsp: 1.617 ± 0.182
2.021ProGlu: 2.021 ± 0.911
3.638ProPhe: 3.638 ± 0.079
5.659ProGly: 5.659 ± 1.64
0.808ProHis: 0.808 ± 0.416
3.638ProIle: 3.638 ± 0.079
3.234ProLys: 3.234 ± 0.364
5.255ProLeu: 5.255 ± 0.103
0.808ProMet: 0.808 ± 0.234
3.234ProAsn: 3.234 ± 1.015
3.234ProPro: 3.234 ± 1.015
4.042ProGln: 4.042 ± 0.13
3.234ProArg: 3.234 ± 1.015
4.042ProSer: 4.042 ± 0.78
6.063ProThr: 6.063 ± 0.131
4.042ProVal: 4.042 ± 1.431
1.213ProTrp: 1.213 ± 0.677
0.808ProTyr: 0.808 ± 0.234
0.0ProXaa: 0.0 ± 0.0
Gln
4.042GlnAla: 4.042 ± 0.78
0.404GlnCys: 0.404 ± 0.442
0.808GlnAsp: 0.808 ± 0.416
1.213GlnGlu: 1.213 ± 0.026
0.808GlnPhe: 0.808 ± 0.234
2.425GlnGly: 2.425 ± 0.052
1.213GlnHis: 1.213 ± 0.677
3.234GlnIle: 3.234 ± 0.364
0.404GlnLys: 0.404 ± 0.442
3.234GlnLeu: 3.234 ± 0.287
0.404GlnMet: 0.404 ± 0.208
1.213GlnAsn: 1.213 ± 0.677
2.829GlnPro: 2.829 ± 0.495
2.021GlnGln: 2.021 ± 0.39
3.638GlnArg: 3.638 ± 0.729
4.85GlnSer: 4.85 ± 0.105
3.638GlnThr: 3.638 ± 0.572
3.234GlnVal: 3.234 ± 0.364
0.404GlnTrp: 0.404 ± 0.208
2.425GlnTyr: 2.425 ± 0.598
0.0GlnXaa: 0.0 ± 0.0
Arg
3.234ArgAla: 3.234 ± 0.287
0.404ArgCys: 0.404 ± 0.208
2.425ArgAsp: 2.425 ± 0.703
1.617ArgGlu: 1.617 ± 0.182
1.617ArgPhe: 1.617 ± 0.182
3.234ArgGly: 3.234 ± 0.937
0.404ArgHis: 0.404 ± 0.442
3.638ArgIle: 3.638 ± 0.729
3.638ArgLys: 3.638 ± 2.03
6.063ArgLeu: 6.063 ± 1.17
0.0ArgMet: 0.0 ± 0.0
5.255ArgAsn: 5.255 ± 1.405
2.425ArgPro: 2.425 ± 0.052
3.638ArgGln: 3.638 ± 1.38
2.829ArgArg: 2.829 ± 0.156
2.829ArgSer: 2.829 ± 0.156
2.829ArgThr: 2.829 ± 0.495
2.425ArgVal: 2.425 ± 0.598
0.404ArgTrp: 0.404 ± 0.208
1.213ArgTyr: 1.213 ± 0.677
0.0ArgXaa: 0.0 ± 0.0
Ser
5.659SerAla: 5.659 ± 1.613
0.0SerCys: 0.0 ± 0.0
2.829SerAsp: 2.829 ± 0.806
4.042SerGlu: 4.042 ± 1.431
4.446SerPhe: 4.446 ± 0.313
3.638SerGly: 3.638 ± 0.572
1.617SerHis: 1.617 ± 0.182
5.255SerIle: 5.255 ± 1.198
6.063SerLys: 6.063 ± 0.781
4.85SerLeu: 4.85 ± 0.546
2.425SerMet: 2.425 ± 0.598
4.85SerAsn: 4.85 ± 1.197
2.829SerPro: 2.829 ± 0.495
2.425SerGln: 2.425 ± 0.598
2.829SerArg: 2.829 ± 1.457
7.68SerSer: 7.68 ± 0.599
4.446SerThr: 4.446 ± 0.988
6.063SerVal: 6.063 ± 0.52
1.617SerTrp: 1.617 ± 0.182
2.021SerTyr: 2.021 ± 0.911
0.0SerXaa: 0.0 ± 0.0
Thr
3.234ThrAla: 3.234 ± 1.015
0.808ThrCys: 0.808 ± 0.416
3.638ThrAsp: 3.638 ± 0.572
4.446ThrGlu: 4.446 ± 0.988
4.446ThrPhe: 4.446 ± 0.338
6.063ThrGly: 6.063 ± 0.781
0.404ThrHis: 0.404 ± 0.208
6.063ThrIle: 6.063 ± 0.52
4.042ThrLys: 4.042 ± 2.473
8.084ThrLeu: 8.084 ± 0.391
0.404ThrMet: 0.404 ± 0.208
2.829ThrAsn: 2.829 ± 0.806
3.638ThrPro: 3.638 ± 1.223
3.638ThrGln: 3.638 ± 0.572
3.234ThrArg: 3.234 ± 0.287
5.659ThrSer: 5.659 ± 0.312
6.063ThrThr: 6.063 ± 1.17
4.446ThrVal: 4.446 ± 1.639
1.213ThrTrp: 1.213 ± 0.624
1.617ThrTyr: 1.617 ± 0.469
0.0ThrXaa: 0.0 ± 0.0
Val
6.871ValAla: 6.871 ± 0.936
0.404ValCys: 0.404 ± 0.442
2.021ValAsp: 2.021 ± 1.041
2.829ValGlu: 2.829 ± 1.796
4.446ValPhe: 4.446 ± 0.313
3.638ValGly: 3.638 ± 0.572
0.0ValHis: 0.0 ± 0.0
2.021ValIle: 2.021 ± 0.39
3.234ValLys: 3.234 ± 0.287
3.234ValLeu: 3.234 ± 1.665
0.808ValMet: 0.808 ± 0.416
4.446ValAsn: 4.446 ± 1.639
5.255ValPro: 5.255 ± 0.754
4.446ValGln: 4.446 ± 0.338
2.829ValArg: 2.829 ± 1.796
4.042ValSer: 4.042 ± 1.431
4.446ValThr: 4.446 ± 0.313
2.829ValVal: 2.829 ± 0.806
2.021ValTrp: 2.021 ± 0.39
3.234ValTyr: 3.234 ± 0.364
0.0ValXaa: 0.0 ± 0.0
Trp
1.617TrpAla: 1.617 ± 0.833
0.0TrpCys: 0.0 ± 0.0
0.808TrpAsp: 0.808 ± 0.234
0.404TrpGlu: 0.404 ± 0.208
0.808TrpPhe: 0.808 ± 0.885
0.404TrpGly: 0.404 ± 0.442
0.404TrpHis: 0.404 ± 0.442
1.213TrpIle: 1.213 ± 0.677
0.404TrpLys: 0.404 ± 0.442
2.425TrpLeu: 2.425 ± 0.703
0.808TrpMet: 0.808 ± 0.416
2.021TrpAsn: 2.021 ± 0.26
0.808TrpPro: 0.808 ± 0.234
0.404TrpGln: 0.404 ± 0.208
0.808TrpArg: 0.808 ± 0.234
1.617TrpSer: 1.617 ± 1.119
1.617TrpThr: 1.617 ± 1.119
1.617TrpVal: 1.617 ± 0.833
0.0TrpTrp: 0.0 ± 0.0
0.808TrpTyr: 0.808 ± 0.416
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.234TyrAla: 3.234 ± 1.588
0.404TyrCys: 0.404 ± 0.208
2.021TyrAsp: 2.021 ± 0.26
1.213TyrGlu: 1.213 ± 0.677
1.213TyrPhe: 1.213 ± 0.624
1.213TyrGly: 1.213 ± 0.624
1.213TyrHis: 1.213 ± 0.624
4.042TyrIle: 4.042 ± 1.172
3.234TyrLys: 3.234 ± 2.239
3.234TyrLeu: 3.234 ± 0.937
0.0TyrMet: 0.0 ± 0.0
0.808TyrAsn: 0.808 ± 0.234
2.425TyrPro: 2.425 ± 0.598
1.213TyrGln: 1.213 ± 0.677
0.404TyrArg: 0.404 ± 0.208
1.213TyrSer: 1.213 ± 0.026
2.425TyrThr: 2.425 ± 0.052
3.234TyrVal: 3.234 ± 0.287
0.0TyrTrp: 0.0 ± 0.0
1.213TyrTyr: 1.213 ± 0.026
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2475 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski