Monday, September 28, 2009

Arabic Diacritics Based Steganography

Mohammed A. Aabed, Sameh M. Awaideh, Abdul-Rahman M. Elshafei and Adnan A. Gutub, 'Arabic Diacritics Based Steganography,' 2007 IEEE International Conference on Signal Processing and Communications (ICSPC 2007), 24-27 November 2007, Dubai, United Arab Emirates. [ PDF ]

Abstract

New steganography methods are being proposed to embed secret information into text cover media in order to search for new possibilities employing languages other than English. This paper utilizes the advantages of diacritics in Arabic to implement text steganography. Diacritics - or Harakat - in Arabic are used to represent vowel sounds and can be found in many formal and religious documents. The proposed approach uses eight different diacritical symbols in Arabic to hide binary bits in the original cover media. The embedded data are then extracted by reading the diacritics from the document and translating them back to binary.

Diacritics 就是音標的意思, 換句話說, 就是出現在文字旁邊用來表示不同發音的發音符號。作者在 Table 1 中列出了阿拉伯文的 8 種主要的音標。阿拉伯人用這些音標來改變發音或是用來區別不同字義但拼法卻相似的文字(to alter the pronunciation of a phoneme or to distinguish between words of similar spelling)。由於這些音標在文章中是可有可無的(optional), 因此, 作者提出了本篇利用音標來嵌入機密訊息到阿拉伯文章的論文。

From StegoRN

根據作者的統計分析指出, 上述 8 個音標, 出現頻率最高的是 Fatha, 幾乎等於其他 7 種音標出現的總合。因此, 機密訊息 1 被指定嵌入於 Fatha 所出現的位置中, 機密訊息 0 則被指定嵌入於其他 7 種音標所出現的位置。

本篇論文所提出的 嵌入程序(embedding process) 如下:

選定一篇所有文字都標有音標的掩護文章(a fully diacritized Arabic text), 然後從文章的開頭開始向下搜尋,

如果要嵌入的機密訊息是 1, 掩護文章中所遭遇的文字的音標剛好也是 Fatha, 那就完全不更改掩護文章上的文字, 如果所遭遇的文字的音標不是 Fatha, 而是其他 7 種音標, 則刪去該音標; 然後, 繼續在掩護文章中尋找下一個文字, 如果還是其他 7 種音標, 繼續刪去音標, 直到遭遇帶有 Fatha 的文字;

如果要嵌入的機密訊息是 0, 掩護文章中所遭遇的文字的音標並不是 Fatha, 而是其他 7 種音標, 那就完全不更改掩護文章上的文字, 如果所遭遇的文字的音標是 Fatha, 則刪去 Fatha 音標; 然後, 繼續在掩護文章中尋找下一個文字, 如果還是 Fatha , 則繼續刪去, 直到遭遇其他 7 種音標為止。

萃取程序(extracting process)

要取出機密訊息則是從文章的開頭開始向下搜尋帶有音標的文字, 只要判斷該音標為何? 就可以知道此位置所藏的機密訊息為何?

嵌入容量

這篇論文所提出的嵌入技術, 嵌入容量其實非常容易估計, 只要去算算整篇文章最後留下多少音標, 就是嵌入機密訊息的長度。理論上, 假使音標是隨機出現, 藏入的 0, 1 機密訊息也是隨機的, 那就是有 50% 的音標會被保留下來; 然而, 作者在論文中指出, 平均嵌入容量約為 26.16% (= 50% * 3.27% / 6.25% ), 個人覺得應該是阿拉伯文中存在某種的發音特質, 使得音標出現的次序並不是隨機的, 造成嵌入容量無法達到理論值的緣故。

這篇論文最有趣的一點是作者指出所提出的隱藏技術可能會引起別人的注意:

From StegoRN

換句話說, 當你需要使用隱藏技術來掩護秘密通訊的行為時, 你敢使用這篇論文所提出的技術嗎?

恐怕, 作者自己都不敢!

但, 作者很誠實就是了 :)

Wednesday, September 23, 2009

A Novel Arabic Text Steganography Method Using Letter Points and Extensions

Adnan A. Gutub and Manal M. Fattani, 'A Novel Arabic Text Steganography Method Using Letter Points and Extensions,' WASET International Conference on Computer, Information and Systems Science and Engineering (ICCISSE 2007), 25-27 May 2007, Vienna, Austria. [ PDF ]

Abstract

This paper presents a new steganography approach suitable for Arabic texts. It can be classified under steganography feature coding methods. The approach hides secret information bits within the letters benefiting from their inherited points. To note the specific letters holding secret bits, the scheme considers the two features, the existence of the points in the letters and the redundant Arabic extension character. We use the pointed letters with extension to hold the secret bit ‘one’ and the un-pointed letters with extension to hold ‘zero’. This steganography technique is found attractive to other languages having similar texts to Arabic such as Persian and Urdu. Keywords — Arabic text, Cryptography, Feature coding, Information security, Text steganography, Text watermarking.

論文的 Introduction 中, 作者討論到隱藏學(or 偽裝學, steganography) 這個領域應該要從三個角度來探討: 嵌入容量(capacity), 安全性(security) and 強韌性(Robustness)。作者也討論到 steganography 與 cryptography, 還有與 watermarking 之間的差異。

然而, 比較可惜的一點是作者並沒有指出不同的應用, 對各種需求有極大的差異。例如: steganography 的目的是秘密通訊, 要求整個通訊行為絕對不可以引起監控人員(or一般人)的絲毫懷疑。如果沒有辦法做到這一點, 換句話說, 在沒有保證絕對安全(security)的情況下, 縱使有再大的嵌入容量, 終究也是一套完全沒用的技術。

這篇論文所提出來的技術, 用到了阿拉伯文(Arabic)的兩個特徵:
1. 28 個阿拉伯文字母中, 有 15 個字母有點, 其餘 13 個則沒有點。(Fig.1)
2. 可以在字母前後加入不影響語意的擴充文字(extension)。

From StegoRN

因此, 本篇論文所提出的 嵌入程序(embedding process) 如下:

從文章的開頭開始向下搜尋,

如果想要嵌入的機密訊息為 1, 則尋找下一個有點文字(pointed letter), 如果這個有點文字允許加入擴充文字, 則加入擴充文字, 代表此處嵌有機密訊息, 如果不允許加入擴充文字, 則放棄, 再尋找下一個有點文字;

如果想要嵌入的機密訊息為 0, 則尋找下一個無點文字(un-pointed letter), 如果這個無點文字允許加入擴充文字, 則加入擴充文字, 代表此處嵌有機密訊息。同樣地, 如果這個無點文字並不允許加入擴充文字, 則放棄, 再尋找下一個無點文字。

萃取程序(extracting process)

要取出機密訊息則是從文章的開頭開始向下搜尋擴充文字, 當找到擴充文字時, 只要判斷該文字是否為有點或無點文字(pointed or un-pointed letter), 就可以知道此位置所藏的機密訊息為何?

這篇論文完全沒有討論到一篇加入不少擴充文字(extensions)的文章是否會引起別人的懷疑, 在阿拉伯文中, 擴充文字的使用狀況為何? 總之, 一個隱藏技術所應該面對的安全問題, 卻完全沒有討論, 真的很可惜 !

Monday, September 21, 2009

About Arabic Text Steganography (1)

Prof. Adnan Abdul-Aziz Gutub 是沙烏地阿拉伯(Saudi Arabia) 法德國王石油與礦物大學(King Fahd University of Petroleum and Minerals) 電腦工程學系(Computer Engineering Department) 的主席。

最近連續研讀了四篇 Prof. Adnan Abdul-Aziz Gutub 所發表的研討會論文, 論文主軸都是圍繞在 Arabic Text Steganography, 也就是將機密訊息藏在 Arabic Text 檔案之中, 希望達到秘密通訊的目的。

這四篇論文分別如下:

1. Adnan A. Gutub and Manal M. Fattani,
'A Novel Arabic Text Steganography Method Using Letter Points and Extensions,'
WASET International Conference on Computer, Information and Systems Science and Engineering (ICCISSE 2007),
25-27 May 2007, Vienna, Austria. [ PDF ]

2. Mohammed A. Aabed, Sameh M. Awaideh, Abdul-Rahman M. Elshafei and Adnan A. Gutub,
'Arabic Diacritics Based Steganography,'
2007 IEEE International Conference on Signal Processing and Communications (ICSPC 2007),
24-27 November 2007, Dubai, United Arab Emirates. [ PDF ]

3. Adnan A. Gutub, Yousef S. Elarian, Sameh M. Awaideh and Aleem K. Alvi,
'Arabic Text Staganography Using Multiple Diacritics,'
2008 5th IEEE International Workshop on Signal Processing and Its Applications (WoSPA 2008),
18-20 March 2008, University of Sharjah, Sharjah, United Arab Emirates. [ PDF ]

4. Fahd Al-Haidai, Adnan A. Gutub, Khalid Al-Kahsah and Jameel Hamodi,
'Improving Security and Capacity for Arabic Text Steganography Using Kashida Extensions,'
2009 IEEE/ACS International Conference on Computer Systems and Applications,
10-13 May 2009, Rabat Morocco. [ PDF ]

Saturday, September 12, 2009

News: Gordon Brown: I'm proud to say sorry to a real war hero

Gordon Brown: I'm proud to say sorry to a real war hero
The treatment of code-breaker Alan Turing was utterly unfair, says Gordon Brown

By Gordon Brown
Published: 9:30PM BST 10 Sep 2009
Original Link

This has been a year of deep reflection – a chance for Britain, as a nation, to commemorate the profound debts we owe to those who came before. A unique combination of anniversaries and events have stirred in us that sense of pride and gratitude that characterise the British experience. Earlier this year, I stood with Presidents Sarkozy and Obama to honour the service and the sacrifice of the heroes who stormed the beaches of Normandy 65 years ago. And just last week, we marked the 70 years which have passed since the British government declared its willingness to take up arms against fascism and declared the outbreak of the Second World War.

So I am both pleased and proud that, thanks to a coalition of computer scientists, historians and LGBT (lesbian, gay, bisexual and transgender) activists, we have this year a chance to mark and celebrate another contribution to Britain's fight against the darkness of dictatorship: that of code-breaker Alan Turing.

From StegoRN
Computer pioneer Alan Turing, who helped crack German Enigma codes during WWII

Turing was a quite brilliant mathematician, most famous for his work on breaking the German Enigma codes. It is no exaggeration to say that, without his outstanding contribution, the history of the Second World War could have been very different. He truly was one of those individuals we can point to whose unique contribution helped to turn the tide of war. The debt of gratitude he is owed makes it all the more horrifying, therefore, that he was treated so inhumanely.

In 1952, he was convicted of "gross indecency" – in effect, tried for being gay. His sentence – and he was faced with the miserable choice of this or prison – was chemical castration by a series of injections of female hormones. He took his own life just two years later.

Thousands of people have come together to demand justice for Alan Turing and recognition of the appalling way he was treated. While Turing was dealt with under the law of the time, and we can't put the clock back, his treatment was of course utterly unfair, and I am pleased to have the chance to say how deeply sorry I and we all are for what happened to him. Alan and the many thousands of other gay men who were convicted, as he was convicted, under homophobic laws, were treated terribly. Over the years, millions more lived in fear in conviction. I am proud that those days are gone and that in the past 12 years this Government has done so much to make life fairer and more equal for our LGBT community. This recognition of Alan's status as one of Britain's most famous victims of homophobia is another step towards equality, and long overdue.

But even more than that, Alan deserves recognition for his contribution to humankind. For those of us born after 1945, into a Europe which is united, democratic and at peace, it is hard to imagine that our continent was once the theatre of mankind's darkest hour. It is difficult to believe that in living memory, people could become so consumed by hate – by anti-Semitism, by homophobia, by xenophobia and other murderous prejudices – that the gas chambers and crematoria became a piece of the European landscape as surely as the galleries and universities and concert halls which had marked out the European civilisation for hundreds of years.

It is thanks to men and women who were totally committed to fighting fascism, people like Alan Turing, that the horrors of the Holocaust and of total war are part of Europe's history and not Europe's present. So on behalf of the British government, and all those who live freely thanks to Alan's work, I am very proud to say: we're sorry. You deserved so much better.